mirror of
https://github.com/bitcoin/bitcoin.git
synced 2025-08-28 02:31:34 +02:00
Merge bitcoin/bitcoin#26593: tracing: Only prepare tracepoint arguments when actually tracing
0de3e96e33
tracing: use bitcoind pid in bcc tracing examples (0xb10c)411c6cfc6c
tracing: only prepare tracepoint args if attached (0xb10c)d524c1ec06
tracing: dedup TRACE macros & rename to TRACEPOINT (0xb10c) Pull request description: Currently, if the tracepoints are compiled (e.g. in depends and release builds), we always prepare the tracepoint arguments regardless of the tracepoints being used or not. We made sure that the argument preparation is as cheap as possible, but we can almost completely eliminate any overhead for users not interested in the tracepoints (the vast majority), by gating the tracepoint argument preparation with an `if(something is attached to this tracepoint)`. To achieve this, we use the optional semaphore feature provided by SystemTap. The first commit simplifies and deduplicates our tracepoint macros from 13 TRACEx macros to a single TRACEPOINT macro. This makes them easier to use and also avoids more duplicate macro definitions in the second commit. The Linux tracing tools I'm aware of (bcc, bpftrace, libbpf, and systemtap) all support the semaphore gating feature. Thus, all existing tracepoints and their argument preparation is gated in the second commit. For details, please refer to the commit messages and the updated documentation in `doc/tracing.md`. Also adding unit tests that include all tracepoint macros to make sure there are no compiler problems with them (e.g. some varadiac extension not supported). Reviewers might want to check: - Do the tracepoints still work for you? Do the examples in `contrib/tracing/` run on your system (as bpftrace frequently breaks on every new version, please test master too if it should't work for you)? Do the CI interface tests still pass? - Is the new documentation clear? - The `TRACEPOINT_SEMAPHORE(event, context)` macros places global variables in our global namespace. Is this something we strictly want to avoid or maybe move to all `TRACEPOINT_SEMAPHORE`s to a separate .cpp file or even namespace? I like having the `TRACEPOINT_SEMAPHORE()` in same file as the `TRACEPOINT()`, but open for suggestion on alternative approaches. - Are newly added tracepoints in the unit tests visible when using `readelf -n build/src/test/test_bitcoin`? You can run the new unit tests with `./build/src/test/test_bitcoin --run_test=util_trace_tests* --log_level=all`. <details><summary>Two of the added unit tests demonstrate that we are only processing the tracepoint arguments when attached by having a test-failure condition in the tracepoint argument preparation. The following bpftrace script can be used to demonstrate that the tests do indeed fail when attached to the tracepoints.</summary> `fail_tests.bt`: ```c #!/usr/bin/env bpftrace usdt:./build/src/test/test_bitcoin:test:check_if_attached { printf("the 'check_if_attached' test should have failed\n"); } usdt:./build/src/test/test_bitcoin:test:expensive_section { printf("the 'expensive_section' test should have failed\n"); } ``` Run the unit tests with `./build/src/test/test_bitcoin` and start `bpftrace fail_tests.bt -p $(pidof test_bitcoin)` in a separate terminal. The unit tests should fail with: ``` Running 594 test cases... test/util_trace_tests.cpp(31): error: in "util_trace_tests/test_tracepoint_check_if_attached": check false has failed test/util_trace_tests.cpp(51): error: in "util_trace_tests/test_tracepoint_manual_tracepoint_active_check": check false has failed *** 2 failures are detected in the test module "Bitcoin Core Test Suite" ``` </details> These links might provide more contextual information for reviewers: - [How SystemTap Userspace Probes Work by eklitzke](https://eklitzke.org/how-sytemtap-userspace-probes-work) (actually an example on Bitcoin Core; mentions that with semaphores "the overhead for an untraced process is effectively zero.") - [libbpf comment on USDT semaphore handling](1596a09b5d/src/usdt.c (L83-L92)
) (can recommend the whole comment for background on how the tracepoints and tracing tools work together) - https://sourceware.org/systemtap/wiki/UserSpaceProbeImplementation#Semaphore_Handling ACKs for top commit: willcl-ark: utACK0de3e96e33
laanwj: re-ACK0de3e96e33
jb55: utACK0de3e96e33
vasild: ACK0de3e96e33
Tree-SHA512: 0e5e0dc5e0353beaf5c446e4be03d447e64228b1be71ee9972fde1d6fac3fac71a9d73c6ce4fa68975f87db2b2bf6eee2009921a2a145e24d83a475d007a559b
This commit is contained in:
@@ -265,42 +265,52 @@ Arguments passed:
|
||||
|
||||
## Adding tracepoints to Bitcoin Core
|
||||
|
||||
To add a new tracepoint, `#include <util/trace.h>` in the compilation unit where
|
||||
the tracepoint is inserted. Use one of the `TRACEx` macros listed below
|
||||
depending on the number of arguments passed to the tracepoint. Up to 12
|
||||
arguments can be provided. The `context` and `event` specify the names by which
|
||||
the tracepoint is referred to. Please use `snake_case` and try to make sure that
|
||||
the tracepoint names make sense even without detailed knowledge of the
|
||||
implementation details. Do not forget to update the tracepoint list in this
|
||||
document.
|
||||
Use the `TRACEPOINT` macro to add a new tracepoint. If not yet included, include
|
||||
`util/trace.h` (defines the tracepoint macros) with `#include <util/trace.h>`.
|
||||
Each tracepoint needs a `context` and an `event`. Please use `snake_case` and
|
||||
try to make sure that the tracepoint names make sense even without detailed
|
||||
knowledge of the implementation details. You can pass zero to twelve arguments
|
||||
to the tracepoint. Each tracepoint also needs a global semaphore. The semaphore
|
||||
gates the tracepoint arguments from being processed if we are not attached to
|
||||
the tracepoint. Add a `TRACEPOINT_SEMAPHORE(context, event)` with the `context`
|
||||
and `event` of your tracepoint in the top-level namespace at the beginning of
|
||||
the file. Do not forget to update the tracepoint list in this document.
|
||||
|
||||
```c
|
||||
#define TRACE(context, event)
|
||||
#define TRACE1(context, event, a)
|
||||
#define TRACE2(context, event, a, b)
|
||||
#define TRACE3(context, event, a, b, c)
|
||||
#define TRACE4(context, event, a, b, c, d)
|
||||
#define TRACE5(context, event, a, b, c, d, e)
|
||||
#define TRACE6(context, event, a, b, c, d, e, f)
|
||||
#define TRACE7(context, event, a, b, c, d, e, f, g)
|
||||
#define TRACE8(context, event, a, b, c, d, e, f, g, h)
|
||||
#define TRACE9(context, event, a, b, c, d, e, f, g, h, i)
|
||||
#define TRACE10(context, event, a, b, c, d, e, f, g, h, i, j)
|
||||
#define TRACE11(context, event, a, b, c, d, e, f, g, h, i, j, k)
|
||||
#define TRACE12(context, event, a, b, c, d, e, f, g, h, i, j, k, l)
|
||||
```
|
||||
|
||||
For example:
|
||||
For example, the `net:outbound_message` tracepoint in `src/net.cpp` with six
|
||||
arguments.
|
||||
|
||||
```C++
|
||||
TRACE6(net, inbound_message,
|
||||
pnode->GetId(),
|
||||
pnode->m_addr_name.c_str(),
|
||||
pnode->ConnectionTypeAsString().c_str(),
|
||||
sanitizedType.c_str(),
|
||||
msg.data.size(),
|
||||
msg.data.data()
|
||||
);
|
||||
// src/net.cpp
|
||||
TRACEPOINT_SEMAPHORE(net, outbound_message);
|
||||
…
|
||||
void CConnman::PushMessage(…) {
|
||||
…
|
||||
TRACEPOINT(net, outbound_message,
|
||||
pnode->GetId(),
|
||||
pnode->m_addr_name.c_str(),
|
||||
pnode->ConnectionTypeAsString().c_str(),
|
||||
sanitizedType.c_str(),
|
||||
msg.data.size(),
|
||||
msg.data.data()
|
||||
);
|
||||
…
|
||||
}
|
||||
```
|
||||
If needed, an extra `if (TRACEPOINT_ACTIVE(context, event)) {...}` check can be
|
||||
used to prepare somewhat expensive arguments right before the tracepoint. While
|
||||
the tracepoint arguments are only prepared when we attach something to the
|
||||
tracepoint, an argument preparation should never hang the process. Hashing and
|
||||
serialization of data structures is probably fine, a `sleep(10s)` not.
|
||||
|
||||
```C++
|
||||
// An example tracepoint with an expensive argument.
|
||||
|
||||
TRACEPOINT_SEMAPHORE(example, gated_expensive_argument);
|
||||
…
|
||||
if (TRACEPOINT_ACTIVE(example, gated_expensive_argument)) {
|
||||
expensive_argument = expensive_calulation();
|
||||
TRACEPOINT(example, gated_expensive_argument, expensive_argument);
|
||||
}
|
||||
```
|
||||
|
||||
### Guidelines and best practices
|
||||
@@ -318,12 +328,6 @@ the tracepoint. See existing examples in [contrib/tracing/].
|
||||
|
||||
[contrib/tracing/]: ../contrib/tracing/
|
||||
|
||||
#### No expensive computations for tracepoints
|
||||
Data passed to the tracepoint should be inexpensive to compute. Although the
|
||||
tracepoint itself only has overhead when enabled, the code to compute arguments
|
||||
is always run - even if the tracepoint is not used. For example, avoid
|
||||
serialization and parsing.
|
||||
|
||||
#### Semi-stable API
|
||||
Tracepoints should have a semi-stable API. Users should be able to rely on the
|
||||
tracepoints for scripting. This means tracepoints need to be documented, and the
|
||||
@@ -347,7 +351,7 @@ first six argument fields. Binary data can be placed in later arguments. The BCC
|
||||
supports reading from all 12 arguments.
|
||||
|
||||
#### Strings as C-style String
|
||||
Generally, strings should be passed into the `TRACEx` macros as pointers to
|
||||
Generally, strings should be passed into the `TRACEPOINT` macros as pointers to
|
||||
C-style strings (a null-terminated sequence of characters). For C++
|
||||
`std::strings`, [`c_str()`] can be used. It's recommended to document the
|
||||
maximum expected string size if known.
|
||||
@@ -370,9 +374,9 @@ $ gdb ./build/src/bitcoind
|
||||
…
|
||||
(gdb) info probes
|
||||
Type Provider Name Where Semaphore Object
|
||||
stap net inbound_message 0x000000000014419e /build/src/bitcoind
|
||||
stap net outbound_message 0x0000000000107c05 /build/src/bitcoind
|
||||
stap validation block_connected 0x00000000002fb10c /build/src/bitcoind
|
||||
stap net inbound_message 0x000000000014419e 0x0000000000d29bd2 /build/src/bitcoind
|
||||
stap net outbound_message 0x0000000000107c05 0x0000000000d29bd0 /build/src/bitcoind
|
||||
stap validation block_connected 0x00000000002fb10c 0x0000000000d29bd8 /build/src/bitcoind
|
||||
…
|
||||
```
|
||||
|
||||
@@ -388,7 +392,7 @@ Displaying notes found in: .note.stapsdt
|
||||
stapsdt 0x0000005d NT_STAPSDT (SystemTap probe descriptors)
|
||||
Provider: net
|
||||
Name: outbound_message
|
||||
Location: 0x0000000000107c05, Base: 0x0000000000579c90, Semaphore: 0x0000000000000000
|
||||
Location: 0x0000000000107c05, Base: 0x0000000000579c90, Semaphore: 0x0000000000d29bd0
|
||||
Arguments: -8@%r12 8@%rbx 8@%rdi 8@192(%rsp) 8@%rax 8@%rdx
|
||||
…
|
||||
```
|
||||
@@ -407,7 +411,7 @@ between distributions. For example, on
|
||||
|
||||
```
|
||||
$ tplist -l ./build/src/bitcoind -v
|
||||
b'net':b'outbound_message' [sema 0x0]
|
||||
b'net':b'outbound_message' [sema 0xd29bd0]
|
||||
1 location(s)
|
||||
6 argument(s)
|
||||
…
|
||||
|
Reference in New Issue
Block a user