bitcoin

mirror of https://github.com/bitcoin/bitcoin.git synced 2026-01-20 15:19:07 +01:00

Author	SHA1	Message	Date
Pieter Wuille	6a8fa821b8	clusterlin: add support for loading existing linearization (feature)	2025-12-18 16:01:22 -05:00
Pieter Wuille	da48ed9f34	clusterlin: ReadLinearization for non-topological (tests) Rather than using an ad-hoc no-dependency copy of the graph when a potentially non-topological linearization is needed in the clusterlin fuzz test, add this directly as a feature in ReadLinearization(). This is preparation for a later commit where another use for such a function is added.	2025-12-18 15:49:07 -05:00
Pieter Wuille	c461259fb6	clusterlin: add class implementing SFL state (preparation) This adds a data structure representing the optimization state for the spanning-forest linearization algorithm (SFL), plus a fuzz test for its correctness. This is preparation for switching over Linearize() to use this algorithm. See https://delvingbitcoin.org/t/spanning-forest-cluster-linearization/1419 for a description of the algorithm.	2025-12-18 15:49:01 -05:00
Pieter Wuille	95bfe7d574	clusterlin: replace benchmarks with SFL-hard ones (bench) This also adds a per-cost variant of each.	2025-12-18 14:17:28 -05:00
Pieter Wuille	86dd550a9b	clusterlin: add known-correct optimal linearization tests (tests)	2025-12-18 14:17:28 -05:00
merge-script	516ae5ede4	Merge bitcoin/bitcoin#31533 : fuzz: Add fuzz target for block index tree and related validation events `db2d39f642` fuzz: add subtest for re-downloading a previously pruned block (Eugene Siegel) `45f5b2dac3` fuzz: Add fuzzer for block index (Martin Zumsande) `c011e3aa54` test: Wrap validation functions with TestChainstateManager (Martin Zumsande) Pull request description: This adds a fuzz target for the block index and various events in validation that interact with it. It can create arbitrary tree-like structure of block indexes, simulating (so far) the following events: - Adding a header - Receiving the full block (may be valid or not) - `ActivateBestChain()` - Reorging the chain to a new chain tip (possibly encountering invalid blocks on the way) - Pruning a block in the best chain - Receiving a previously pruned block again (`getblockfrompeer`) It might be interesting / possible to extend this to more events, such as dealing with more than one chainstate (assumeutxo). The test skips all actual validation of header/ block / transaction data by just simulating the outcome, and also doesn't interact with the data directory. The main goal is to ensure the integrity of the block index tree in all fuzzed constellations, by calling `CheckBlockIndex()` at the end of each iteration. Compared to #29158 this approach has a more limited scope (by skipping all actual validation), but it is fast - it doesn't do a full init sequence on each iteration, but "cleans up" after itself by resetting the global validation state after each iteration. ACKs for top commit: Crypt-iQ: reACK `db2d39f642` maflcko: review ACK `db2d39f642` 🍶 sedited: Re-ACK `db2d39f642` Tree-SHA512: 76cd5f8f4d7d7258620b46d7438bad4508c3bdc98825b48b60f694b5a9838e2b2cf4967c0ead181f86f66f4939ddfe552471851b9d18f84f584c03dd7e09fc43	2025-12-18 15:26:42 +00:00
Lőrinc	85314dc0bf	kernel: revert accidentally removed copyright header See: `7990463b10 (diff-04e685224f1ac5bfd91d47d8d7528a2e44f94fab5535d4b6b5af79b5a13aeb93L1-L12)`	2025-12-18 16:07:31 +01:00
merge-script	80b1b5917d	Merge bitcoin/bitcoin#34088 : log: Use `__func__` for -logsourcelocations `facd3d56cc` log: Use `__func__` for -logsourcelocations (MarcoFalke) Pull request description: The `-logsourcelocations` option was recently changed to print the full function signature, as a side-effect of moving toward `std::source_location` internally. This is fine, but at least for me, it makes debugging functional test failures harder, because the log is just so massively verbose, with questionable benefit. I think the historically used file name, line number, and plain `__func__` name are more than sufficient for `-logsourcelocations`. So switch back to using that. For reference, a verbose log may look like: ``` ... node0 2025-12-17T07:28:37.528146Z [init] [checkqueue.h:147] [CCheckQueue<T, R>::CCheckQueue(unsigned int, int) [with T = CScriptCheck; R = std::pair<ScriptError_t, std::__cxx11::basic_string<char> >]] Script verification uses 1 additional threads ... ``` I don't think there is value in printing stuff, like the (anon) namespace, the class template args, or the function (template) args. The following should be more than sufficient: ``` ... node0 2025-12-17T09:45:57.017122Z [init] [checkqueue.h:147] [CCheckQueue] Script verification uses 1 additional threads ... ``` ACKs for top commit: ajtowns: ACK `facd3d56cc` -- those long signatures are terrible stickies-v: ACK `facd3d56cc` Tree-SHA512: 22fd1f0074fc6e85754967f9219659f57c905005a2bea9176f0b439abec324d7e6c2f875c8951934a3b11ef7e9d7e38d5d5d307e2bd1e000bc27ee85635cd668	2025-12-18 12:17:20 +00:00
merge-script	8d38b6f5f1	Merge bitcoin/bitcoin#34091 : fuzz: doc: remove any mention to `address_deserialize_v2` `caf4843a59` fuzz: doc: remove any mention to address_deserialize_v2 (brunoerg) Pull request description: We don't have `address_deserialize_v2` target anymore since `fac81affb5` (we used to have `address_deserialize_v1_notime`, `address_deserialize_v1_withtime` and `address_deserialize_v2` but now we only have a single `address_deserialize` target) so it removes any mention to it. ACKs for top commit: maflcko: review ACK `caf4843a59` 🎾 marcofleon: ACK `caf4843a59` Tree-SHA512: 539d69edbfe4ca11eb0701ed5c789ad81976e3e85e8a229e39e9dc1b1c72264f01d10a1c16d0a3bb4a354794412dc8b625298f4f72430905a00b65faeaa37d6b	2025-12-18 11:35:41 +00:00
Ryan Ofsky	ab513103df	Merge bitcoin/bitcoin#33192 : refactor: unify container presence checks `d9319b06cf` refactor: unify container presence checks - non-trivial counts (Lőrinc) `039307554e` refactor: unify container presence checks - trivial counts (Lőrinc) `8bb9219b63` refactor: unify container presence checks - find (Lőrinc) Pull request description: ### Summary Instead of counting occurrences in sets and maps, the C++20 `::contains` method expresses the intent unambiguously and can return early on first encounter. ### Context Applied clang‑tidy's [readability‑container‑contains](https://clang.llvm.org/extra/clang-tidy/checks/readability/container-contains.html) check, though many cases required manual changes since tidy couldn't fix them automatically. ### Changes The changes made here were: \| From \| To \| \|------------------------\|------------------\| \| `m.find(k) == m.end()` \| `!m.contains(k)` \| \| `m.find(k) != m.end()` \| `m.contains(k)` \| \| `m.count(k)` \| `m.contains(k)` \| \| `!m.count(k)` \| `!m.contains(k)` \| \| `m.count(k) == 0` \| `!m.contains(k)` \| \| `m.count(k) != 1` \| `!m.contains(k)` \| \| `m.count(k) == 1` \| `m.contains(k)` \| \| `m.count(k) < 1` \| `!m.contains(k)` \| \| `m.count(k) > 0` \| `m.contains(k)` \| \| `m.count(k) != 0` \| `m.contains(k)` \| > Note that `== 1`/`!= 1`/`< 1` only apply to simple [maps](https://en.cppreference.com/w/cpp/container/map/contains)/[sets](https://en.cppreference.com/w/cpp/container/set/contains) and had to be changed manually. There are many other cases that could have been changed, but we've reverted most of those to reduce conflict with other open PRs. 
----- <details> <summary>clang-tidy command on Mac</summary> ```bash rm -rfd build && \ cmake -B build \ -DCMAKE_C_COMPILER="$(brew --prefix llvm)/bin/clang" \ -DCMAKE_CXX_COMPILER="$(brew --prefix llvm)/bin/clang++" \ -DCMAKE_OSX_SYSROOT="$(xcrun --show-sdk-path)" \ -DCMAKE_C_FLAGS="-target arm64-apple-macos11" \ -DCMAKE_CXX_FLAGS="-target arm64-apple-macos11" \ -DCMAKE_EXPORT_COMPILE_COMMANDS=ON -DBUILD_BENCH=ON -DBUILD_FUZZ_BINARY=ON -DBUILD_FOR_FUZZING=ON "$(brew --prefix llvm)/bin/run-clang-tidy" -quiet -p build -j$(nproc) -checks='-*,readability-container-contains' \| grep -v 'clang-tidy' ``` </details> Note: this is a take 2 of https://github.com/bitcoin/bitcoin/pull/33094 with fewer contentious changes. ACKs for top commit: optout21: reACK `d9319b06cf` sedited: ACK `d9319b06cf` janb84: re ACK `d9319b06cf` pablomartin4btc: re-ACK `d9319b06cf` ryanofsky: Code review ACK `d9319b06cf`. I manually reviewed the full change, and it seems there are a lot of positive comments about this and no more very significant conflicts, so I will merge it shortly. Tree-SHA512: e4415221676cfb88413ccc446e5f4369df7a55b6642347277667b973f515c3c8ee5bfa9ee0022479c8de945c89fbc9ff61bd8ba086e70f30298cbc1762610fe1	2025-12-17 16:17:29 -05:00
MarcoFalke	facd3d56cc	log: Use `__func__` for -logsourcelocations	2025-12-17 18:35:49 +01:00
merge-script	e5c600dc0e	Merge bitcoin/bitcoin#34063 : Make `transaction_indentifier` hex string constructor evaluated at comptime `5ac3579520` refactor: Add compile-time-checked hex txid (rustaceanrob) Pull request description: Suggested by l0rinc as a comment in #34004. There are tests that utilize `FromHex` that will only fail during runtime if malformed. Adds a compile time constructor that can be caught by LSPs. ACKs for top commit: l0rinc: ACK `5ac3579520` maflcko: review ACK `5ac3579520` 🦎 rkrux: crACK `5ac3579520` Tree-SHA512: b0bae2bf0b8cd8c9a90765a14c46146313cf8b224a29d58a253e65ca95c4205c0beddea9c49ae58901e72c8c5202b91695d074ffb1c48e448d2e5606eb1bd5b4	2025-12-17 17:16:30 +00:00
Hennadii Stepanov	41f2cc6d3d	Merge bitcoin-core/gui#919 : move-only: MAX_BLOCK_TIME_GAP to src/qt `fa5ed16aa4` move-only: MAX_BLOCK_TIME_GAP to src/qt (MarcoFalke) Pull request description: `MAX_BLOCK_TIME_GAP` was used in some incorrect heuristics, which were removed in commit `e30b6ea194`. This leaves a single module in src/qt using the constant. Instead of exposing it in a central kernel header, just move it to the single gui module that uses it. ACKs for top commit: sedited: ACK `fa5ed16aa4` hebasto: ACK `fa5ed16aa4`, I have reviewed the code and it looks OK. Tree-SHA512: d0e0e5257f6585d793bfed118d61a3e5d56b2be397fa3b09b34db64e3e018eba9f223cd56541d258b422119fdd7501f07cd3bb8ad5dc28b535922aa21ea76fa6	2025-12-17 17:11:59 +00:00
merge-script	a005fdff6c	Merge bitcoin/bitcoin#34074 : A few followups after introducing `/rest/blockpart/` endpoint `59b93f11e8` rest: print also HTTP response reason in case of an error (Roman Zeyde) `7fe94a0493` rest: add a test for unsuported `/blockpart/` request type (Roman Zeyde) `55d0d19b5c` rest: deduplicate `interface_rest.py` negative tests (Roman Zeyde) `89eb531024` rest: update release notes for `/blockpart/` endpoint (Roman Zeyde) `41118e17f8` blockstorage: simplify partial block read validation (Roman Zeyde) `599effdeab` rest: reformat `uri_prefixes` initializer list (Roman Zeyde) Pull request description: The commits below should resolve a few leftovers from #33657. ACKs for top commit: l0rinc: ACK `59b93f11e8` hodlinator: re-ACK `59b93f11e8` Tree-SHA512: ae45e08edd315018e11283b354fb32f9658f5829c956554dc662a81c2e16397def7c3700e6354e0a91ff03c850def35638a69ec2668b7c015d25d6fed42b92bb	2025-12-17 15:09:15 +00:00
brunoerg	caf4843a59	fuzz: doc: remove any mention to address_deserialize_v2	2025-12-17 11:57:11 -03:00
MarcoFalke	fa5ed16aa4	move-only: MAX_BLOCK_TIME_GAP to src/qt Can be reviewed via the git option: --color-moved=dimmed-zebra	2025-12-17 13:55:03 +01:00
Eugene Siegel	db2d39f642	fuzz: add subtest for re-downloading a previously pruned block This imitates the use of the getblockfrompeer rpc. Note that currently pruning is limited to blocks in the active chain. Co-authored-by: Martin Zumsande <mzumsande@gmail.com>	2025-12-16 11:25:46 -05:00
Martin Zumsande	45f5b2dac3	fuzz: Add fuzzer for block index This fuzz target creates arbitrary tree-like structure of indices, simulating the following events: - Adding a header to the block tree db - Receiving the full block (may be valid or not) - Reorging to a new chain tip (possibly encountering invalid blocks on the way) - pruning The test skips all actual validation of header/ block / transaction data by just simulating the outcome, and also doesn't interact with the data directory. The main goal is to test the integrity of the block index tree in all fuzzed constellations, by calling CheckBlockIndex() at the end of each iteration.	2025-12-16 11:25:46 -05:00
Martin Zumsande	c011e3aa54	test: Wrap validation functions with TestChainstateManager This allows accessing them in the fuzz test in the next commit without making them public. Co-authored-by: TheCharlatan <seb.kung@gmail.com>	2025-12-16 11:25:46 -05:00
merge-script	13891a8a68	Merge bitcoin/bitcoin#34050 : fuzz: exercise `ComputeMerkleRoot` without `mutated` parameter `7e9de20c0c` fuzz: exercise `ComputeMerkleRoot` without mutated parameter (Lőrinc) Pull request description: The `mutated` parameter in `ComputeMerkleRoot` unlocks a different path that was always exercised in the fuzz test. Adjusted the fuzzer to pass `nullptr` as well to make sure that path is also tested: `24ed820d4f/src/consensus/merkle.cpp (L49-L53)` Follow-up to https://github.com/bitcoin/bitcoin/pull/33805#discussion_r2589073735 ACKs for top commit: frankomosh: ACK [`7e9de20`](`7e9de20c0c`) hodlinator: ACK `7e9de20c0c` sedited: ACK `7e9de20c0c` Tree-SHA512: bf27029ac04003447b24a95544ec863f9ceca6c28d51ea811dd6ca2b412a2a780bb9fdbcdc82719f39dd710a746eb2446263e8377d67a8be52a1694571d03498	2025-12-16 14:25:55 +00:00
merge-script	4f11ef058b	Merge bitcoin/bitcoin#30214 : refactor: Improve assumeutxo state representation `82be652e40` doc: Improve ChainstateManager documentation, use consistent terms (Ryan Ofsky) `af455dcb39` refactor: Simplify pruning functions (TheCharlatan) `ae85c495f1` refactor: Delete ChainstateManager::GetAll() method (Ryan Ofsky) `6a572dbda9` refactor: Add ChainstateManager::ActivateBestChains() method (Ryan Ofsky) `491d827d52` refactor: Add ChainstateManager::m_chainstates member (Ryan Ofsky) `e514fe6116` refactor: Delete ChainstateManager::SnapshotBlockhash() method (Ryan Ofsky) `ee35250683` refactor: Delete ChainstateManager::IsSnapshotValidated() method (Ryan Ofsky) `d9e82299fc` refactor: Delete ChainstateManager::IsSnapshotActive() method (Ryan Ofsky) `4dfe383912` refactor: Convert ChainstateRole enum to struct (Ryan Ofsky) `352ad27fc1` refactor: Add ChainstateManager::ValidatedChainstate() method (Ryan Ofsky) `a229cb9477` refactor: Add ChainstateManager::CurrentChainstate() method (Ryan Ofsky) `a9b7f5614c` refactor: Add Chainstate::StoragePath() method (Ryan Ofsky) `840bd2ef23` refactor: Pass chainstate parameters to MaybeCompleteSnapshotValidation (Ryan Ofsky) `1598a15aed` refactor: Deduplicate Chainstate activation code (Ryan Ofsky) `9fe927b6d6` refactor: Add Chainstate m_assumeutxo and m_target_utxohash members (Ryan Ofsky) `6082c84713` refactor: Add Chainstate::m_target_blockhash member (Ryan Ofsky) `de00e87548` test: Fix broken chainstatemanager_snapshot_init check (Ryan Ofsky) Pull request description: This PR contains the first part of #28608, which tries to make assumeutxo code more maintainable, and improve it by not locking `cs_main` for a long time when the snapshot block is connected, and by deleting the snapshot validation chainstate when it is no longer used, instead of waiting until the next restart. The changes in this PR are just refactoring. 
They make `Chainstate` objects self-contained, so for example, it is possible to determine what blocks to connect to a chainstate without querying `ChainstateManager`, and to determine whether a Chainstate is validated without basing it on inferences like `&cs != &ActiveChainstate()` or `GetAll().size() == 1`. The PR also tries to make assumeutxo terminology less confusing, using "current chainstate" to refer to the chainstate targeting the current network tip, and "historical chainstate" to refer to the chainstate downloading old blocks and validating the assumeutxo snapshot. It removes uses of the terms "active chainstate," "usable chainstate," "disabled chainstate," "ibd chainstate," and "snapshot chainstate" which are confusing for various reasons. ACKs for top commit: maflcko: re-review ACK `82be652e40` 🕍 fjahr: re-ACK `82be652e40` sedited: Re-ACK `82be652e40` Tree-SHA512: 81c67abba9fc5bb170e32b7bf8a1e4f7b5592315b4ef720be916d5f1f5a7088c0c59cfb697744dd385552f58aa31ee36176bae6a6e465723e65861089a1252e5	2025-12-16 14:03:34 +00:00
Hennadii Stepanov	41bf8f2d5e	Merge bitcoin-core/gui#877 : Add a menu action to restore then migrate a legacy wallet `14371fd1fc` gui: Add a menu item to restore then migrate a wallet file (Ava Chow) `f11a7d248c` gui: Add restore_and_migrate function to restore then migrate a wallet (Ava Chow) `16ab6dfc10` gui: Move actual migration part of migrate() to its own function (Ava Chow) `4ec2d18a07` wallet, interfaces, gui: Expose load_after_restore parameter (Ava Chow) Pull request description: Some users will have a backup of their legacy wallet. These cannot be restored since the "Restore Wallet" action expects to be able to load the wallet after restoring, and this fails for legacy wallets now that they are deleted. Furthermore, the "Migrate Wallet" action only allows users to migrate wallets that are in the wallets directory, so such backups cannot be migrated from the GUI. This PR resolves this issue by adding a menu item in the "Migrate Wallet" menu which allows users to select their backup file so that it will first be restored but not loaded, and then migrated. Depends on https://github.com/bitcoin/bitcoin/pull/32620 ACKs for top commit: hebasto: ACK `14371fd1fc`. Tree-SHA512: 2b09c012f4c70d0cb283305bf3d1a18ae5a2bfb80977c91544ac1fbc29d6360df49438cfdc8f66661ddb42ddab728c8ef1f9e0d7031877fbd846f9cea957398e	2025-12-15 15:42:40 +00:00
merge-script	2210feb446	Merge bitcoin/bitcoin#34051 : log: Remove brittle and confusing LogPrintLevel `fa8a5d215c` log: Remove brittle and confusing LogPrintLevel (MarcoFalke) `fac24bbec8` test: Clarify logging_SeverityLevels test (MarcoFalke) `f273167661` ipc: separate log statements per level (stickies-v) `94c51ae540` libevent: separate log statements per level (stickies-v) Pull request description: `LogPrintLevel` has many issues: * It encourages logging several levels in one source location. This is problematic, because all levels (even warnings and errors) will be rate limited equally for the same location. * Its warning and error logs are specially formatted compared to all other warning and error logs in the codebase, making them harder to spot (both in the debug log and in the code). * It is verbose to type and read. * It is confusing, because the majority of code uses the `Log$LEVEL(...)` macros. Having fewer ways to achieve the same makes the code more consistent and easier to review. Fix all issues by removing it. ACKs for top commit: stickies-v: re-ACK `fa8a5d215c` ajtowns: ACK `fa8a5d215c` pablomartin4btc: re-ACK `fa8a5d215c` Tree-SHA512: 9fbb04962d9c26e566338694a7725b3c0e88ef733322d890bcc6aeddb45266c754e7c885c69bbfebd1588cc09912c6784cfc00e69882f1271a8c87d201490478	2025-12-14 12:30:48 +00:00
Roman Zeyde	41118e17f8	blockstorage: simplify partial block read validation Use `SaturatingAdd` following https://github.com/bitcoin/bitcoin/pull/33657#discussion_r2610832092.	2025-12-14 10:44:12 +01:00
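Why a saturating addition simplifies a partial-read bounds check can be sketched as below. Both functions are hypothetical illustrations written for this example; they are not the project's `SaturatingAdd` or its block storage code, whose actual signatures are not shown here.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>

// Hypothetical helper: unsigned addition that clamps at the maximum instead of wrapping.
constexpr std::size_t SaturatingAddSketch(std::size_t a, std::size_t b)
{
    constexpr std::size_t max = std::numeric_limits<std::size_t>::max();
    return a > max - b ? max : a + b;
}

// Hypothetical range check: does [offset, offset + size) lie inside a block of block_size bytes?
// Assumes block_size is well below SIZE_MAX, so a saturated sum always fails the comparison.
constexpr bool PartFitsInBlock(std::size_t offset, std::size_t size, std::size_t block_size)
{
    return SaturatingAddSketch(offset, size) <= block_size;
}
```

With plain addition, a huge `offset` could wrap around and pass the comparison; the saturating form needs only one comparison and has no separate overflow branch.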
Roman Zeyde	599effdeab	rest: reformat `uri_prefixes` initializer list There was an extra indentation level (found during #33657): ``` $ git show -U0 `07135290c1` \| ./contrib/devtools/clang-format-diff.py -p1 -i -v ```	2025-12-14 10:04:37 +01:00
rustaceanrob	5ac3579520	refactor: Add compile-time-checked hex txid Suggested by @l0rinc in #34004 Message by @l0rinc: This adds a consteval constructor to transaction_identifier (Txid/Wtxid) to allow parsing hex strings at compile-time. This replaces runtime FromHex checks in tests, ensuring that malformed hardcoded hashes cause build failures rather than runtime test failures. Test variables are explicitly marked constexpr. This is required to workaround a regression in GCC 14 (Bug 117501) where the compiler incorrectly flags consteval initialization of non-constexpr variables as "statements with no effect". GCC Bug: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117501 Reproducer: https://godbolt.org/z/xb5TMaPs6 Co-authored-by: l0rinc <pap.lorinc@gmail.com>	2025-12-13 18:23:48 +00:00
MarcoFalke	fa8a5d215c	log: Remove brittle and confusing LogPrintLevel	2025-12-13 13:43:24 +01:00
MarcoFalke	fac24bbec8	test: Clarify logging_SeverityLevels test The test was a bit confusing, because it just referred to the "global log level" without explicitly specifying what it is. The level is set through the LogSetup constructor. However, it is easier to follow unit tests if they are self-contained. So just set the level to Debug explicitly here. Also, add a new debug_3 log, to further document the intended behavior of the unit test. Also, replace the LogPrintLevel with the shorter and exact replacements LogTrace and LogDebug.	2025-12-13 12:50:12 +01:00
stickies-v	f273167661	ipc: separate log statements per level Avoids ratelimiting unconditional log statements when debug logging is enabled. Introduces slight behaviour change by removing the category from unconditional logs, making them more uniform with the other unconditional logs in the codebase. Also, in a slight behavior change, prefix the info-level (and higher) messages with "ipc:".	2025-12-13 12:49:39 +01:00
stickies-v	94c51ae540	libevent: separate log statements per level Avoids ratelimiting unconditional log statements when debug logging is enabled. Introduces slight behaviour change by removing the category from unconditional logs, making them more uniform with the other unconditional logs in the codebase. Also, in a slight behavior change, prefix the info-level (and higher) messages with "libevent:".	2025-12-13 12:48:27 +01:00
marcofleon	a70a14a3f4	refactor: Separate out logic for building a tree-shaped dependency graph	2025-12-12 16:09:53 +01:00
marcofleon	ce29d7d626	fuzz: Fix variable in `clusterlin_postlinearize_tree` check The test intends to verify that running `PostLinearize` a second time on a tree-structured graph doesn't change the result. But `PostLinearize` was being called on the original variable, not the copy. So the check was comparing the unmodified copy against itself, which is useless. Fix by post-linearizing the correct variable.	2025-12-12 15:04:10 +00:00
marcofleon	876e2849b4	fuzz: Fix incorrect loop bounds in `clusterlin_postlinearize_tree` The dependency graphs generated by this test can have holes (unused indices) in them. This means some of the transactions were skipped when using `depgraph_gen.TxCount()` as the upper bound of the loop. Switch to using `depgraph.Positions()` to correctly handle sparse graphs.	2025-12-12 15:02:26 +00:00
merge-script	938d7aacab	Merge bitcoin/bitcoin#33657 : rest: allow reading partial block data from storage `07135290c1` rest: allow reading partial block data from storage (Roman Zeyde) `4e2af1c065` blockstorage: allow reading partial block data from storage (Roman Zeyde) `f2fd1aa21c` blockstorage: return an error code from `ReadRawBlock()` (Roman Zeyde) Pull request description: It allows fetching specific transactions using an external index, following https://github.com/bitcoin/bitcoin/pull/32541#issuecomment-3267485313. Currently, electrs and other indexers map between an address/scripthash to the list of the relevant transactions. However, in order to fetch those transactions from bitcoind, electrs relies on reading the whole block and post-filtering for a specific transaction[^1]. Other indexers use a `txindex` to fetch a transaction using its txid [^2][^3][^4]. The above approach has significant storage and CPU overhead, since the `txid` is a pseudo-random 32-byte value. Also, mainnet `txindex` takes ~60GB today. This PR is adding support for using the transaction's position within its block to be able to fetch it directly using [REST API](https://github.com/bitcoin/bitcoin/blob/master/doc/REST-interface.md), using the following HTTP request: ``` GET /rest/blockpart/BLOCKHASH.bin?offset=OFFSET&size=SIZE ``` - The offsets' index can be encoded much more efficiently ([~1.3GB today](https://github.com/romanz/bindex-rs/pull/66#issuecomment-3508476436)). - Address history query performance can be tested on mainnet using [1BitcoinEaterAddressDontSendf59kuE](https://mempool.space/address/1BitcoinEaterAddressDontSendf59kuE) - assuming warm OS block cache, [it takes <1s to fetch 5200 txs, i.e. <0.2ms per tx](https://github.com/romanz/bindex-rs/pull/66#issuecomment-3508476436) with [bindex](https://github.com/romanz/bindex-rs). - Only binary and hex response formats are supported. 
[^1]: https://github.com/romanz/electrs/blob/master/doc/schema.md [^2]: https://github.com/Blockstream/electrs/blob/new-index/doc/schema.md#txstore [^3]: https://github.com/spesmilo/electrumx/blob/master/docs/HOWTO.rst#prerequisites [^4]: https://github.com/cculianu/Fulcrum/blob/master/README.md#requirements ACKs for top commit: maflcko: review ACK `07135290c1` 🏪 l0rinc: ACK `07135290c1` hodlinator: re-ACK `07135290c1` Tree-SHA512: bcce7bf4b9a3e5e920ab5a83e656f50d5d7840cdde6b7147d329cf578f8a2db555fc1aa5334e8ee64d5630d25839ece77a2cf421c6c3ac1fa379bb453163bd4f	2025-12-12 13:22:00 +00:00
Ryan Ofsky	82be652e40	doc: Improve ChainstateManager documentation, use consistent terms	2025-12-12 06:49:59 -04:00
merge-script	597b8be223	Merge bitcoin/bitcoin#34025 : net: Waste less time in socket handling `5f5c1ea019` net: Cache -capturemessages setting (Anthony Towns) `cea443e246` net: Pass time to InactivityChecks fuctions (Anthony Towns) Pull request description: Cuts out some wasted time in net socket handling. First, only calculates the current time once every 50ms, rather than once for each peer, which given we only care about second-level precision seems more than adequate. Second, caches the value of the `-capturemessages` setting in `CConnman` rather than re-evaluating it every time we invoke `PushMessage`. ACKs for top commit: maflcko: review ACK `5f5c1ea019` 🏣 vasild: ACK `5f5c1ea019` sedited: ACK `5f5c1ea019` mzumsande: ACK `5f5c1ea019` Tree-SHA512: 0194143a3a4481c6355ac9eab27ce6ae4bed5db1d483ba5d06288dd92f195ccb9f0f055a9eb9d7e16e9bbf72f145eca1ff17c6700ee9aa42730103a8f047b32c	2025-12-12 10:49:59 +00:00
TheCharlatan	af455dcb39	refactor: Simplify pruning functions Move GetPruneRange from ChainstateManager to Chainstate.	2025-12-12 11:49:59 +01:00
Ryan Ofsky	ae85c495f1	refactor: Delete ChainstateManager::GetAll() method Just use m_chainstates array instead.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	6a572dbda9	refactor: Add ChainstateManager::ActivateBestChains() method Deduplicate code looping over chainstate objects and calling ActivateBestChain() and avoid need for code outside ChainstateManager to use the GetAll() method.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	491d827d52	refactor: Add ChainstateManager::m_chainstates member Use to replace m_active_chainstate, m_ibd_chainstate, and m_snapshot_chainstate members. This has several benefits: - Ensures ChainstateManager treats chainstate instances equally, making distinctions based on their attributes, not having special cases and making assumptions based on their identities. - Normalizes ChainstateManager representation so states that should be impossible to reach and validation code has no handling for (like m_snapshot_chainstate being set and m_ibd_chainstate being unset, or both being set but m_active_chainstate pointing to the m_ibd_chainstate) can no longer be represented. - Makes ChainstateManager more extensible so new chainstates can be added for different purposes, like indexing or generating and validating assumeutxo snapshots without interrupting regular node operations. With the m_chainstates member, new chainstates can be added and handled without needing to make changes all over validation code or to copy/paste/modify the existing code that has already been written to handle m_ibd_chainstate and m_snapshot_chainstate. - Avoids terms that are confusing and misleading: - The term "active chainstate" is confusing because multiple chainstates will be active and in use at the same time. Before a snapshot is validated, wallet code will use the snapshot chainstate, while indexes will use the IBD chainstate, and networking code will use both chainstates, downloading snapshot blocks at higher priority, but also IBD blocks simultaneously. - The term "snapshot chainstate" is ambiguous because it could refer either to the chainstate originally loaded from a snapshot, or to the chainstate being used to validate a snapshot that was loaded, or to a chainstate being used to produce a snapshot, but it is arbitrarily used to refer to the first thing. 
The terms "most-work chainstate" or "assumed-valid chainstate" should be less ambiguous ways to refer to chainstates loaded from snapshots. - The term "IBD chainstate" is not just ambiguous but actively confusing because technically IBD ends and the node is considered synced when the snapshot chainstate finishes syncing, so in practice the IBD chainstate will mostly be synced after IBD is complete. The term "fully-validated" is a better way of describing the characteristics and purpose of this chainstate.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	e514fe6116	refactor: Delete ChainstateManager::SnapshotBlockhash() method SnapshotBlockhash() is only called two places outside of tests, and is used redundantly in some tests, checking the same field as other checks. Simplify by dropping the method and using the m_from_snapshot_blockhash field directly.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	ee35250683	refactor: Delete ChainstateManager::IsSnapshotValidated() method IsSnapshotValidated() is only called in one place outside of tests, and is used redundantly in some tests, asserting that a snapshot is not validated when a snapshot chainstate does not even exist. Simplify by dropping the method and checking Chainstate m_assumeutxo field directly.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	d9e82299fc	refactor: Delete ChainstateManager::IsSnapshotActive() method IsSnapshotActive() method is only called one place outside of tests and asserts, and is confusing because it returns true even after the snapshot is fully validated. The documentation which said this "implies that a background validation chainstate is also in use" is also incorrect, because after the snapshot is validated, the background chainstate gets disabled and IsUsable() would return false.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	4dfe383912	refactor: Convert ChainstateRole enum to struct Change ChainstateRole parameter passed to wallets and indexes. Wallets and indexes need to know whether chainstate is historical and whether it is fully validated. They should not be aware of the assumeutxo snapshot validation process.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	352ad27fc1	refactor: Add ChainstateManager::ValidatedChainstate() method ValidatedChainstate() accessor replaces GetChainstateForIndexing() with no change in behavior.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	a229cb9477	refactor: Add ChainstateManager::CurrentChainstate() method CurrentChainstate() is basically the same as ActiveChainstate() except it requires cs_main to be locked when it is called, instead of locking cs_main internally. The name "current" should also be less confusing than "active" because multiple chainstates can be active, and CurrentChainstate() returns the chainstate targeting the current network tip, regardless of what chainstates are being downloaded or how they are used.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	a9b7f5614c	refactor: Add Chainstate::StoragePath() method Use to simplify code determining the chainstate leveldb paths. The new method is now the only code that needs to figure out the storage path, so the path doesn't need to be constructed in multiple places and backed out of leveldb.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	840bd2ef23	refactor: Pass chainstate parameters to MaybeCompleteSnapshotValidation Remove hardcoded references to m_ibd_chainstate and m_snapshot_chainstate so MaybeCompleteSnapshotValidation function can be simpler and focus on validating the snapshot without dealing with internal ChainstateManager states. This is a step towards being able to validate the snapshot outside of ActivateBestChain loop so cs_main is not locked for minutes when the snapshot block is connected.	2025-12-12 06:49:59 -04:00
Ryan Ofsky	1598a15aed	refactor: Deduplicate Chainstate activation code Move duplicate code from ChainstateManager::ActivateSnapshot and ChainstateManager::ActivateExistingSnapshot methods to a new ChainstateManager::AddChainstate method. The "AddChainstate" method name doesn't mention snapshots even though it is only used to add snapshot chainstates now, because it becomes more generalized in a later commit in this PR ("refactor: Add ChainstateManager::m_chainstates member")	2025-12-12 06:49:59 -04:00
Ryan Ofsky	9fe927b6d6	refactor: Add Chainstate m_assumeutxo and m_target_utxohash members Get rid of m_disabled/IsUsable members. Instead of marking chains disabled for different reasons, store chainstate assumeutxo status explicitly and use that information to determine how chains should be treated.	2025-12-12 06:49:59 -04:00


29613 Commits