bitcoin

mirror of https://github.com/bitcoin/bitcoin.git synced 2026-03-19 12:10:19 +01:00

Author	SHA1	Message	Date
glozow	a2878cfb4a	[unit test] strengthen GetChildrenFromSamePeer tests: results are in recency order	2025-07-14 16:13:47 -04:00
glozow	7ce3b7ee57	[unit test] basic TxOrphanage eviction and protection	2025-07-14 16:13:47 -04:00
glozow	4d23d1d7e7	[cleanup] remove unused rng param from LimitOrphans	2025-07-14 16:13:47 -04:00
glozow	067365d2a8	[p2p] overhaul TxOrphanage with smarter limits This is largely a reimplementation using boost::multi_index_container. All the same public methods are available. It has an index by outpoint, per-peer tracking, peer worksets, etc. A few differences: - Limits have changed: instead of a global limit of 100 unique orphans, we have a maximum number of announcements (which can include duplicate orphans) and a global memory limit which scales with the number of peers. - The maximum announcements limit is 100 to match the original limit, but this is actually a stricter limit because the announcement count is not de-duplicated. - Eviction strategy: when global limits are reached, a per-peer limit comes into play. While limits are exceeded, we choose the peer whose “DoS score” (max usage / limit ratio for announcements and memory limits) is highest and evict announcements by entry time, sorting non-reconsiderable ones before reconsiderable ones. Since announcements are unique by (wtxid, peer), as long as 1 announcement remains for a transaction, it remains in the orphanage. - This eviction strategy means no peer can influence the eviction of another peer’s orphans. - Also, since global limits are a multiple of per-peer limits, as long as a peer does not exceed its limits, its orphans are protected from eviction. - Orphans no longer expire, since older announcements are generally removed before newer ones. - GetChildrenFromSamePeer returns the transactions from newest to oldest. Co-authored-by: Pieter Wuille <pieter@wuille.net>	2025-07-14 16:13:47 -04:00
glozow	3da6d7f8f6	[prep/refactor] make TxOrphanage a virtual class implemented by TxOrphanageImpl	2025-07-14 16:13:46 -04:00
glozow	77ebe8f280	[prep/test] have TxOrphanage remember its own limits in LimitOrphans Move towards a model where TxOrphanage is initialized with limits that it remembers throughout its lifetime. Remove the param. Limiting by number of unique orphans will be removed in a later commit. Now that -maxorphantx is gone, this does not change the node behavior. The parameter is only used in tests.	2025-07-14 16:13:10 -04:00
glozow	d0af4239b7	[prep/refactor] move DEFAULT_MAX_ORPHAN_TRANSACTIONS to txorphanage.h This is move only.	2025-07-14 16:13:10 -04:00
glozow	8dd24c29ae	[prep/test] modify test to not access TxOrphanage internals These internals should and will be private.	2025-07-14 16:12:47 -04:00
glozow	44f5327824	[fuzz] add SeedRandomStateForTest(SeedRand::ZEROS) to txorphan	2025-07-11 13:52:50 -04:00
glozow	08e58fa911	[prep/refactor] move txorphanage to node namespace and directory This is move-only.	2025-07-11 13:52:50 -04:00
merge-script	23e15d40b9	Merge bitcoin/bitcoin#32631 : refactor: Convert GenTxid to `std::variant` `a60f863d3e` scripted-diff: Replace GenTxidVariant with GenTxid (marcofleon) `c8ba199598` Remove old GenTxid class (marcofleon) `072a198ea4` Convert remaining instances of GenTxid to GenTxidVariant (marcofleon) `1b528391c7` Convert `txrequest` to GenTxidVariant (marcofleon) `bde4579b07` Convert `txdownloadman_impl` to GenTxidVariant (marcofleon) `c876a892ec` Replace GenTxid with Txid/Wtxid overloads in `txmempool` (marcofleon) `de858ce2be` move-only: make GetInfo a private CTxMemPool member (stickies-v) `eee473d9f3` Convert `CompareInvMempoolOrder` to GenTxidVariant (marcofleon) `243553d590` refactor: replace get_iter_from_wtxid with GetIter(const Wtxid&) (stickies-v) `fcf92fd640` refactor: make CTxMemPool::GetIter strongly typed (marcofleon) `11d28f21bb` Implement GenTxid as a variant (marcofleon) Pull request description: Part of the [type safety refactor](https://github.com/bitcoin/bitcoin/pull/32189). This PR changes the GenTxid class to a variant, which holds both Txids and Wtxids. This provides compile-time type safety and eliminates the manual type check (bool m_is_wtxid). Variables that can be either a Txid or a Wtxid are now using the new GenTxid variant, instead of uint256. ACKs for top commit: w0xlt: ACK `a60f863d3e` dergoegge: Code review ACK `a60f863d3e` maflcko: review ACK `a60f863d3e` 🎽 theStack: Code-review ACK `a60f863d3e` Tree-SHA512: da9b73b7bdffee2eb9281a409205519ac330d3336094d17681896703fbca8099608782c9c85801e388e4d90af5af8abf1f34931f57bbbe6e9674d802d6066047	2025-07-11 13:47:19 -04:00
merge-script	12fb00fd42	Merge bitcoin/bitcoin#32927 : fuzz: Add missing calls to `SetMockTime` for determinism `fa8862723c` fuzz: CheckGlobals in init (MarcoFalke) `fa26bfde98` test: Avoid resetting mocktime in testing setup (MarcoFalke) `fa6b45fa8e` Add SetMockTime for time_point types (MarcoFalke) Pull request description: (Tracking issue https://github.com/bitcoin/bitcoin/issues/29018) During fuzzing, `AppInitParameterInteraction` may actually disable a previously set mocktime. This is confusing and can also cause non-determinism. Fix this issue, by * fixing the erroneous `-mocktime` parsing in `AppInitParameterInteraction`. * adding the missing `SetMockTime` calls to the affected fuzz init functions. * adding a `CheckGlobals` to the fuzz init, to prevent this issue in the future. This can be tested by * Cherry-picking the `CheckGlobals`-commit onto current master and observing a fuzz failure in the touched fuzz targets. * Reverting the touched fuzz fixups and observing a fuzz failure for each target. ACKs for top commit: w0xlt: ACK `fa8862723c` dergoegge: utACK `fa8862723c` Tree-SHA512: 5a9400f0467c82fa224713af4cc2b525afbefefc7c3f419077110925ad7af6c7fda3dcd2b50f7facf0ee7df2547c6ac20336906d707adcdfd1d652a9d9a735fe	2025-07-11 11:18:03 +01:00
merge-script	5ef0d4897b	Merge bitcoin/bitcoin#30605 : Cluster linearization: separate tests from tests-of-tests `d7fca5c171` clusterlin: add big comment explaning the relation between tests (Pieter Wuille) `b64e61d2de` clusterlin: abstract try-permutations into ExhaustiveLinearize function (Pieter Wuille) `1fa55a64ed` clusterlin tests: verify that chunks are minimal (Pieter Wuille) `da23ecef29` clusterlin tests: support non-empty ReadTopologicalSubset() (Pieter Wuille) `94f3e17c33` clusterlin tests: compare with fuzz-provided linearizations (Pieter Wuille) `5f92ebee0d` clusterlin tests: compare with fuzz-provided topological sets (Pieter Wuille) `6e37824ac3` clusterlin tests: optimize clusterlin_simple_linearize (Pieter Wuille) `98c1c88b6f` clusterlin tests: separate testing of SimpleLinearize and Linearize (Pieter Wuille) `10e90f7aef` clusterlin tests: make SimpleCandidateFinder always find connected (Pieter Wuille) `a38c38951e` clusterlin tests: separate testing of Search- and SimpleCandidateFinder (Pieter Wuille) `77a432ee70` clusterlin tests: count SimpleCandidateFinder iterations better (Pieter Wuille) Pull request description: Part of the cluster mempool project: #30289 The current cluster linearization fuzz tests contain two tests which combine testing of production code with testing of the test code itself: * `clusterlin_search_finder`: establishes the correctness of `SearchCandidateFinder` by comparing against both `SimpleCandidateFinder` and `ExhaustiveCandidateFinder` (which is even more simple than `SimpleCandidateFinder`). If `SimpleCandidateFinder` works correctly, then this comparison with `ExhaustiveCandidateFinder` is redundant. If it isn't, we ought to find that in a test specific to `SimpleCandidateFinder` rather than as a side-effect of testing `SearchCandidateFinder`. Split this functionality out into a new `clusterlin_simple_finder`. * `clusterlin_linearize`: establishes the correctness of `Linearize` by comparing against both `SimpleLinearize` and literally every valid linearization for the cluster. Again, if `SimpleLinearize` works correctly, then this comparison with all valid linearizations is redundant, and if it isn't we should find it in a test for `SimpleLinearize`. Do so by splitting off that functionality into `clusterlin_simple_linearize`. After that, a few general improvements to the affected tests are made (comparing with linearizations and subsets read from the fuzz input, plus a performance improvement). ACKs for top commit: marcofleon: Re ACK `d7fca5c171` ismaelsadeeq: re-ACK `d7fca5c171` monlovesmango: ACK `d7fca5c171` Tree-SHA512: 33cb76bd9b9547a5f3ee231fa452e928f064ad03af98e3d9e64246eb972f2b026c13e7367257ccdac1ae57982ee8ef98c907684588ecbb4bc4c82cbec160b3e8	2025-07-10 13:52:31 -04:00
Ava Chow	a40e953658	Merge bitcoin/bitcoin#30479 : validation: Add eligible ancestors of reconsidered block to setBlockIndexCandidates `8cc3ac6c23` validation: Don't use IsValid() to filter for invalid blocks (Martin Zumsande) `86d98b94e5` test: verify that ancestors of a reconsidered block can become the chain tip (stratospher) `3c39a55e64` validation: Add ancestors of reconsiderblock to setBlockIndexCandidates (Martin Zumsande) Pull request description: When we call `reconsiderblock` for some block, `Chainstate::ResetBlockFailureFlags` puts the descendants of that block into `setBlockIndexCandidates` (if they meet the criteria, i.e. have more work than the tip etc.), but never put any ancestors into the set even though we do clear their failure flags. I think that this is wrong, because `setBlockIndexCandidates` should always contain all eligible indexes that have at least as much work as the current tip, which can include ancestors of the reconsidered block. This is being checked by `CheckBlockIndex()`, which could fail if it was invoked after `ActivateBestChain` connects a block and releases `cs_main`: ``` diff diff --git a/src/validation.cpp b/src/validation.cpp index 7b04bd9a5b..ff0c3c9f58 100644 --- a/src/validation.cpp +++ b/src/validation.cpp @@ -3551,6 +3551,7 @@ bool Chainstate::ActivateBestChain(BlockValidationState& state, std::shared_ptr< } } // When we reach this point, we switched to a new tip (stored in pindexNewTip). + m_chainman.CheckBlockIndex(); if (exited_ibd) { // If a background chainstate is in use, we may need to rebalance our ``` makes `rpc_invalidateblock.py` fail on master. Even though we don't currently have a `CheckBlockIndex()` in that place, after `cs_main` is released other threads could invoke it, which is happening in the rare failures of #16444 where an invalid header received from another peer could trigger a `CheckBlockIndex()` call that would fail. Fix this by adding eligible ancestors to `setBlockIndexCandidates` in `Chainstate::ResetBlockFailureFlags` (also simplifying that function a bit). Fixes #16444 ACKs for top commit: achow101: ACK `8cc3ac6c23` TheCharlatan: Re-ACK `8cc3ac6c23` stratospher: reACK `8cc3ac6`. Tree-SHA512: 53f27591916246be4093d64b86a0494e55094abd8c586026b1247e4a36747bc3d6dbe46dc26ee4a22f47b8eb0d9699d13e577dee0e7198145f3c9b11ab2a30b7	2025-07-09 16:55:43 -07:00
Eugene Siegel	d541409a64	log: Add rate limiting to LogPrintf, LogInfo, LogWarning, LogError, LogPrintLevel To mitigate disk-filling attacks caused by unsafe usages of LogPrintf and friends, we rate-limit them by passing a should_ratelimit bool that eventually makes its way to LogPrintStr which may call LogRateLimiter::Consume. The rate limiting is accomplished by adding a LogRateLimiter member to BCLog::Logger which tracks source code locations for the given logging window. Every hour, a source location can log up to 1MiB of data. Source locations that exceed the limit will have their logs suppressed for the rest of the window determined by m_limiter. This change affects the public LogPrintLevel function if called with a level >= BCLog::Level::Info. The UpdateTipLog function has been changed to use the private LogPrintLevel_ macro with should_ratelimit set to false. This allows UpdateTipLog to log during IBD without hitting the rate limit. Note that on restart, a source location that was rate limited before the restart will be able to log until it hits the rate limit again. Co-Authored-By: Niklas Gogge <n.goeggi@gmail.com> Co-Authored-By: stickies-v <stickies-v@protonmail.com>	2025-07-09 09:13:00 -04:00
Eugene Siegel	a6a35cc0c2	log: use std::source_location in place of __func__, __FILE__, __LINE__ The std::source_location conveniently stores the file name, line number, and function name of a source code location. We switch to using it instead of the __func__ identifier and the __FILE__ and __LINE__ macros. BufferedLog is changed to have a std::source_location member, replacing the source_file, source_line, and logging_function members. As a result, MemUsage no longer explicitly counts source_file or logging_function as the std::source_location memory usage is included in the MallocUsage call. This also changes the behavior of -logsourcelocations as std::source_location includes the entire function signature. Because of this, the functional test feature_config_args.py must be changed to no longer include the function signature as the function signature can differ across platforms. Co-Authored-By: Niklas Gogge <n.goeggi@gmail.com> Co-Authored-By: stickies-v <stickies-v@protonmail.com>	2025-07-09 09:12:59 -04:00
Eugene Siegel	afb9e39ec5	log: introduce LogRateLimiter, LogLimitStats, Status LogRateLimiter will be used to keep track of source locations and our current time-based logging window. It contains an unordered_map and a m_suppressions_active bool to track source locations. The map is keyed by std::source_location, so a custom Hash function (SourceLocationHasher) and custom KeyEqual function (SourceLocationEqual) is provided. SourceLocationHasher uses CSipHasher(0,0) under the hood to get a uniform distribution. A public Reset method is provided so that a scheduler (e.g. the "b-scheduler" thread) can periodically reset LogRateLimiter's state when the time window has elapsed. The LogRateLimiter::Consume method checks if we have enough available bytes in our rate limiting budget to log an additional string. It returns a Status enum that denotes the rate limiting status and can be used by the caller to emit a warning, skip logging, etc. The Status enum has three states: - UNSUPPRESSED (logging was successful) - NEWLY_SUPPRESSED (logging was succcesful, next log will be suppressed) - STILL_SUPPRESSED (logging was unsuccessful) LogLimitStats counts the available bytes left for logging per source location for the current logging window. It does not track actual source locations; it is used as a value in m_source_locations. Also exposes a SuppressionsActive() method so the logger can use that in a later commit to prefix [*] to logs whenenever suppressions are active. Co-Authored-By: Niklas Gogge <n.goeggi@gmail.com> Co-Authored-By: stickies-v <stickies-v@protonmail.com>	2025-07-09 09:12:59 -04:00
Eugene Siegel	df7972a6cf	test: Mark ~DebugLogHelper as noexcept(false) We mark ~DebugLogHelper as noexcept(false) to be able to catch the exception it throws. This lets us use it in test in combination with BOOST_CHECK_THROW and BOOST_CHECK_NO_THROW to check that certain log messages are (not) logged. Co-Authored-By: Niklas Gogge <n.goeggi@gmail.com>	2025-07-09 09:12:59 -04:00
MarcoFalke	fa8862723c	fuzz: CheckGlobals in init	2025-07-09 14:28:23 +02:00
MarcoFalke	fa26bfde98	test: Avoid resetting mocktime in testing setup This allows to set the mocktime before the testing setup. Also, in some fuzz tests the mocktime was reset to 0 before this change, so set it.	2025-07-09 14:28:14 +02:00
marcofleon	a60f863d3e	scripted-diff: Replace GenTxidVariant with GenTxid -BEGIN VERIFY SCRIPT- sed -i 's/GenTxidVariant/GenTxid/g' $(git grep -l 'GenTxidVariant') -END VERIFY SCRIPT-	2025-07-08 20:00:51 +01:00
marcofleon	1b528391c7	Convert `txrequest` to GenTxidVariant Switch all instances of GenTxid to the new variant in `txrequest` and complete `txdownloadman_impl` by converting `GetRequestsToSend`.	2025-07-08 20:00:51 +01:00
marcofleon	bde4579b07	Convert `txdownloadman_impl` to GenTxidVariant Convert all of `txdownloadman_impl` to the new variant except for `GetRequestsToSend`, which will be easier to switch at the same time as `txrequest`.	2025-07-08 20:00:43 +01:00
marcofleon	c876a892ec	Replace GenTxid with Txid/Wtxid overloads in `txmempool` Co-authored-by: stickies-v <stickies-v@protonmail.com>	2025-07-08 19:31:02 +01:00
merge-script	87ab69155d	Merge bitcoin/bitcoin#31553 : cluster mempool: add TxGraph reorg functionality `1632fc104b` txgraph: Track multiple potential would-be clusters in Trim (improvement) (Pieter Wuille) `4608df37e0` txgraph: add Trim benchmark (benchmark) (Pieter Wuille) `9c436ff01c` txgraph: add fuzz test scenario that avoids cycles inside Trim() (tests) (Pieter Wuille) `938e86f8fe` txgraph: add unit test for TxGraph::Trim (tests) (glozow) `a04e205ab0` txgraph: Add ability to trim oversized clusters (feature) (Pieter Wuille) `eabcd0eb6f` txgraph: remove unnecessary m_group_oversized (simplification) (Greg Sanders) `19b14e61ea` txgraph: Permit transactions that exceed cluster size limit (feature) (Pieter Wuille) `c4287b9b71` txgraph: Add ability to configure maximum cluster size/weight (feature) (Pieter Wuille) Pull request description: Part of cluster mempool (#30289). During reorganisations, it is possible that dependencies get added which would result in clusters that violate policy limits (cluster count, cluster weight), when linking the new from-block transactions to the old from-mempool transactions. Unlike RBF scenarios, we cannot simply reject the changes when they are due to received blocks. To accommodate this, add a `TxGraph::Trim()`, which removes some subset of transactions (including descendants) in order to make all resulting clusters satisfy the limits. Conceptually, the way this is done is by defining a rudimentary linearization for the entire would-be too-large cluster, iterating it from beginning to end, and reasoning about the counts and weights of the clusters that would be reached using transactions up to that point. If a transaction is encountered whose addition would violate the limit, it is removed, together with all its descendants. This rudimentary linearization is like a merge sort of the chunks of the clusters being combined, but respecting topology. More specifically, it is continuously picking the highest-chunk-feerate remaining transaction among those which have no unmet dependencies left. For efficiency, this rudimentary linearization is computed lazily, by putting all viable transactions in a heap, sorted by chunk feerate, and adding new transactions to it as they become viable. The `Trim()` function is rather unusual compared to the `TxGraph` functionality added in previous PRs, in that `Trim()` makes it own decisions about what the resulting graph contents will be, without good specification of how it makes that decision - it is just a best-effort attempt (which is improved in the last commit). All other `TxGraph` mutators are simply to inform the graph about changes the calling mempool code decided on; this one lets the decision be made by txgraph. As part of this, the "oversized" property is expanded to also encompass a configurable cluster weight limit (in addition to cluster count limit). ACKs for top commit: instagibbs: reACK `1632fc104b` glozow: reACK `1632fc104b` via range-diff ismaelsadeeq: reACK `1632fc104b` 🛰️ Tree-SHA512: ccacb54be8ad622bd2717905fc9b7e42aea4b07f824de1924da9237027a97a9a2f1b862bc6a791cbd2e1a01897ad2c7c73c398a2d5ccbce90bfbeac0bcebc9ce	2025-07-07 16:11:51 -04:00
merge-script	d33c111448	Merge bitcoin/bitcoin#32829 : threading: use correct mutex name in reverse_lock fatal error messages `de4eef52d1` threading: use correct mutex name in reverse_lock fatal error messages (Cory Fields) Pull request description: "Now that REVERSE_LOCK requires the name of the actual mutex, it can be used for better error messages." - theuni This is a follow-up to this comment https://github.com/bitcoin/bitcoin/pull/32465#issuecomment-2981287545 I just cherry-picked the commit 85c2848eb575f4abaa81fdd4e8f3b2048693dd98 ACKs for top commit: theuni: Re-ACK `de4eef52d1` TheCharlatan: ACK `de4eef52d1` Tree-SHA512: 1109381e1f0589093f7c737cb1ebd1c43324a9e1ea34b5f05a9171d06ab44cca0c5ead43c581f6e37ded1f0463ab8a280f3319c288d39a4625109b5c08a7cb68	2025-07-07 15:51:37 +01:00
Cory Fields	de4eef52d1	threading: use correct mutex name in reverse_lock fatal error messages Now that REVERSE_LOCK requires the name of the actual mutex, it can be used for better error messages.	2025-07-07 10:34:05 -04:00
Ava Chow	ea4285775e	Merge bitcoin/bitcoin#29307 : util: explicitly close all AutoFiles that have been written `c10e382d2a` flatfile: check whether the file has been closed successfully (Vasil Dimov) `4bb5dd78ea` util: check that a file has been closed before ~AutoFile() is called (Vasil Dimov) `8bb34f07df` Explicitly close all AutoFiles that have been written (Vasil Dimov) `a69c4098b2` rpc: take ownership of the file by WriteUTXOSnapshot() (Hodlinator) Pull request description: `fclose(3)` may fail to flush the previously written data to disk, thus a failing `fclose(3)` is as serious as a failing `fwrite(3)`. Previously the code ignored `fclose(3)` failures. This PR improves that by changing all users of `AutoFile` that use it to write data to explicitly close the file and handle a possible error. --- Other alternatives are: 1. `fflush(3)` after each write to the file (and throw if it fails from the `AutoFile::write()` method) and hope that `fclose(3)` will then always succeed. Assert that it succeeds from the destructor 🙄. Will hurt performance. 2. Throw nevertheless from the destructor. Exception within the exception in C++ I think results in terminating the program without a useful message. 3. (this is implemented in the latest incarnation of this PR) Redesign `AutoFile` so that its destructor cannot fail. Adjust _all_ its users 😭. For example, if the file has been written to, then require the callers to explicitly call the `AutoFile::fclose()` method before the object goes out of scope. In the destructor, as a sanity check, assume/assert that this is indeed the case. Defeats the purpose of a RAII wrapper for `FILE*` which automatically closes the file when it goes out of scope and there are a lot of users of `AutoFile`. 4. Pass a new callback function to the `AutoFile` constructor which will be called from the destructor to handle `fclose()` errors, as described in https://github.com/bitcoin/bitcoin/pull/29307#issuecomment-2243842400. My thinking is that if that callback is going to only log a message, then we can log the message directly from the destructor without needing a callback. If the callback is going to do more complicated error handling then it is easier to do that at the call site by directly calling `AutoFile::fclose()` instead of getting the `AutoFile` object out of scope (so that its destructor is called) and inspecting for side effects done by the callback (e.g. set a variable to indicate a failed `fclose()`). ACKs for top commit: l0rinc: ACK `c10e382d2a` achow101: ACK `c10e382d2a` hodlinator: re-ACK `c10e382d2a` Tree-SHA512: 3994ca57e5b2b649fc84f24dad144173b7500fc0e914e06291d5c32fbbf8d2b1f8eae0040abd7a5f16095ddf4e11fe1636c6092f49058cda34f3eb2ee536d7ba	2025-07-03 15:37:44 -07:00
merge-script	49d5f1f2c6	Merge bitcoin/bitcoin#32850 : test: check P2SH sigop count for coinbase tx `d6aaffcb11` test: check P2SH sigop count for coinbase tx (brunoerg) Pull request description: We currently do not test that `GetP2SHSigOpCount` returns 0 for coinbase transactions (see line L129 at https://corecheck.dev/mutation/src/consensus/tx_verify.cpp). This PR addresses it. ACKs for top commit: darosior: That said, i guess unit-tested dead consensus code is better than not-unit-tested dead consensus code. utACK `d6aaffcb11` theStack: ACK `d6aaffcb11` w0xlt: ACK `d6aaffcb11` ishaanam: ACK `d6aaffcb11` pablomartin4btc: ACK `d6aaffcb11` Tree-SHA512: a7d7306f064bb2ec7e93e92625848ae38e150ebb67bde37cd15be1038816b154e867ad21ecd2685d8de5341b67e3b768d30b7654e27b541f33e8f9d63e52261d	2025-07-03 09:46:53 +01:00
Pieter Wuille	1632fc104b	txgraph: Track multiple potential would-be clusters in Trim (improvement) In the existing Trim function, as soon as the set of accepted transactions would exceed the max cluster size or count limit, the acceptance loop is stopped, removing all later transactions. However, it is possible that by excluding some of those transactions the would-be cluster splits apart into multiple would-clusters. And those clusters may well permit far more transactions before their limits are reached. Take this into account by using a union-find structure inside TrimTxData to keep track of the count/size of all would-be clusters that would be formed at any point, and only reject transactions which would cause these resulting partitions to exceed their limits. This is not an optimization in terms of CPU usage or memory; it just improves the quality of the transactions removed by Trim().	2025-07-02 16:01:57 -04:00
Pieter Wuille	9c436ff01c	txgraph: add fuzz test scenario that avoids cycles inside Trim() (tests) Trim internally builds an approximate dependency graph of the merged cluster, replacing all existing dependencies within existing clusters with a simple linear chain of dependencies. This helps keep the complexity of the merging operation down, but may result in cycles to appear in the general case, even though in real scenarios (where Trim is called for stitching re-added mempool transactions after a reorg back to the existing mempool transactions) such cycles are not possible. Add a test that specifically targets Trim() but in scenarios where it is guaranteed not to have any cycles. It is a special case, is much more a whitebox test than a blackbox test, and relies on randomness rather than fuzz input. The upside is that somewhat stronger properties can be tested. Co-authored-by: Greg Sanders <gsanders87@gmail.com>	2025-07-02 15:06:53 -04:00
glozow	938e86f8fe	txgraph: add unit test for TxGraph::Trim (tests) Co-Authored-By: Pieter Wuille <pieter@wuille.net>	2025-07-02 15:06:48 -04:00
Pieter Wuille	a04e205ab0	txgraph: Add ability to trim oversized clusters (feature) During reorganisations, it is possible that dependencies get add which result in clusters that violate limits (count, size), when linking the new from-block transactions to the old from-mempool transactions. Unlike RBF scenarios, we cannot simply reject these policy violations when they are due to received blocks. To accomodate this, add a Trim() function to TxGraph, which removes transactions (including descendants) in order to make all resulting clusters satisfy the limits. In the initial version of the function added here, the following approach is used: - Lazily compute a naive linearization for the to-be-merged cluster (using an O(n log n) algorithm, optimized for far larger groups of transactions than the normal linearization code). - Initialize a set of accepted transactions to {} - Iterate over the transactions in this cluster one by one: - If adding the transaction to the set makes it exceed the max cluster size or count limit, stop. - Add the transaction to the set. - Remove all transactions from the cluster that were not included in the set (note that this necessarily includes all descendants too, because they appear later in the naive linearization). Co-authored-by: Greg Sanders <gsanders87@gmail.com>	2025-07-02 14:52:54 -04:00
Pieter Wuille	19b14e61ea	txgraph: Permit transactions that exceed cluster size limit (feature) This removes the restriction added in the previous commit that individual transactions do not exceed the max cluster size limit. With this change, the responsibility for enforcing cluster size limits can be localized purely in TxGraph, without callers (and in particular, tests) needing to duplicate the enforcement for individual transactions.	2025-07-02 14:52:54 -04:00
Pieter Wuille	c4287b9b71	txgraph: Add ability to configure maximum cluster size/weight (feature) This is integrated with the oversized property: the graph is oversized when any connected component within it contains more than the cluster count limit many transactions, or when their combined size/weight exceeds the cluster size limit. It becomes disallowed to call AddTransaction with a size larger than this limit, though this limit will be lifted in the next commit. In addition, SetTransactionFeeRate becomes SetTransactionFee, so that we do not need to deal with the case that a call to this function might affect the oversizedness.	2025-07-02 14:52:54 -04:00
merge-script	a92e8b10a5	Merge bitcoin/bitcoin#32564 : miniscript, refactor: Make `operator""_mst` `consteval` (re-take) `a34fb9ad6c` miniscript: Make `operator""_mst` `consteval` (Pieter Wuille) `14052162b1` Revert "miniscript: make operator_mst consteval" (Hennadii Stepanov) Pull request description: Same as https://github.com/bitcoin/bitcoin/pull/28657, but without the refactoring required to work around [fixed](https://github.com/bitcoin/bitcoin/pull/28657#discussion_r2095743353) MSVC bugs. The second commit has been taken from https://github.com/bitcoin/bitcoin/pull/29167. ACKs for top commit: sipa: ACK `a34fb9ad6c` hodlinator: re-ACK `a34fb9ad6c` Tree-SHA512: 8b531f9d6c450a8a5218865da05ffb5093d09ce2c0bee9874c0160795c4b1713928730d894ea3cd0b12b133346971ae3a00ed2fe8d9fd8a50b67a74ef81fde98	2025-07-02 15:06:33 +01:00
brunoerg	d6aaffcb11	test: check P2SH sigop count for coinbase tx	2025-07-01 12:08:20 -03:00
marcofleon	fcf92fd640	refactor: make CTxMemPool::GetIter strongly typed This allows adding a GetIter(const Wtxid&) overload in a next commit, making it easier to visit this function from a variant. Co-authored-by: stickies-v <stickies-v@protonmail.com>	2025-07-01 09:00:27 +01:00
merge-script	fb2c16cf7b	Merge bitcoin/bitcoin#32826 : p2p: add more bad ports `6967e8e8ab` add more bad p2p ports (Jameson Lopp) Pull request description: Add a few more ports used by extremely well adopted services that require authentication and really ought not be used by bitcoin nodes for p2p traffic. ACKs for top commit: Sjors: utACK `6967e8e8ab` l0rinc: ACK `6967e8e8ab` glozow: ACK `6967e8e8ab` Tree-SHA512: bbe86aef2be9727338712ded8f90227f5d12f633ab5d324c8907c01173945d1c4d9899e05565f78688842bbf5ebb010d22173969e4168ea08d4e33f01fe9569d	2025-06-30 13:28:17 -04:00
merge-script	f5f3e1f263	Merge bitcoin/bitcoin#32646 : p2p: Add witness mutation check inside FillBlock `28299ce776` p2p: remove vestigial READ_STATUS_CHECKBLOCK_FAILED (Greg Sanders) `bac9ee4830` p2p: Add witness mutation check inside FillBlock (Greg Sanders) Pull request description: Since #29412, we have not allowed mutated blocks to continue being processed immediately the block is received, but this is only done for the legacy BLOCK message. Extend these checks as belt-and-suspenders to not allow similar mutation strategies to affect relay by honest peers by applying the check inside `PartiallyDownloadedBlock::FillBlock`, immediately before returning `READ_STATUS_OK`. ACKs for top commit: Crypt-iQ: ACK `28299ce776` achow101: ACK `28299ce776` stratospher: ACK `28299ce7`. dergoegge: Code review ACK `28299ce776` Tree-SHA512: 883d7c12ca096234b425e6fe12e46b0611607600916e6ac8d1c8112224aa76924b7b074754910163ac2ec15379075d618a9ece3642649ac7629cddb0d4e432ea	2025-06-30 13:15:37 -04:00
Jameson Lopp	6967e8e8ab	add more bad p2p ports	2025-06-30 06:24:00 -04:00
Ava Chow	3086c21df4	Merge bitcoin/bitcoin#32243 : test: added fuzz coverage for consensus/merkle.cpp `95969bc58a` test: added fuzz coverage to consensus/merkle.cpp (kevkevinpal) Pull request description: ### Summary This adds a new fuzz target "merkle" which adds fuzz coverage to `consensus/merkle.cpp` I can also add this to an existing fuzz target if that is preferable Before: ![Screenshot 2025-04-09 at 10 12 54 PM](https://github.com/user-attachments/assets/e5f8da56-4583-441d-b08f-dfcc255ff248) After: ![Screenshot 2025-04-11 at 4 20 41 PM](https://github.com/user-attachments/assets/849ee079-b715-4089-9e36-d156233236c6) ACKs for top commit: marcofleon: ReACK `95969bc58a` Prabhat1308: ACK [`95969bc`](`95969bc58a`) maflcko: lgtm ACK `95969bc58a` achow101: ACK `95969bc58a` Tree-SHA512: e1fe8b69444733516bfa6cf2adaa199fde4c7c5582b7b908408f9313ed0f2e8cb803d27d707a1716d49606d5eaef8c1e722990bbc3cffc30fa91fe73d2233e9d	2025-06-27 13:34:30 -07:00
Ava Chow	319ff58bbd	Merge bitcoin/bitcoin#32638 : blocks: force hash validations on disk read `9341b5333a` blockstorage: make block read hash checks explicit (Lőrinc) `2371b9f4ee` test/bench: verify hash in `ComputeFilter` reads (Lőrinc) `5d235d50d6` net: assert block hash in `ProcessGetBlockData` and `ProcessMessage` (Lőrinc) Pull request description: A follow-up to https://github.com/bitcoin/bitcoin/pull/32487#discussion_r2094072165, after which validating the hash of a read block from disk doesn't incur the cost of calculating its hash anymore. ### Summary This PR adds explicit checks that the read block header's hash matches the one we were expecting. ### Context After the previous PR, validating a block's hash during read operations became essentially free. This PR leverages that by requiring callers to provide a block's expected hash (or `std::nullopt`), preventing silent failures caused by corrupted or mismatched data. Most `ReadBlock` usages were updated with expected hashes and now fail on mismatch. ### Changes * added hash assertions in `ProcessGetBlockData` and `ProcessMessage` to validate that the block read from disk matches the expected hash; * updated tests and benchmark to pass the correct block hash to `ReadBlock()`, ensuring the hash validation is tested - or none if we already expect PoW failure; * removed the default value for `expected_hash`, requiring an explicit hash for all block reads. ### Why is the hash still optional (but no longer has a default value) * for header-error tests, where the goal is to trigger failures early in the parsing process; * for out-of-order orphan blocks, where the child hash isn't available before the initial disk read. ACKs for top commit: maflcko: review ACK `9341b5333a` 🕙 achow101: ACK `9341b5333a` hodlinator: ACK `9341b5333a` janb84: re ACK `9341b5333a` Tree-SHA512: cf1d4fff4c15e3f8898ec284929cb83d7e747125d4ee759e77d369f1716728e843ef98030be32c8d608956a96ae2fbefa0e801200c333b9eefd6c086ec032e1f	2025-06-27 13:28:26 -07:00
Hennadii Stepanov	14052162b1	Revert "miniscript: make operator_mst consteval" This reverts commit `63317103c9`. operator""_mst has been manually adjusted according to commit `faf2162565`	2025-06-25 11:32:57 +01:00
merge-script	154b98a7aa	Merge bitcoin/bitcoin#32772 : fuzz: wallet: remove `FundTx` from `FuzzedWallet` `cd1ae1b4df` fuzz: wallet: remove FundTx from FuzzedWallet (brunoerg) Pull request description: `FundTx` was used by the `wallet_notifications` target which we recently removed. So it's now unused and can be removed. ACKs for top commit: maflcko: lgtm ACK `cd1ae1b4df` kevkevinpal: ACK [`cd1ae1b`](`cd1ae1b4df`) dergoegge: utACK `cd1ae1b4df` Tree-SHA512: 909cc4c8a0ac2a5f8844993ccf0e725021932888da3591925799145daf9196eadfcd0ebbc74a44f4a245074ded4cb8c3c099513f109ce2681dceff36b5f74bcc	2025-06-19 15:14:18 +01:00
Ava Chow	9a7eece5a4	Merge bitcoin/bitcoin#31981 : Add checkBlock() to Mining interface `a18e572328` test: more template verification tests (Sjors Provoost) `10c908808f` test: move gbt proposal mode tests to new file (Sjors Provoost) `94959b8dee` Add checkBlock to Mining interface (Sjors Provoost) `6077157531` ipc: drop BlockValidationState special handling (Sjors Provoost) `74690f4ed8` validation: refactor TestBlockValidity (Sjors Provoost) Pull request description: This PR adds the IPC equivalent of the `getblocktemplate` RPC in `proposal` mode. In order to do so it has `TestBlockValidity` return error reasons as a string instead of `BlockValidationState`. This avoids complexity in IPC code for handling the latter struct. The new Mining interface method is used in `miner_tests`. It's not used by the `getblocktemplate` and `generateblock` RPC calls, see https://github.com/bitcoin/bitcoin/pull/31981#discussion_r2096473337 The `inconclusive-not-best-prevblk` check is moved from RPC code to `TestBlockValidity`. Test coverage is increased by `mining_template_verification.py`. Superseedes #31564 ## Background ### Verifying block templates (no PoW) Stratum v2 allows miners to generate their own block template. Pools may wish (or need) to verify these templates. This typically involves comparing mempools, asking miners to providing missing transactions and then reconstructing the proposed block.[^0] This is not sufficient to ensure a proposed block is actually valid. In some schemes miners could take advantage of incomplete validation[^1]. The Stratum Reference Implementation (SRI), currently the only Stratum v2 implementation, collects all missing mempool transactions, but does not yet fully verify the block.[^2]. It could use the `getblocktemplate` RPC in `proposal` mode, but using IPC is more performant, as it avoids serialising up to 4 MB of transaction data as JSON. (although SRI could use this PR, the Template Provider role doesn't need it, so this is _not_ part of #31098) [^0]: https://github.com/stratum-mining/sv2-spec/blob/main/06-Job-Declaration-Protocol.md [^1]: https://delvingbitcoin.org/t/pplns-with-job-declaration/1099/45?u=sjors [^2]: https://github.com/stratum-mining/stratum/blob/v1.1.0/roles/jd-server/src/lib/job_declarator/message_handler.rs#L196 ACKs for top commit: davidgumberg: reACK `a18e572328` achow101: ACK `a18e572328` TheCharlatan: ACK `a18e572328` ryanofsky: Code review ACK `a18e572328` just adding another NONFATAL_UNREACHABLE since last review Tree-SHA512: 1a6c29f45a1666114f10f55aed155980b90104db27761c78aada4727ce3129e6ae7a522d90a56314bd767bd7944dfa46e85fb9f714370fc83e6a585be7b044f1	2025-06-18 17:07:21 -07:00
brunoerg	cd1ae1b4df	fuzz: wallet: remove FundTx from FuzzedWallet	2025-06-18 11:11:25 -03:00
Ryan Ofsky	5e6dbfd14e	Merge bitcoin/bitcoin#32465 : thread-safety: fix annotations with REVERSE_LOCK `a201a99f8c` thread-safety: fix annotations with REVERSE_LOCK (Cory Fields) `aeea5f0ec1` thread-safety: add missing lock annotation (Cory Fields) `832c57a534` thread-safety: modernize thread safety macros (Cory Fields) Pull request description: This is one of several PRs to cleanup/modernize our threading primitives. While replacing the old critical section locks in the mining code with a `REVERSE_LOCK`, I noticed that our thread-safety annotations weren't hooked up to it. This PR gets `REVERSE_LOCK` working properly. Firstly it modernizes the attributes as-recommended by the [clang docs](https://clang.llvm.org/docs/ThreadSafetyAnalysis.html) (ctrl+f for `USE_LOCK_STYLE_THREAD_SAFETY_ATTRIBUTES`). There's a subtle difference between the old `unlock_function` and new `release_capability`, where our `reverse_lock` only works with the latter. I believe this is an upstream bug. I've [reported and attempted a fix here](https://github.com/llvm/llvm-project/pull/139343), but either way it makes sense to me to modernize. The second adds a missing annotation pointed out by a fixed `REVERSE_LOCK`. Because clang's thread-safety annotations aren't passed through a reference to `UniqueLock` as one may assume (see [here](https://clang.llvm.org/docs/ThreadSafetyAnalysis.html#no-alias-analysis) for more details), `cs_main` has to be listed explicitly as a requirement. The last commit actually fixes the `reverse_lock` by making it a `SCOPED_LOCK` and using the pattern [found in a clang test](https://github.com/llvm/llvm-project/blob/main/clang/test/SemaCXX/warn-thread-safety-analysis.cpp#L3126). Though the docs don't describe how to accomplish it, the functionality was added [in this commit](`6a68efc959`). Due to aliasing issues (see link above), in order to work correctly, the original mutex has to be passed along with the lock, so all existing `REVERSE_LOCK`s have been updated. To ensure that the mutexes actually match, a runtime assertion is added. ACKs for top commit: fjahr: re-ACK `a201a99f8c` davidgumberg: reACK `a201a99f8c` theuni: Ok, done. Those last pushes can be ignored. ACKs on `a201a99` are still fresh. ryanofsky: Code review ACK `a201a99f8c`. Just dropping 0065b9673db5da2994b0b07c1d50ebfb19af39d0 and fixing incorrect `reverse_lock::lockname` initialization since last review. TheCharlatan: Re-ACK `a201a99f8c` Tree-SHA512: 2755fae0c41021976a1a633014a86d927f104ccbc8014c01c06dae89af363f92e5bc5d4276ad6d759302ac4679fe02a543758124d48318074db1c370989af7a7	2025-06-17 14:12:43 -04:00
Cory Fields	a201a99f8c	thread-safety: fix annotations with REVERSE_LOCK Without proper annotations, clang thinks that mutexes are still held for the duration of a reverse_lock. This could lead to subtle bugs as EXCLUSIVE_LOCKS_REQUIRED(foo) passes when it shouldn't. As mentioned in the docs [0], clang's thread-safety analyzer is unable to deal with aliases of mutexes, so it is not possible to use the lock's copy of the mutex for that purpose. Instead, the original mutex needs to be passed back to the reverse_lock for the sake of thread-safety analysis, but it is not actually used otherwise. [0]: https://clang.llvm.org/docs/ThreadSafetyAnalysis.html	2025-06-16 18:09:14 +00:00
Vasil Dimov	8bb34f07df	Explicitly close all AutoFiles that have been written There is no way to report a close error from `AutoFile` destructor. Such an error could be serious if the file has been written to because it may mean the file is now corrupted (same as if write fails). So, change all users of `AutoFile` that use it to write data to explicitly close the file and handle a possible error.	2025-06-16 15:33:15 +02:00

1 2 3 4 5 ...

5904 Commits