bitcoin

mirror of https://github.com/bitcoin/bitcoin.git synced 2026-01-20 07:09:15 +01:00

Author	SHA1	Message	Date
merge-script	2f410ad78c	Merge bitcoin/bitcoin#32263 : cluster mempool: add TxGraph work controls `62ed1f92ef` txgraph: check that DoWork finds optimal if given high budget (tests) (Pieter Wuille) `f3c2fc867f` txgraph: add work limit to DoWork(), try optimal (feature) (Pieter Wuille) `e96b00d99e` txgraph: make number of acceptable iterations configurable (feature) (Pieter Wuille) `cfe9958852` txgraph: track amount of work done in linearization (preparation) (Pieter Wuille) `6ba316eaa0` txgraph: 1-or-2-tx split-off clusters are optimal (optimization) (Pieter Wuille) `fad0eb091e` txgraph: reset quality when merging clusters (bugfix) (Pieter Wuille) Pull request description: Part of #30289. Builds on top of #31553. So far, the `TxGraph::DoWork()` function took no parameters, and just made all clusters reach the "acceptable" internal quality level by performing a minimum number of improvement iterations on it, but: * Did not attempt to go beyond that. * Was broken, as the QualityLevel of optimal clusters that merge together was not being reset. Fix this by adding an argument to `DoWork()` to control how much work it is allowed to do right now, which will first be used to get all clusters to the acceptable level, and if more budget remains, use it to try to get some or all clusters optimal. The function will now return `true` if all clusters are known to be optimal (and thus no further work remains). This is verified in the tests, by remembering whether the graph is optimal, and if it is at the end of the simulation run, verify that the overall linearization cannot be improved further. ACKs for top commit: instagibbs: ACK `62ed1f92ef` ismaelsadeeq: Code review ACK `62ed1f92ef` glozow: ACK `62ed1f92ef` Tree-SHA512: 5f57d4052e369f3444e72e724f04c02004e0f66e365faa59c9f145323e606508380fc97bb038b68783a62ae9c10757f1b628b3b00b2ce9a46161fca2d4336d73	2025-07-29 09:07:10 -04:00
ismaelsadeeq	31c4e77a25	test: fix ReadTopologicalSet unsigned integer overflow	2025-07-18 14:57:39 +01:00
Pieter Wuille	62ed1f92ef	txgraph: check that DoWork finds optimal if given high budget (tests)	2025-07-14 10:37:00 -04:00
Pieter Wuille	cfe9958852	txgraph: track amount of work done in linearization (preparation)	2025-07-14 09:41:17 -04:00
Pieter Wuille	d7fca5c171	clusterlin: add big comment explaning the relation between tests	2025-06-14 18:35:33 -04:00
Pieter Wuille	b64e61d2de	clusterlin: abstract try-permutations into ExhaustiveLinearize function Rather than this exhaustive linearization check happening inline inside clusterlin_simple_linearize, abstract it out into a Linearize()-like function for clarity. Note that this isn't exactly a refactor, because the old code would compare the found linearization against all (valid) permutations, while the new code instead first computes the best linearization from all valid permutations, and then compares it with the found one.	2025-06-14 18:28:13 -04:00
Pieter Wuille	1fa55a64ed	clusterlin tests: verify that chunks are minimal	2025-06-14 18:27:24 -04:00
Pieter Wuille	da23ecef29	clusterlin tests: support non-empty ReadTopologicalSubset() In several call sites for ReadTopologicalSubset, a non-empty result is expected, necessitating a special case at the call site for empty results. Fix this by adding a bool non_empty argument, which does this special casing (more efficiently) inside ReadTopologicalSubset itself.	2025-06-14 18:27:24 -04:00
Pieter Wuille	94f3e17c33	clusterlin tests: compare with fuzz-provided linearizations	2025-06-14 18:27:24 -04:00
Pieter Wuille	5f92ebee0d	clusterlin tests: compare with fuzz-provided topological sets	2025-06-14 18:27:24 -04:00
Pieter Wuille	6e37824ac3	clusterlin tests: optimize clusterlin_simple_linearize Whenever a non-topological permutation is encountered, fast forward to the last permutation with the same non-topological prefix, skipping over potentially many permutations that are non-topological for the same reason. With that, increase the checking of all permutations to clusters of size 8 instead of 7.	2025-06-14 18:27:24 -04:00
Pieter Wuille	98c1c88b6f	clusterlin tests: separate testing of SimpleLinearize and Linearize The separates the existing fuzz test into: * clusterlin_linearize: establishes the correctness of Linearize() using the simpler SimpleLinearize() function. * clusterlin_simple_linearize: establishes the correctness of SimpleLinearize() by comparing with all valid linearizations computed by std::next_permutation. rather than combining the first two into a single fuzz test.	2025-06-14 18:27:24 -04:00
Pieter Wuille	10e90f7aef	clusterlin tests: make SimpleCandidateFinder always find connected Make a small change to guarantee that SimpleCandidateFinder only ever returns connected solutions, even when non-optimal. Then test this property.	2025-06-14 18:27:24 -04:00
Pieter Wuille	a38c38951e	clusterlin tests: separate testing of Search- and SimpleCandidateFinder This separates the existing fuzz test into: * clusterlin_search_finder: establishes SearchCandidateFinder's correctness using the simpler SimpleCandidateFinder. * clusterlin_simple_finder: establishes SimpleCandidateFinder's correctness using the (even) simpler ExhaustiveCandidateFinder. rather than trying to do both at once.	2025-06-14 18:27:24 -04:00
Pieter Wuille	77a432ee70	clusterlin tests: count SimpleCandidateFinder iterations better Only count the number of actual new subsets added. If the queue contains a work item that completely covers a component, no transaction can be added to it without creating a disconnected component. In this case, also don't count it as an iteration. With this, the number of iterations performed by SimpleCandidateFinder is bounded by the number of distinct connected topologically-valid subsets of the cluster.	2025-06-14 18:27:24 -04:00
MarcoFalke	fa9ca13f35	refactor: Sort includes of touched source files	2025-06-03 19:56:55 +02:00
MarcoFalke	fae71d30f7	clang-tidy: Apply modernize-deprecated-headers This can be reproduced according to the developer notes with something like ( cd ./src/ && ../contrib/devtools/run-clang-tidy.py -p ../bld-cmake -fix -j $(nproc) ) Also, the header related changes were done manually.	2025-06-03 15:13:54 +02:00
Pieter Wuille	a52b53926b	clusterlin: add GetConnectedComponent This abstracts out the finding of the connected component that includes a given element from FindConnectedComponent (which just finds any connected component). Use this in the txgraph fuzz test, which was effectively reimplementing this logic. At the same time, improve its performance by replacing a vector with a set.	2025-03-27 15:48:44 -04:00
Pieter Wuille	d449773899	scripted-diff: (refactor) ClusterIndex -> DepGraphIndex Since cluster_linearize.h does not actually have a Cluster type anymore, it is more appropriate to rename the index type to DepGraphIndex. -BEGIN VERIFY SCRIPT- sed -i 's/Data type to represent transaction indices in clusters./Data type to represent transaction indices in DepGraphs and the clusters they represent./' $(git grep -l 'using ClusterIndex') sed -i 's\|\<ClusterIndex\>\|DepGraphIndex\|g' $(git grep -l 'ClusterIndex') -END VERIFY SCRIPT-	2025-03-24 09:34:54 -04:00
Pieter Wuille	bfeb69f6e0	clusterlin: Make IsAcyclic() a DepGraph member function ... instead of being a separate test-only function. Also add a fuzz test for it returning false.	2025-03-24 09:34:54 -04:00
Pieter Wuille	0aa874a357	clusterlin: Add FixLinearization function + fuzz test This function takes an existing ordering for transactions in a DepGraph, and makes it a valid linearization for it (i.e., topological). Any topological prefix of the input remains untouched.	2025-03-24 09:34:54 -04:00
Pieter Wuille	1c24c62510	clusterlin: merge two DepGraph fuzz tests into simulation test This combines the clusterlin_add_dependency and clusterlin_cluster_serialization fuzz tests into a single clusterlin_depgraph_sim fuzz test. This tests starts from an empty DepGraph and performs a arbitrary number of AddTransaction, AddDependencies, and RemoveTransactions operations on it, and compares the resulting state with a naive reimplementation.	2024-10-07 13:49:36 -04:00
Pieter Wuille	0606e66fdb	clusterlin: add DepGraph::RemoveTransactions and support for holes in DepGraph This commits introduces support in DepGraph for the transaction positions to be non-continuous. Specifically, it adds: * DepGraph::RemoveTransactions which removes 0 or more positions from a DepGraph. * DepGraph::Positions() to get a set of which positions are in use. * DepGraph::PositionRange() to get the highest used position in a DepGraph + 1. In addition, it extends the DepGraphFormatter format to support holes in a compatible way (it serializes non-holey DepGraphs identically to the old code, and deserializes them the same way)	2024-10-07 13:49:35 -04:00
Pieter Wuille	75b5d42419	clusterlin: make DepGraph::AddDependency support multiple dependencies at once This changes DepGraph::AddDependency into DepGraph::AddDependencies, which takes in a single child, but a set of parent transactions, making them all dependencies at once. This is important for performance. N transactions can have O(N^2) parents combined, so constructing a full DepGraph using just AddDependency (which is O(N) on its own) could take O(N^3) time, while doing the same with AddDependencies (also O(N) on its own) only takes O(N^2). Notably, this matters for DepGraphFormatter::Unser, which goes from O(N^3) to O(N^2). Co-Authored-By: Greg Sanders <gsanders87@gmail.com>	2024-10-07 13:47:52 -04:00
Pieter Wuille	5901cf7100	clusterlin: abstract out DepGraph::GetReduced{Parents,Children} A fuzz test already relies on these operations, and a future commit will need the same logic too. Therefore, abstract them out into proper member functions, with proper testing.	2024-10-07 13:46:48 -04:00
Pieter Wuille	9ad2fe7e69	clusterlin: only start/use search when enough iterations left	2024-09-12 15:15:36 -04:00
Pieter Wuille	71f2629398	clusterlin: include topological pot subsets automatically (optimization) Automatically add topologically-valid subsets of the potential set pot to inc. It can be proven that these must be part of the best reachable topologically-valid set from that work item. This is a crucial optimization that (apparently) reduces the maximum number of iterations from ~2^(N-1) to ~sqrt(2^N). Co-Authored-By: Suhas Daftuar <sdaftuar@gmail.com>	2024-09-12 15:15:36 -04:00
Pieter Wuille	2965fbf203	clusterlin: track upper bound potential set for work items (optimization) In each work item, keep track of a conservative overestimate of the best possible feerate that can be reached from it, and then use these to avoid exploring hopeless work items.	2024-09-12 15:15:36 -04:00
Pieter Wuille	85a285a306	clusterlin: separate initial search entries per component (optimization) Before this commit, the worst case for linearization involves clusters which break apart in several smaller components after the first candidate is included in the output linearization. Address this by never considering work items that span multiple components of what remains of the cluster.	2024-09-12 15:15:36 -04:00
Pieter Wuille	04d7a04ea4	clusterlin: add MergeLinearizations function + fuzz test + benchmark	2024-08-01 16:03:34 -04:00
Pieter Wuille	4f8958d756	clusterlin: add PostLinearize + benchmarks + fuzz tests	2024-08-01 16:02:09 -04:00
Pieter Wuille	0e2812d293	clusterlin: add algorithms for connectedness/connected components Add utility functions to DepGraph for finding connected components.	2024-08-01 15:43:59 -04:00
Pieter Wuille	0e52728a2d	clusterlin: rename Intersect -> IntersectPrefixes This makes it clearer what the function does.	2024-08-01 14:07:54 -04:00
Pieter Wuille	28549791b3	clusterlin: permit passing in existing linearization to Linearize This implements the LIMO algorithm for linearizing by improving an existing linearization. See https://delvingbitcoin.org/t/limo-combining-the-best-parts-of-linearization-search-and-merging for details.	2024-07-25 10:16:40 -04:00
Pieter Wuille	97d98718b0	clusterlin: add LinearizationChunking class It encapsulates a given linearization in chunked form, permitting arbitrary subsets of transactions to be removed from the linearization. Its purpose is adding the Intersect function, which is a crucial operation that will be used in a further commit to make Linearize improve existing linearizations.	2024-07-25 10:16:40 -04:00
Pieter Wuille	d5918dc3c6	clusterlin: randomize the SearchCandidateFinder search order To make search non-deterministic, change the BFS logic from always picking the first queue item to randomly picking the first or second queue item.	2024-07-25 10:16:40 -04:00
Pieter Wuille	46aad9b099	clusterlin: add Linearize function This adds a first version of the overall linearization interface, which given a DepGraph constructs a good linearization, by incrementally including good candidate sets (found using AncestorCandidateFinder and SearchCandidateFinder).	2024-07-25 10:16:37 -04:00
Pieter Wuille	ee0ddfe4f6	clusterlin: add chunking algorithm A fuzz test is added which verifies various of its expected properties, including correctness	2024-07-25 10:16:37 -04:00
Pieter Wuille	2a41f151af	clusterlin: add SearchCandidateFinder class Similar to AncestorCandidateFinder, this encapsulates the state needed for finding good candidate sets using a search algorithm.	2024-07-25 10:16:37 -04:00
Pieter Wuille	4828079db3	clusterlin: add AncestorCandidateFinder class This is a class that encapsulates precomputed ancestor set feerates, and presents an interface for getting the best remaining ancestor set.	2024-07-25 10:16:37 -04:00
Pieter Wuille	58f7e01db4	tests: framework for testing DepGraph class This introduces a bespoke fuzzing-focused serialization format for DepGraphs, and then tests that this format can represent any graph, roundtrips, and then uses that to test the correctness of DepGraph itself. This forms the basis for future fuzz tests that need to work with interesting graphs.	2024-07-25 10:16:37 -04:00

41 Commits