bitcoin

mirror of https://github.com/bitcoin/bitcoin.git synced 2026-01-21 07:39:08 +01:00

Author	SHA1	Message	Date
Pieter Wuille	1808b5aaf7	clusterlin: remove unused FixLinearization (cleanup)	2026-01-05 11:48:34 -05:00
Pieter Wuille	01ffcf464a	clusterlin: support fixing linearizations (feature) This also updates FixLinearization to just be a thin wrapper around Linearize. In a future commit, FixLinearization will be removed entirely.	2026-01-05 11:48:16 -05:00
bensig	08ed802bab	doc: fix double-word typos in comments	2025-12-30 12:12:26 -08:00
Pieter Wuille	75bdb925f4	clusterlin: drop support for improvable chunking (simplification) With MergeLinearizations() gone and the LIMO-based Linearize() replaced by SFL, we do not need a class (LinearizationChunking) that can maintain an incrementally-improving chunk set anymore. Replace it with a function (ChunkLinearizationInfo) that just computes the chunks as SetInfos once, and returns them as a vector. This simplifies several call sites too.	2025-12-18 16:01:31 -05:00
Pieter Wuille	91399a7912	clusterlin: remove unused MergeLinearizations (cleanup) This ended up never being used in txgraph.	2025-12-18 16:01:31 -05:00
Pieter Wuille	5ce2800745	clusterlin: randomize equal-feerate parts of linearization (privacy) This places equal-feerate chunks (with no dependencies between them) in random order in the linearization output, hiding information about DepGraph insertion order from the output. Likewise, it randomizes the order of transactions within chunks for the same reason.	2025-12-18 16:01:31 -05:00
Pieter Wuille	13aad26b78	clusterlin: randomize various decisions in SFL (feature) This introduces a local RNG inside the SFL state, which is used to randomize various decisions inside the algorithm, in order to make it hard to create pathological clusters which predictably have bad performance. The decisions being randomized are: * When deciding what chunk to attempt to split, the queue order is randomized. * When deciding which dependency to split on, a uniformly random one is chosen among those with higher top feerate than bottom feerate within the chosen chunk. * When deciding which chunks to merge, a uniformly random one among those with the higher feerate difference is picked. * When merging two chunks, a uniformly random dependency between them is now activated. * When making the state topological, the queue of chunks to process is randomized.	2025-12-18 16:01:31 -05:00
Pieter Wuille	ddbfa4dfac	clusterlin: keep FIFO queue of improvable chunks (preparation) This introduces a queue of chunks that still need processing, in both MakeTopological() and OptimizationStep(). This is simultaneously: * A preparation for introducing randomization, by allowing permuting the queue. * An improvement to the fairness of suboptimal solutions, by distributing the work more fairly over chunks. * An optimization, by avoiding retrying chunks over and over again which are already known to be optimal.	2025-12-18 16:01:31 -05:00
Pieter Wuille	3efc94d656	clusterlin: replace cluster linearization with SFL (feature) This replaces the existing LIMO linearization algorithm (which internally uses ancestor set finding and candidate set finding) with the much more performant spanning-forest linearization algorithm. This removes the old candidate-set search algorithm, and several of its tests, benchmarks, and needed utility code. The worst case time per cost is similar to the previous algorithm, so ACCEPTABLE_ITERS is unchanged.	2025-12-18 16:01:31 -05:00
Pieter Wuille	6a8fa821b8	clusterlin: add support for loading existing linearization (feature)	2025-12-18 16:01:22 -05:00
Pieter Wuille	c461259fb6	clusterlin: add class implementing SFL state (preparation) This adds a data structure representing the optimization state for the spanning-forest linearization algorithm (SFL), plus a fuzz test for its correctness. This is preparation for switching over Linearize() to use this algorithm. See https://delvingbitcoin.org/t/spanning-forest-cluster-linearization/1419 for a description of the algorithm.	2025-12-18 15:49:01 -05:00
Pieter Wuille	95bfe7d574	clusterlin: replace benchmarks with SFL-hard ones (bench) This also adds a per-cost variant of each.	2025-12-18 14:17:28 -05:00
Pieter Wuille	bb5cb222ae	depgraph: add memory usage control (feature) Co-Authored-By: Lőrinc <pap.lorinc@gmail.com>	2025-10-11 17:25:09 -04:00
Pieter Wuille	cfe9958852	txgraph: track amount of work done in linearization (preparation)	2025-07-14 09:41:17 -04:00
MarcoFalke	fa9ca13f35	refactor: Sort includes of touched source files	2025-06-03 19:56:55 +02:00
MarcoFalke	fae71d30f7	clang-tidy: Apply modernize-deprecated-headers This can be reproduced according to the developer notes with something like ( cd ./src/ && ../contrib/devtools/run-clang-tidy.py -p ../bld-cmake -fix -j $(nproc) ) Also, the header related changes were done manually.	2025-06-03 15:13:54 +02:00
Pieter Wuille	a52b53926b	clusterlin: add GetConnectedComponent This abstracts out the finding of the connected component that includes a given element from FindConnectedComponent (which just finds any connected component). Use this in the txgraph fuzz test, which was effectively reimplementing this logic. At the same time, improve its performance by replacing a vector with a set.	2025-03-27 15:48:44 -04:00
Pieter Wuille	c7d5dcaa61	clusterlin: fix typos	2025-03-27 12:41:24 -04:00
Pieter Wuille	d449773899	scripted-diff: (refactor) ClusterIndex -> DepGraphIndex Since cluster_linearize.h does not actually have a Cluster type anymore, it is more appropriate to rename the index type to DepGraphIndex. -BEGIN VERIFY SCRIPT- sed -i 's/Data type to represent transaction indices in clusters./Data type to represent transaction indices in DepGraphs and the clusters they represent./' $(git grep -l 'using ClusterIndex') sed -i 's\|\<ClusterIndex\>\|DepGraphIndex\|g' $(git grep -l 'ClusterIndex') -END VERIFY SCRIPT-	2025-03-24 09:34:54 -04:00
Pieter Wuille	bfeb69f6e0	clusterlin: Make IsAcyclic() a DepGraph member function ... instead of being a separate test-only function. Also add a fuzz test for it returning false.	2025-03-24 09:34:54 -04:00
Pieter Wuille	0aa874a357	clusterlin: Add FixLinearization function + fuzz test This function takes an existing ordering for transactions in a DepGraph, and makes it a valid linearization for it (i.e., topological). Any topological prefix of the input remains untouched.	2025-03-24 09:34:54 -04:00
MarcoFalke	fa0c6b7179	refactor: Remove unused Span alias Also, fixup some wording.	2025-03-12 19:45:49 +01:00
MarcoFalke	fade0b5e5e	scripted-diff: Use std::span over Span -BEGIN VERIFY SCRIPT- ren() { sed -i "s!\<$1\>!$2!g" $( git grep -l "$1" -- "./src" ":(exclude)src/span.h" ":(exclude)src/leveldb/db/log_test.cc" ) ; } ren Span std::span ren AsBytes std::as_bytes ren AsWritableBytes std::as_writable_bytes sed -i 's!SpanPopBack(Span!SpanPopBack(std::span!g' ./src/span.h -END VERIFY SCRIPT-	2025-03-12 19:45:37 +01:00
fanquake	726cbee955	doc: correct typos	2024-11-11 14:14:39 +00:00
Pieter Wuille	0b3ec8c59b	clusterlin: remove Cluster type	2024-10-07 13:49:36 -04:00
Pieter Wuille	0606e66fdb	clusterlin: add DepGraph::RemoveTransactions and support for holes in DepGraph This commits introduces support in DepGraph for the transaction positions to be non-continuous. Specifically, it adds: * DepGraph::RemoveTransactions which removes 0 or more positions from a DepGraph. * DepGraph::Positions() to get a set of which positions are in use. * DepGraph::PositionRange() to get the highest used position in a DepGraph + 1. In addition, it extends the DepGraphFormatter format to support holes in a compatible way (it serializes non-holey DepGraphs identically to the old code, and deserializes them the same way)	2024-10-07 13:49:35 -04:00
Pieter Wuille	75b5d42419	clusterlin: make DepGraph::AddDependency support multiple dependencies at once This changes DepGraph::AddDependency into DepGraph::AddDependencies, which takes in a single child, but a set of parent transactions, making them all dependencies at once. This is important for performance. N transactions can have O(N^2) parents combined, so constructing a full DepGraph using just AddDependency (which is O(N) on its own) could take O(N^3) time, while doing the same with AddDependencies (also O(N) on its own) only takes O(N^2). Notably, this matters for DepGraphFormatter::Unser, which goes from O(N^3) to O(N^2). Co-Authored-By: Greg Sanders <gsanders87@gmail.com>	2024-10-07 13:47:52 -04:00
Pieter Wuille	5901cf7100	clusterlin: abstract out DepGraph::GetReduced{Parents,Children} A fuzz test already relies on these operations, and a future commit will need the same logic too. Therefore, abstract them out into proper member functions, with proper testing.	2024-10-07 13:46:48 -04:00
Pieter Wuille	9ad2fe7e69	clusterlin: only start/use search when enough iterations left	2024-09-12 15:15:36 -04:00
Pieter Wuille	bd044356ed	clusterlin: improve heuristic to decide split transaction (optimization) Empirically, this approach seems to be more efficient in common real-life clusters, and does not change the worst case. Co-Authored-By: Suhas Daftuar <sdaftuar@gmail.com>	2024-09-12 15:15:36 -04:00
Pieter Wuille	71f2629398	clusterlin: include topological pot subsets automatically (optimization) Automatically add topologically-valid subsets of the potential set pot to inc. It can be proven that these must be part of the best reachable topologically-valid set from that work item. This is a crucial optimization that (apparently) reduces the maximum number of iterations from ~2^(N-1) to ~sqrt(2^N). Co-Authored-By: Suhas Daftuar <sdaftuar@gmail.com>	2024-09-12 15:15:36 -04:00
Pieter Wuille	e20fda77a2	clusterlin: reduce computation of unnecessary pot sets (optimization) Keep track of which transactions in the graph have an individual feerate that is better than the best included set so far. Others do not need to be added to the pot set, as they cannot possibly help beating best.	2024-09-12 15:15:36 -04:00
Pieter Wuille	2965fbf203	clusterlin: track upper bound potential set for work items (optimization) In each work item, keep track of a conservative overestimate of the best possible feerate that can be reached from it, and then use these to avoid exploring hopeless work items.	2024-09-12 15:15:36 -04:00
Pieter Wuille	9e43e4ce10	clusterlin: use feerate-sorted depgraph in SearchCandidateFinder This is a requirement for a future commit, which will rely on quickly iterating over transaction sets in decreasing individual feerate order.	2024-09-12 15:15:36 -04:00
Pieter Wuille	b80e6dfe78	clusterlin: add reordering support for DepGraph Add a DepGraph(depgraph, reordering) function that constructs a new DepGraph corresponding to an old one, but with its transactions is a modified order (given as a vector from old to new positions). Also use this reordering feature inside DepGraphFormatter::Unser, which needs a small modification so that its reordering mapping is old-to-new (rather than the new-to-old it used before).	2024-09-12 15:15:36 -04:00
Pieter Wuille	85a285a306	clusterlin: separate initial search entries per component (optimization) Before this commit, the worst case for linearization involves clusters which break apart in several smaller components after the first candidate is included in the output linearization. Address this by never considering work items that span multiple components of what remains of the cluster.	2024-09-12 15:15:36 -04:00
Pieter Wuille	bbcee5a0d6	clusterlin: improve rechunking in LinearizationChunking (optimization) When the transactions being marked done exactly match the first chunk of what remains of the linearization, we can just remember to skip that chunk instead of computing a full rechunking. Further, chop off prefixes of the input linearization that are already done, so they don't need to be reconsidered for further rechunkings.	2024-08-01 16:03:38 -04:00
Pieter Wuille	04d7a04ea4	clusterlin: add MergeLinearizations function + fuzz test + benchmark	2024-08-01 16:03:34 -04:00
Pieter Wuille	4f8958d756	clusterlin: add PostLinearize + benchmarks + fuzz tests	2024-08-01 16:02:09 -04:00
Pieter Wuille	0e2812d293	clusterlin: add algorithms for connectedness/connected components Add utility functions to DepGraph for finding connected components.	2024-08-01 15:43:59 -04:00
Pieter Wuille	0e52728a2d	clusterlin: rename Intersect -> IntersectPrefixes This makes it clearer what the function does.	2024-08-01 14:07:54 -04:00
Pieter Wuille	28549791b3	clusterlin: permit passing in existing linearization to Linearize This implements the LIMO algorithm for linearizing by improving an existing linearization. See https://delvingbitcoin.org/t/limo-combining-the-best-parts-of-linearization-search-and-merging for details.	2024-07-25 10:16:40 -04:00
Pieter Wuille	97d98718b0	clusterlin: add LinearizationChunking class It encapsulates a given linearization in chunked form, permitting arbitrary subsets of transactions to be removed from the linearization. Its purpose is adding the Intersect function, which is a crucial operation that will be used in a further commit to make Linearize improve existing linearizations.	2024-07-25 10:16:40 -04:00
Pieter Wuille	d5918dc3c6	clusterlin: randomize the SearchCandidateFinder search order To make search non-deterministic, change the BFS logic from always picking the first queue item to randomly picking the first or second queue item.	2024-07-25 10:16:40 -04:00
Pieter Wuille	991ff9a9a4	clusterlin: use bounded BFS exploration (optimization) Switch to BFS exploration of the search tree in SearchCandidateFinder instead of DFS exploration. This appears to behave better for real world clusters. As BFS has the downside of needing far larger search queues, switch back to DFS temporarily when the queue grows too large.	2024-07-25 10:16:40 -04:00
Pieter Wuille	46aad9b099	clusterlin: add Linearize function This adds a first version of the overall linearization interface, which given a DepGraph constructs a good linearization, by incrementally including good candidate sets (found using AncestorCandidateFinder and SearchCandidateFinder).	2024-07-25 10:16:37 -04:00
Pieter Wuille	ee0ddfe4f6	clusterlin: add chunking algorithm A fuzz test is added which verifies various of its expected properties, including correctness	2024-07-25 10:16:37 -04:00
Pieter Wuille	2a41f151af	clusterlin: add SearchCandidateFinder class Similar to AncestorCandidateFinder, this encapsulates the state needed for finding good candidate sets using a search algorithm.	2024-07-25 10:16:37 -04:00
Pieter Wuille	4828079db3	clusterlin: add AncestorCandidateFinder class This is a class that encapsulates precomputed ancestor set feerates, and presents an interface for getting the best remaining ancestor set.	2024-07-25 10:16:37 -04:00
Pieter Wuille	a6e07e769a	clusterlin: introduce cluster_linearize.h with Cluster and DepGraph types This primarily adds the DepGraph class, which encapsulates precomputed ancestor/descendant information for a given transaction cluster, with a number of utility features (inspectors for set feerates, computing reduced parents/children, adding transactions, adding dependencies), which will become needed in future commits.	2024-07-25 10:16:37 -04:00

50 Commits