This commit fixes a critical bug where the channelGraphSyncer goroutine
would enter an endless loop when context cancellation or peer disconnect
errors occurred during the syncingChans or queryNewChannels states.
The root cause was that state handler functions (handleSyncingChans and
synchronizeChanIDs) did not return errors to the main goroutine loop.
When these functions encountered fatal errors like context cancellation,
they would log the error and return early without changing the syncer's
state. This caused the main loop to immediately re-enter the same state
handler, encounter the same error, and loop indefinitely while spamming
error logs.
The fix makes error handling explicit by having state handlers return
errors. The main channelGraphSyncer loop now checks these errors and
exits cleanly when fatal errors occur. We return any error (not just
context cancellation) because fatal errors can manifest in multiple
forms: context.Canceled, ErrGossipSyncerExiting from the rate limiter,
lnpeer.ErrPeerExiting from Brontide, or network errors like connection
closed. This approach matches the error handling pattern already used in
other goroutines like replyHandler.
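As a rough sketch of the pattern (with simplified state and handler names, not the exact lnd identifiers), the loop now looks something like this:

```
package sketch

import (
	"context"
	"log"
)

type syncerState int

const (
	syncingChans syncerState = iota
	waitingQueryRangeReply
)

type gossipSyncer struct {
	state syncerState
}

// handleSyncingChans now returns fatal errors (context cancellation, peer
// exiting, network errors) instead of only logging them.
func (g *gossipSyncer) handleSyncingChans(ctx context.Context) error {
	// ... send QueryChannelRange, etc. (elided).
	return ctx.Err()
}

// channelGraphSyncer is the main state-machine loop. Any error from a state
// handler terminates the goroutine rather than letting it re-enter the same
// failing state forever.
func (g *gossipSyncer) channelGraphSyncer(ctx context.Context) {
	for {
		switch g.state {
		case syncingChans:
			if err := g.handleSyncingChans(ctx); err != nil {
				log.Printf("gossip syncer exiting: %v", err)
				return
			}
			g.state = waitingQueryRangeReply

		case waitingQueryRangeReply:
			// ... remaining states elided.
			return
		}
	}
}
```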
In preparation for adding a NodeAnnouncement2 struct along with a
NodeAnnouncement interface, this commit renames the existing
NodeAnnouncement struct to NodeAnnouncement1.
In this commit, we update ApplyGossipFilter to leverage the new
iterator-based UpdatesInHorizon method. The key innovation here is using
iter.Pull2 to create a pull-based iterator that allows us to check if
any updates exist before launching the background goroutine.
This approach provides several benefits over the previous implementation.
First, we avoid the overhead of launching a goroutine when there are no
updates to send, which was previously unavoidable without materializing
the entire result set. Second, we maintain lazy loading throughout the
sending process, only pulling messages from the database as they're
needed for transmission.
The implementation uses Pull2 to peek at the first message, determining
whether to proceed with sending updates. If updates exist, ownership of
the iterator is transferred to the goroutine, which continues pulling
and sending messages until exhausted. This design ensures memory usage
remains bounded regardless of the number of updates being synchronized.
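A hedged sketch of the peek-then-hand-off shape (not the exact lnd code; the updates iterator and send helper here are illustrative stand-ins):

```
package sketch

import "iter"

type wireMsg interface{}

// applyGossipFilter peeks at the first update via a pull iterator. If no
// updates exist, we return without launching a goroutine; otherwise
// ownership of the iterator moves to the goroutine, which keeps pulling
// lazily until the sequence is exhausted.
func applyGossipFilter(
	updates iter.Seq2[wireMsg, error],
	send func(wireMsg) error) error {

	next, stop := iter.Pull2(updates)

	firstMsg, err, ok := next()
	if !ok {
		// No updates in the horizon: nothing to send.
		stop()
		return nil
	}
	if err != nil {
		stop()
		return err
	}

	go func() {
		defer stop()

		if send(firstMsg) != nil {
			return
		}
		for {
			msg, err, ok := next()
			if !ok || err != nil {
				return
			}
			if send(msg) != nil {
				return
			}
		}
	}()

	return nil
}
```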
In this commit, we complete the iterator conversion work started in PR
10128 by threading the iterator pattern through to the higher-level
UpdatesInHorizon method. This change converts the method from returning
a fully materialized slice of messages to returning a lazy iterator that
yields messages on demand.
The new signature uses iter.Seq2 to allow error propagation during
iteration, eliminating the need for a separate error return value. This
approach enables callers to handle errors as they occur during iteration
rather than failing upfront.
The implementation now lazily processes channel and node updates,
yielding them as they're generated rather than accumulating them in
memory. This maintains the same ordering guarantees (channels before
nodes) while significantly reducing memory pressure when dealing with
large update sets during gossip synchronization.
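A hedged sketch of the new shape and how a caller consumes it (the real method lives on lnd's graph layer and takes its horizon from the remote peer's gossip filter; the types here are simplified):

```
package sketch

import (
	"iter"
	"time"
)

type wireMsg interface{}

// UpdatesInHorizon lazily yields channel updates first and node
// announcements second, surfacing errors through the second element of the
// pair instead of a separate return value.
func UpdatesInHorizon(start, end time.Time) iter.Seq2[wireMsg, error] {
	return func(yield func(wireMsg, error) bool) {
		// 1. Stream channel updates within [start, end] (elided).
		// 2. Then stream node announcements within [start, end].
		// Each item is yielded as it is read, so the full result set
		// is never materialized in memory.
	}
}

// A caller handles errors as they occur during iteration:
func consume(send func(wireMsg) error, start, end time.Time) error {
	for msg, err := range UpdatesInHorizon(start, end) {
		if err != nil {
			return err
		}
		if err := send(msg); err != nil {
			return err
		}
	}
	return nil
}
```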
In this commit, we add a new atomic bool to only permit a single gossip
backlog goroutine per peer. If we get a new request that needs a backlog
while we're still processing the other, then we'll drop that request.
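A minimal sketch of the single-backlog guard (field and method names are illustrative rather than the exact lnd ones):

```
package sketch

import "sync/atomic"

type syncer struct {
	backlogActive atomic.Bool
}

// maybeStartBacklog starts at most one backlog goroutine at a time; if one
// is already running, the new request is dropped and false is returned.
func (s *syncer) maybeStartBacklog(run func()) bool {
	if !s.backlogActive.CompareAndSwap(false, true) {
		// A backlog sync is already in flight; drop this request.
		return false
	}

	go func() {
		defer s.backlogActive.Store(false)
		run()
	}()

	return true
}
```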
In this commit, we introduce an asynchronous processing queue for
GossipTimestampRange messages in the GossipSyncer. This change addresses
a critical issue where the gossiper could block indefinitely when
processing timestamp range messages during periods of high load.
Previously, when a peer sent a GossipTimestampRange message, the
gossiper would synchronously call ApplyGossipFilter, which could block
on semaphore acquisition, database queries, and rate limiting. This
synchronous processing created a bottleneck where the entire peer
message processing pipeline would stall, potentially causing timeouts
and disconnections.
The new design adds a timestampRangeQueue channel with a capacity of 1
message and a dedicated goroutine for processing these messages
asynchronously. This follows the established pattern used for other
message types in the syncer. When the queue is full, we drop messages
and log a warning rather than blocking indefinitely, providing graceful
degradation under extreme load conditions.
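A sketch of the queue-and-drop pattern (handler and field names are illustrative):

```
package sketch

import "log"

type gossipTimestampRange struct{ firstTimestamp, timestampRange uint32 }

type syncer struct {
	timestampRangeQueue chan *gossipTimestampRange
	quit                chan struct{}
}

func newSyncer() *syncer {
	s := &syncer{
		// A capacity of 1 buffers a single pending filter update.
		timestampRangeQueue: make(chan *gossipTimestampRange, 1),
		quit:                make(chan struct{}),
	}
	go s.timestampRangeHandler()
	return s
}

// queueTimestampRange never blocks the peer's read loop: if the queue is
// already full, the message is dropped with a warning.
func (s *syncer) queueTimestampRange(msg *gossipTimestampRange) {
	select {
	case s.timestampRangeQueue <- msg:
	default:
		log.Println("WRN: dropping GossipTimestampRange, queue is full")
	}
}

// timestampRangeHandler applies the gossip filter asynchronously so slow
// semaphore, database, or rate-limiter work cannot stall the pipeline.
func (s *syncer) timestampRangeHandler() {
	for {
		select {
		case msg := <-s.timestampRangeQueue:
			_ = msg // ApplyGossipFilter(msg) in the real code.
		case <-s.quit:
			return
		}
	}
}
```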
For any method that takes a context and contains a select that listens on
the system's quit channel, we should also listen on the ctx. That way we
don't need to worry about whether the context is derived internally or
supplied externally.
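A minimal sketch of the pattern: a select that previously waited only on the quit channel now also honours the caller's context (the error name follows the one mentioned earlier in this series):

```
package sketch

import (
	"context"
	"errors"
)

var errGossipSyncerExiting = errors.New("gossip syncer exiting")

type syncer struct {
	quit chan struct{}
}

// waitUntilDone exits on either the caller's context or the subsystem quit
// channel, whichever fires first.
func (s *syncer) waitUntilDone(ctx context.Context) error {
	select {
	case <-ctx.Done():
		return ctx.Err()
	case <-s.quit:
		return errGossipSyncerExiting
	}
}
```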
The `GossipSyncer` makes various calls to the `ChannelGraphTimeSeries`
interface which threads through to the graph DB. So in preparation for
threading context through to all the methods on that interface, we
update the GossipSyncer accordingly by passing contexts through.
Two `context.TODO()`s are added in this commit. They will be removed in
the upcoming commits.
In this commit, we revamp the old message based rate limiting. First, we
move to meter by bytes/s instead of messages/s. The old logic had an
error in that it limited groups of message replies, instead of each
message. With this new approach, we'll use the newly added
SerializedSize method to implement fine grained bandwidth metering.
We need to pick two values: the burst rate and the msg bytes rate. The
burst rate is the maximum amount that can be sent in a given period of
time. We need to set this above 65 KB, the maximum message size, otherwise
a maximum-size message could never be sent. The bucket starts with this
many tokens (bytes).
As those are depleted, the amount of tokens is refilled at the msg
bytes rate.
As conservative values, we've chosen 200 KB as the burst rate, and 100
KB/s as the limit.
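A hedged sketch of byte-based metering using golang.org/x/time/rate with the values above; the real code derives the message size from the newly added SerializedSize method:

```
package sketch

import (
	"context"

	"golang.org/x/time/rate"
)

const (
	// The burst must exceed the ~65 KB maximum wire message size,
	// otherwise a maximum-size message could never be sent.
	byteBurst = 200 * 1024 // 200 KB

	// Tokens (bytes) are refilled at this rate once depleted.
	byteRate = 100 * 1024 // 100 KB/s
)

var msgBytesLimiter = rate.NewLimiter(rate.Limit(byteRate), byteBurst)

// waitForBandwidth blocks until msgSize byte tokens are available, or the
// context is cancelled.
func waitForBandwidth(ctx context.Context, msgSize int) error {
	return msgBytesLimiter.WaitN(ctx, msgSize)
}
```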
Here we introduce the access manager, which has caches that determine the
access control status of our peers. Peers whose funding transaction has
confirmed with us are protected. Peers that only have pending-open
channels with us have temporary access, which can be revoked. The rest of
the peers are granted restricted access.
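A simplified sketch of the three access levels described above (identifiers are illustrative, not the exact lnd names):

```
package sketch

type peerAccessStatus int

const (
	// peerStatusRestricted: peers with no channels to us get restricted
	// access by default.
	peerStatusRestricted peerAccessStatus = iota

	// peerStatusTemporary: peers with only pending-open channels; this
	// access can be revoked.
	peerStatusTemporary

	// peerStatusProtected: peers whose funding transaction has confirmed
	// with us.
	peerStatusProtected
)
```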
Previously, we would set the state of the syncer after sending the msg,
which has the following flow,
1. In state `queryNewChannels`, we send the msg `QueryShortChanIDs`.
2. Once the msg is sent, we change to state `waitingQueryChanReply`.
But there's no guarantee the remote won't reply back in between the two
steps. When that happens, our syncer would still be in state
`queryNewChannels`, causing the following error,
```
[ERR] DISC gossiper.go:873: Process query msg from peer [Alice] got unexpected msg *lnwire.ReplyShortChanIDsEnd received in state queryNewChannels
```
To fix it, we now make sure the state is updated before sending the msg.
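A minimal sketch of the reordering (identifiers are simplified stand-ins, not the exact lnd code):

```
package sketch

type syncerState int

const (
	queryNewChannels syncerState = iota
	waitingQueryChanReply
)

type queryShortChanIDs struct{ shortChanIDs []uint64 }

type syncer struct {
	state      syncerState
	sendToPeer func(msg any) error
}

// querySCIDs transitions to waitingQueryChanReply *before* sending, so a
// ReplyShortChanIDsEnd that races the send is still accepted.
func (g *syncer) querySCIDs(ids []uint64) error {
	g.state = waitingQueryChanReply
	return g.sendToPeer(&queryShortChanIDs{shortChanIDs: ids})
}
```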
The mocked peer used here blocks on `sendToPeer`, which is not the
behavior of `SendMessageLazy` on `lnpeer.Peer`. To reflect reality, we now
make sure `sendToPeer` is non-blocking in the tests.
This commit fixes the following race,
1. syncer(state=syncingChans) sends QueryChannelRange
2. remote peer replies ReplyChannelRange
3. ProcessQueryMsg fails to process the remote peer's msg as its state
is neither waitingQueryChanReply nor waitingQueryRangeReply.
4. syncer marks its new state waitingQueryRangeReply, but too late.
The historical sync will then fail, and the syncer will be stuck in this
state. What's worse, it cannot forward channel announcements to other
connected peers, as it will skip the broadcasting during the initial
graph sync.
This is now fixed to make sure the following two steps are atomic,
1. syncer(state=syncingChans) sends QueryChannelRange
2. syncer marks its new state waitingQueryRangeReply.
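A minimal sketch of making the send and the state transition atomic with respect to ProcessQueryMsg; the mutex-based guard here is illustrative:

```
package sketch

import "sync"

type syncerState int

const (
	syncingChans syncerState = iota
	waitingQueryRangeReply
)

type syncer struct {
	mu    sync.Mutex
	state syncerState
	send  func(msg any) error
}

// sendQueryChannelRange sends the query and marks the new state under one
// lock, so an incoming ReplyChannelRange can never be processed while the
// syncer still appears to be in syncingChans.
func (g *syncer) sendQueryChannelRange(query any) error {
	g.mu.Lock()
	defer g.mu.Unlock()

	if err := g.send(query); err != nil {
		return err
	}
	g.state = waitingQueryRangeReply
	return nil
}
```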
In preparation for adding the new ChannelAnnouncement2 message along
with a ChannelAnnouncement interface, we rename the existing message to
ChannelAnnouncement1.
This commit hooks up the banman to the gossiper:
- peers that are banned and don't have a channel with us will get
disconnected until they are unbanned.
- peers that are banned and have a channel with us won't get
disconnected, but we will ignore their channel announcements until
they are no longer banned. Note that this only disables gossip of
announcements to us and still allows us to open channels to them.
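A simplified sketch of the gossiper-side policy described above (helper names are illustrative):

```
package sketch

type remotePeer struct {
	banned     bool
	hasChannel bool
}

// shouldDisconnect: banned peers that share no channel with us are
// disconnected until they are unbanned.
func shouldDisconnect(p remotePeer) bool {
	return p.banned && !p.hasChannel
}

// shouldIgnoreChanAnn: banned peers that do have a channel with us stay
// connected, but their channel announcements are ignored while the ban
// lasts. This only disables gossip towards us; we can still open channels
// to them.
func shouldIgnoreChanAnn(p remotePeer) bool {
	return p.banned && p.hasChannel
}
```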
ChanUpdate timestamps are now restricted so that they cannot be more than
two weeks in the future. Moreover, channels whose timestamps in the
ReplyChannelRange msg are both either too far in the past or too far in
the future are not queried. The unit tests are fixed accordingly.
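A minimal sketch of the two timestamp checks; the two-week bound follows the text, and the helper names are illustrative:

```
package sketch

import "time"

const maxFutureSkew = 14 * 24 * time.Hour // two weeks

// validUpdateTimestamp rejects ChanUpdate timestamps that lie more than
// two weeks in the future.
func validUpdateTimestamp(ts, now time.Time) bool {
	return !ts.After(now.Add(maxFutureSkew))
}

// shouldQueryChannel skips a channel from a ReplyChannelRange when both of
// its timestamps are either too far in the past or too far in the future.
func shouldQueryChannel(ts1, ts2, now time.Time, horizon time.Duration) bool {
	tooOld := func(t time.Time) bool { return t.Before(now.Add(-horizon)) }
	tooNew := func(t time.Time) bool { return t.After(now.Add(maxFutureSkew)) }

	if tooOld(ts1) && tooOld(ts2) {
		return false
	}
	if tooNew(ts1) && tooNew(ts2) {
		return false
	}
	return true
}
```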
In order to prep for allowing TLV extensions for the `ReplyChannelRange`
and `QueryChannelRange` messages, we'll need to remove the struct
embedding as is. If we don't remove this, then we'll attempt to decode
TLV extensions from both the embedded and outer struct.
All relevant call sites have been updated to reflect this minor change.
Rather than performing this call in the SyncManager, we give each
gossipSyncer the ability to mark the first sync completed. This permits
pinned syncers to contribute towards the rpc-level synced_to_graph
value, allowing the value to be true after the first pinned syncer or
regular syncer completes. Unlike regular syncers, pinned syncers can
proceed in parallel possibly decreasing the waiting time if consumers
rely on this field before proceeding to load their application.
A pinned syncer is an ActiveSyncer that is configured to always remain
active for the lifetime of the connection. Pinned syncers do not count
towards the total NumActiveSyncers, which are rotated periodically.
This feature allows nodes to more tightly synchronize their routing
tables by ensuring they are always receiving gossip from a distinguished
subset of peers.
Modifies syncer.replyChanRangeQuery method to use the LastBlockHeight
method on the query. LastBlockHeight safely calculates the ending
block height and prevents an overflow of start_block + num_blocks.
Prior to this change, query messages that had a start_block +
num_blocks that overflows uint32_max would return zero results in the
reply message.
Tests are added to cover the fix and ensure proper start and end values
are supplied to the channel graph filter.
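A hedged sketch of the overflow-safe calculation the text describes (the real method is defined on the lnwire query message; the types here are simplified):

```
package sketch

import "math"

type queryChannelRange struct {
	firstBlockHeight uint32
	numBlocks        uint32
}

// lastBlockHeight computes the final block covered by the query in uint64
// space and clamps at the uint32 maximum, so first_block + num_blocks can
// no longer wrap around and yield an empty reply.
func (q *queryChannelRange) lastBlockHeight() uint32 {
	end := uint64(q.firstBlockHeight) + uint64(q.numBlocks) - 1
	if end > math.MaxUint32 {
		return math.MaxUint32
	}
	return uint32(end)
}
```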
If the provided ChainHash in a QueryChannelRange message does not match
that of our current chain, then we should send a blank response, rather
than reply with channels for the wrong chain.
We move from our legacy way of interpreting ReplyChannelRange messages
which was incorrect. Previously, we'd rely on the Complete field of the
ReplyChannelRange message to determine when our peer had sent all of
their replies. Now, we properly adhere to the specification by
interpreting the block ranges of these messages as intended.
Due to the large number of nodes deployed with the previous method, we
still maintain and detect when we are communicating with them, such that
we are still able to sync with them for backwards compatibility.
In order to properly adhere to the spec, when handling a
QueryChannelRange message, we must reply with a series of
ReplyChannelRange messages, that when consumed together cover the
entirety of the block range requested.
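A hedged sketch of covering the requested span with several replies; the real code sizes each reply by how many short channel IDs fit in a single message, so the fixed blocksPerReply used here is purely illustrative:

```
package sketch

type replyChannelRange struct {
	firstBlockHeight uint32
	numBlocks        uint32
}

// coverQueryRange splits the queried span into replies whose block ranges,
// taken together, cover the entire [firstHeight, firstHeight+numBlocks)
// window requested by the peer.
func coverQueryRange(firstHeight, numBlocks,
	blocksPerReply uint32) []replyChannelRange {

	if blocksPerReply == 0 {
		blocksPerReply = 1
	}

	var replies []replyChannelRange
	for covered := uint32(0); covered < numBlocks; {
		span := blocksPerReply
		if numBlocks-covered < span {
			span = numBlocks - covered
		}
		replies = append(replies, replyChannelRange{
			firstBlockHeight: firstHeight + covered,
			numBlocks:        span,
		})
		covered += span
	}
	return replies
}
```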