
Avoid parsing PublicKeys when handling RGS updates #3581

Merged — 4 commits, Feb 4, 2025

Conversation

TheBlueMatt (Collaborator)

The first commit fixes a regression in 0.1 and should be backported, the second is just an optimization but shouldn't be backported as it changes the public API.

arik-so previously approved these changes Jan 31, 2025
TheBlueMatt (Collaborator, Author)

Oh, the new commit improves the bench by some 25% or so, too.

tnull (Contributor) left a comment

LGTM, could just use some more verbose comments/log messages.

tnull (Contributor) commented Jan 31, 2025

That said, CI is sad, as the PublicKey-to-NodeId change broke something.

tnull (Contributor) commented Feb 1, 2025

Feel free to squash.

shaavan (Member) left a comment

Overall, LGTM!

Since there aren’t any direct code references explaining why we switch to using NodeId in RGS, it might be helpful to add a quick note in the commit message (of ddc3326) on why skipping PublicKey verification is safe in this case.
I think adding a little extra context could make it clearer for future readers!

`PublicKey` parsing is relatively expensive as we have to check if
the point is actually on the curve. To avoid it, our `NetworkGraph`
uses `NodeId`s which don't have the validity requirement.

Sadly, we were always parsing the broadcasting node's `PublicKey`
from the `node_id` in the network graph whenever we see an update
for that channel, whether we have a corresponding signature or not.

Here we fix this, only parsing the public key (and hashing the
message) if we're going to check a signature.
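The fix described above can be sketched in self-contained Rust. Names like `expensive_parse` and `handle_channel_update` are illustrative stand-ins, not LDK's actual API; the point is that the expensive key parsing only happens on the signature-checking path:

```rust
/// Raw serialized key bytes; unlike a `PublicKey`, never validated on construction.
pub struct NodeId(pub [u8; 33]);

// Stand-in for `PublicKey::from_slice` plus message hashing, which in the
// real code must check that the point lies on the secp256k1 curve.
fn expensive_parse(id: &NodeId) -> Result<[u8; 33], ()> {
    Ok(id.0)
}

/// Returns whether the expensive parse actually ran, purely for illustration.
pub fn handle_channel_update(id: &NodeId, signature: Option<&[u8]>) -> Result<bool, ()> {
    if let Some(_sig) = signature {
        // Only parse the broadcasting node's key (and hash the message)
        // when there is actually a signature to verify.
        let _pubkey = expensive_parse(id)?;
        // ... verify `_sig` against `_pubkey` here ...
        return Ok(true);
    }
    // Unsigned (e.g. RGS) updates are applied using the raw `NodeId` alone.
    Ok(false)
}

fn main() {
    let id = NodeId([2u8; 33]);
    assert_eq!(handle_channel_update(&id, None), Ok(false));
    assert_eq!(handle_channel_update(&id, Some(&[0u8; 64])), Ok(true));
}
```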

`PublicKey` parsing is relatively expensive as we have to check if
the point is actually on the curve. To avoid it, our `NetworkGraph`
uses `NodeId`s which don't have the validity requirement.

Here, we take advantage of that in RGS application to avoid parsing
`PublicKey`s, improving performance.
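As a sketch of that idea (illustrative types only, not LDK's real graph structures): keying the graph by the raw serialized bytes means lookups and inserts during RGS application hash and compare bytes directly, never paying the on-curve check a `PublicKey` requires.

```rust
use std::collections::HashMap;

/// Raw serialized key bytes used directly as a map key; no curve validation.
#[derive(Clone, PartialEq, Eq, Hash)]
pub struct NodeId(pub [u8; 33]);

/// Record a channel (by short channel id) under a node. Note there is no
/// `PublicKey::from_slice` anywhere on this path.
pub fn apply_rgs_update(
    nodes: &mut HashMap<NodeId, Vec<u64>>,
    node_id: NodeId,
    short_channel_id: u64,
) {
    nodes.entry(node_id).or_default().push(short_channel_id);
}

fn main() {
    let mut nodes = HashMap::new();
    let id = NodeId([2u8; 33]);
    apply_rgs_update(&mut nodes, id.clone(), 42);
    assert_eq!(nodes[&id], vec![42]);
}
```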
When we build a new `NetworkGraph` from empty, we're generally
doing an initial startup and will be syncing the graph very soon.
Using an initially-empty `IndexedMap` for the `channels` and
`nodes` results in quite some memory churn, with the initial RGS
application benchmark showing 15% of its time in pagefault handling
alone (i.e. allocating new memory from the OS, let alone the 23%
of time in `memmove`).

Further, when deserializing a `NetworkGraph`, we'd swapped the
expected node and channel count constants, leaving the node map
too small and causing map doubling as we read entries from disk.

Finally, when deserializing, allocating only exactly the amount of
map entries we need is likely to lead to at least one doubling, so
we're better off just over-estimating the number of nodes and
channels and allocating what we want.

Here we just always allocate `channels` and `nodes` based on
constants, leading to a 20%-ish speedup in the initial RGS
application benchmark.
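The preallocation strategy can be illustrated with std maps (the constant names and values here are made up; LDK's real constants and `IndexedMap` differ). The key points from the commit: size each map from its own constant (not the other map's, which was the swapped-constants bug), and over-estimate so initial sync never triggers repeated doubling:

```rust
use std::collections::HashMap;

// Illustrative over-estimates; the real LDK constants differ.
const EXPECTED_NODE_COUNT: usize = 100_000;
const EXPECTED_CHANNEL_COUNT: usize = 125_000;

/// Build the graph maps pre-sized from constants, whether the graph starts
/// empty or is being deserialized, so inserts don't churn memory.
pub fn new_graph_maps() -> (HashMap<u64, ()>, HashMap<[u8; 33], ()>) {
    // Channel map gets the channel constant, node map the node constant.
    let channels: HashMap<u64, ()> = HashMap::with_capacity(EXPECTED_CHANNEL_COUNT);
    let nodes: HashMap<[u8; 33], ()> = HashMap::with_capacity(EXPECTED_NODE_COUNT);
    (channels, nodes)
}

fn main() {
    let (channels, nodes) = new_graph_maps();
    // `with_capacity` guarantees at least the requested capacity, so
    // filling up to these counts causes no reallocation or memmove.
    assert!(channels.capacity() >= EXPECTED_CHANNEL_COUNT);
    assert!(nodes.capacity() >= EXPECTED_NODE_COUNT);
}
```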

TheBlueMatt (Collaborator, Author)

Squashed with one extra commit at the top that just adds more detail to the NodeId docs to describe why it exists.

tnull (Contributor) commented Feb 3, 2025

Mhh, seems fuzz is failing. Not sure if related, but it blocks merging.

TheBlueMatt (Collaborator, Author)

Hmm, it's possible this caused performance of some fuzz target to regress, but I think it's more likely GitHub just decided to run it on a slower machine or so, so I'm gonna disable the fuzz-required check and merge.

TheBlueMatt merged commit be93dc3 into lightningdevkit:main on Feb 4, 2025
23 of 25 checks passed
TheBlueMatt (Collaborator, Author)

Backported the first (fixing the perf regression) and third (more correctly reserving the graph structures for a large performance win) commits in #3613
