lightning

Commit Graph

Author	SHA1	Message	Date
Rusty Russell	5292f11818	pytest: test (fail) that we don't repeat gossip back to the node we got it from Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	5 years ago
Christian Decker	8b8538024d	bitcoind: Defer initialization of filteredblock_call->result During sync it is highly likely that we can coalesce multiple calls and share results among them. We also report back failures for non-existing blocks early on, so we don't run into issues with blocks that our bitcoind doesn't have yet. Signed-off-by: Christian Decker <decker.christian@gmail.com>	5 years ago
Christian Decker	187e493ab8	gossip: Stop backfilling the future This was caused by us not checking against the max_blockheight, but rather the min_blockheight which can be negative with a newly created node. This is still safe since we check for duplicates anyway in `wallet_filteredblock_add`. Signed-off-by: Christian Decker <decker.christian@gmail.com>	5 years ago
Rusty Russell	944439853a	pytest: two tests for gossip of channels in as-yet-unknown blocks. Two tests which crash lightningd in different ways. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	5 years ago
Rusty Russell	bf3b77a947	Travis: skip testing VALGRIND=1 DEVELOPER=0, remove the slowest non-developer tests. I don't remember ever seeing a bug which only showed up in VALGRIND=1 with developer mode disabled, so don't test that, and spread out the other test more evenly. In addition, disable the worst-performing tests in DEVELOPER=0 mode. Here timings from my build machine: the worst 6 (- DEVELOPER=0 VALGRIND=0) with the same tests (+ DEVELOPER=1 VALGRIND=1) -452.42s call tests/test_pay.py::test_channel_spendable +87.69s call tests/test_pay.py::test_channel_spendable -335.66s call tests/test_gossip.py::test_gossip_store_compact_on_load +47.41s call tests/test_gossip.py::test_gossip_store_compact_on_load -332.07s call tests/test_connection.py::test_opening_tiny_channel +89.71s call tests/test_connection.py::test_opening_tiny_channel -331.97s call tests/test_pay.py::test_channel_spendable_large +56.23s call tests/test_pay.py::test_channel_spendable_large -305.28s call tests/test_invoices.py::test_invoice_routeboost +37.57s call tests/test_invoices.py::test_invoice_routeboost -284.28s call tests/test_plugin.py::test_htlc_accepted_hook_forward_restart +49.12s call tests/test_plugin.py::test_htlc_accepted_hook_forward_restart Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	5 years ago
ZmnSCPxj	3e74ca4b86	gossipd/routing.c: Correctly handle a duplicated entry in `exclude` of `getroute`.	5 years ago
ZmnSCPxj	a5fb37298c	tests/test_gossip.py: Add test to check that duplicated exclusions in `getroute` have no lasting effect.	5 years ago
Rusty Russell	54ce4ed1cf	pytest: fail tests if we get any LOG_BROKEN level messages, unless flagged. And clean up some dev ones which actually happen (mainly by calling channel_fail_permanent which logs UNUSUAL, rather than channel_internal_error which logs BROKEN). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	c303d7d534	gossipd: only do (automatic) store compaction at startup. Rewriting the gossip_store is much more trivial when we don't have any pointers into it, so add some simple offline compaction code and disable the automatic compaction code. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	c15d9ed37c	gossip_store: make copy of corrupt gossip_store on failure. This should help debugging vastly. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	8928f0b5f9	gossipd: remove gossip entirely if we hit a problem on load. The crashes in #2750 are mostly caused by us trying to partially truncate the store. The simplest fix for release is to discard the whole thing if we detect a problem. This is a workaround: it'd be far nicer to try to recover. Fixes: #2750 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	9bf0467967	pytest: fix test_gossip_store_load_no_channel_update It wasn't invalid due to a missing channel_update, but in fact was a bad checksum due to a cut & paste bug. Fix that, and assert it's not actually truncating. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	47b5f2e837	gossipd: truncate gossip_store.tmp for compaction. If something went wrong and there was an old one, we were appending to it! Reported-by: @SimonVrouwe Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	5e3690b3c5	gossipd: delete channel_amount from the store when we delete channel_announcement. Otherwise we slowly build up cruft: compaction simply moves them since they're not deleted. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	10c503b4b4	gossip_store: clean up a truncated store. We might have channel_announcements which have no channel_update: normally these don't get written into the store until there is one, but if the store was truncated it can happen. We then get upset on compaction, since we don't have an in-memory representation of the channel_announcement. Similarly, we leave the node_announcement pending until after that channel_announcement, leading to a similar case. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	adc52b6ee8	pytest: add test for dangling channel_announcement/node_announcement after gossip_store. This can happen if the store was truncated. Reported-by: @jb55 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	a35ab51a06	pytest: gossip_store test for channel_amount truncated. We pass, but this test should have been added a while ago with the original code. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	909f22f117	pytest: gossip_store test for node_announcement before update. We pass, but this test should have been added a while ago with the fix. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	eb5cc47bdd	gossipd: count deleted records correctly when loading gossip_store. The result of an incorrect count was that we failed on next compaction. Fixes: #2743 Fixes: #2742 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	12a523f7c5	pytest: add (xfail) test for store load miscount. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	0d2a4830ed	ccan: update to faster and correct crc32c implementation. I decided to try a faster implementation, only to find our crc32c was not correct! Ouch. I removed the crc32c functions from ccan/crc, and added a new crc32c module which has the Mark Adler x86-64-optimized variants. We bump gossip_store version again, since csums have changed. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	409368e058	pytest: move test_channel_drainage to test_pay.py This is where payment tests should go. Also mark it xfail for the moment, and remove developer-only tag (propagating gossip is only 60 seconds, which is OK). Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Michael Schmoock	4a242edc1f	test: drains a channel to crash the daemon	6 years ago
Rusty Russell	db0a28501b	gossip: bump version to remove lingering issues with master. There were several gossip breakages in master; bumping version means upgrades get a clean store (not just those upgrading from stable version). Fixes: #2719 Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Michael Schmoock	42d6bf564c	test: fix flaky test_gossip_notices_close with wait_for_mempool	6 years ago
Rusty Russell	5161b79bfc	gossipd/gossip_store: keep count of deleted entries, don't use bs->count. We didn't count some records before, so we could compare the two counters. This is much simpler, and avoids reliance on bs. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	728bb4e662	common/gossip_store: handle timestamp filtering. This means we intercept the peer's gossip_timestamp_filter request in the per-peer subdaemon itself. The rest of the semantics are fairly simple however. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	948490ec58	gossipd: add timestamp in gossip store header. (We don't increment the gossip_store version, since there are only a few commits since the last time we did this). This lets the reader simply filter messages; this is especially nice since the channel_announcement timestamp is derived, not in the actual message. This also creates a 'struct gossip_hdr' which makes the code a bit clearer. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	5591c0b5d8	gossipd: don't send gossip stream, let per-peer daemons read it themselves. Keeping the uintmap ordering all the broadcastable messages is expensive: 130MB for the million-channels project. But now we delete obsolete entries from the store, we can have the per-peer daemons simply read that sequentially and stream the gossip itself. This is the most primitive version, where all gossip is streamed; successive patches will bring back proper handling of timestamp filtering and initial_routing_sync. We add a gossip_state field to track what's happening with our gossip streaming: it's initialized in gossipd, and currently always set, but once we handle timestamps the per-peer daemon may do it when the first filter is sent. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	df00f20e4a	gossipd: erase old entries from the store, don't just append. We use the high bit of the length field: this way we can still check that the checksums are valid on deleted fields. Once this is done, serially reading the gossip_store file will result in a complete, ordered, minimal gossip broadcast. Also, the horrible corner case where we might try to delete things from the store during load time is completely gone: we only load non-deleted things. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	696dc6b597	gossipd: disable gossip_store upgrade. We're about to bump version again, and the code to upgrade it was quite hairy (and buggy!). It's not worthwhile for such a poorly-tested path: I will just add code to limit how much incoming gossip we get to avoid flooding when we upgrade, however. I also use a modern gossip_store version in our test_gossip_store_load test, instead of relying on the upgrade path. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	21fe518513	gossip_store: fix 'bad node_announcement' by allowing node_announcement on un-updated channel. When we first receive a channel_update, we write both the channel_announcement and that channel_update to the store: we need that first update so we can set the channel_announcement timestamp. However, the channel_update can be replaced later. This means we can have a channel_announcement, a node_update which relies on it, then the channel_update later. So move the "this applies to a pending announcement" check lower, where gossip_store can use it too. Has a nice side-effect of avoiding one lookup of the node id. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	048a650a6b	pytest: more comprehensive tests for test_gossip_store_compact. First, we should have a channel_update so we actually do some compaction! (Reported-by @SimonVrouwe). But we should also handle the cases where: 1. A channel_announcement is not directly followed by a channel_update (happens when the channel_update is replaced). 2. A node_announcement predates a channel_update for the peer (again, can happen once a channel_update is replaced). 3. A local/private channel_creation is not directly followed by an update. In addition, we might as well check that we can load such a store, before compaction. This checks the corner cases which occur in real gossip stores. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	1147e65602	pytest: make test_gossip_notices_close more reliable. It's possible that it hasn't got the node_announcement messages; it will still list the nodes, however (the channel_announcement tells it the nodes exist). Check for the alias field instead. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	6ee2cd8ce3	openingd: fix hangup when gossipd compacts. My raspberry pi node hung up on my other node: lightning_openingd-... chan #1: Got bad message from gossipd: 0db1 This is because we didn't handle that message in one path. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	7ede5aac31	gossip_store: change format so we store raw messages. Save some overhead, plus gets us ready for giving subdaemons direct store access. This is the first time we upgrade the gossip_store, rather than just discarding. The downside is that we need to add an extra message after each channel_announcement, containing the channel capacity. After: store_load_msec:28337-30288(28975+/-7.4e+02) vsz_kb:582304-582316(582306+/-4.8) store_rewrite_sec:11.240000-11.800000(11.55+/-0.21) listnodes_sec:1.800000-1.880000(1.84+/-0.028) listchannels_sec:22.690000-26.260000(23.878+/-1.3) routing_sec:2.280000-9.570000(6.842+/-2.8) peer_write_all_sec:48.160000-51.480000(49.608+/-1.1) Differences: -vsz_kb:582320 +vsz_kb:582316 -listnodes_sec:2.100000-2.170000(2.118+/-0.026) +listnodes_sec:1.800000-1.880000(1.84+/-0.028) -peer_write_all_sec:51.600000-52.550000(52.188+/-0.34) +peer_write_all_sec:48.160000-51.480000(49.608+/-1.1) Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	cccce75e56	patch refine-test_gossip_persistence.patch	6 years ago
Rusty Russell	ec50ec6a71	gossipd: make gossip loading stats accurate. They didn't count the header sizes when reporting bytes, which is misleading. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	2bd7df93c6	gossipd: preserve unannounced channels across store compaction. Otherwise we'd forget them on restart, again. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	3dd47950ad	pytest: test that gossipd remembers unannounced local channels across restarts Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Christian Decker	59fa47bf64	pytest: Mark the worst gossip offenders as developer-only tests Signed-off-by: Christian Decker <decker.christian@gmail.com>	6 years ago
Christian Decker	b7222531fe	pytest: Stabilize the test_pay_direct test It was waiting for a remote channel, but not for all the interesting channels we want to check. It can sometimes happen that further away channels are added before closer ones are added, depending on propagation path, flush timers and bitcoind poll timers. This now just checks for all channels, which also reduces the ambiguity of whether we selected a path solely because we were lacking alternatives. Signed-off-by: Christian Decker <decker.christian@gmail.com>	6 years ago
Rusty Russell	0ba547ee10	gossipd: handle overflowing query properly (avoid slow 100% CPU reports) Don't do this: (gdb) bt #0 0x00007f37ae667c40 in ?? () from /lib/x86_64-linux-gnu/libz.so.1 #1 0x00007f37ae668b38 in ?? () from /lib/x86_64-linux-gnu/libz.so.1 #2 0x00007f37ae669907 in deflate () from /lib/x86_64-linux-gnu/libz.so.1 #3 0x00007f37ae674c65 in compress2 () from /lib/x86_64-linux-gnu/libz.so.1 #4 0x000000000040cfe3 in zencode_scids (ctx=0xc1f118, scids=0x2599bc49 "\a\325{", len=176320) at gossipd/gossipd.c:218 #5 0x000000000040d0b3 in encode_short_channel_ids_end (encoded=0x7fff8f98d9f0, max_bytes=65490) at gossipd/gossipd.c:236 #6 0x000000000040dd28 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290511, number_of_blocks=8) at gossipd/gossipd.c:576 #7 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290511, number_of_blocks=16) at gossipd/gossipd.c:595 #8 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290495, number_of_blocks=32) at gossipd/gossipd.c:596 #9 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290495, number_of_blocks=64) at gossipd/gossipd.c:595 #10 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=128) at gossipd/gossipd.c:596 #11 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=256) at gossipd/gossipd.c:595 #12 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=512) at gossipd/gossipd.c:595 #13 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17290431, number_of_blocks=1024) at gossipd/gossipd.c:595 #14 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=2047) at gossipd/gossipd.c:596 #15 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=4095) at gossipd/gossipd.c:595 #16 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=8191) at gossipd/gossipd.c:595 #17 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=16382) at gossipd/gossipd.c:595 #18 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=32764) at gossipd/gossipd.c:595 #19 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=65528) at gossipd/gossipd.c:595 #20 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=131056) at gossipd/gossipd.c:595 #21 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=262112) at gossipd/gossipd.c:595 #22 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=524225) at gossipd/gossipd.c:595 #23 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=1048450) at gossipd/gossipd.c:595 #24 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=2096900) at gossipd/gossipd.c:595 #25 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=4193801) at gossipd/gossipd.c:595 #26 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=8387603) at gossipd/gossipd.c:595 #27 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=17289408, number_of_blocks=16775207) at gossipd/gossipd.c:595 #28 0x000000000040ddee in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=33550414) at gossipd/gossipd.c:596 #29 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=67100829) at gossipd/gossipd.c:595 #30 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=134201659) at gossipd/gossipd.c:595 #31 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=268403318) at gossipd/gossipd.c:595 #32 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=536806636) at gossipd/gossipd.c:595 #33 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=1073613273) at gossipd/gossipd.c:595 #34 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=2147226547) at gossipd/gossipd.c:595 #35 0x000000000040ddc6 in queue_channel_ranges (peer=0x3868fc8, first_blocknum=514201, number_of_blocks=4294453094) at gossipd/gossipd.c:595 #36 0x000000000040df26 in handle_query_channel_range (peer=0x3868fc8, msg=0x37e0678 "\001\ao\342\214\n\266\361\263r\301\246\242F\256c\367O\223\036\203e\341Z\b\234h\326\031") at gossipd/gossipd.c:625 The cause was that converting a block number to an scid truncates it at 24 bits. When we look through the index from (truncated number) to (real end number) we get every channel, which is too large to encode, so we iterate again. This fixes both that problem, and also the issue that we'd end up dividing into many empty sections until we get to the highest block number. Instead, we just tack the empty blocks on to then end of the final query. (My initial version requested 0xFFFFFFFE blocks, but the dev code which records what blocks were returned can't make a bitmap that big on 32 bit). Reported-by: George Vaccaro Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	ba41d6e3df	pytest: failing test for overflow in query_channel_range Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	52750f2dcc	pytest: tighten the query_channel_range test. Make the two channels adjacent, and specify exactly the number of divide-and-conquer steps there are. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	9f1f79587e	short_channel_id_dir: new primitive for one direction of short_channel_id Currently only used by gossipd for channel elimination. Also print them in canonical form (/[01]), so tests need to be changed. Suggested-by: @cdecker Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	80753bfbd5	Feedback from @niftynei . Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	dc2ee9639b	listchannels: allow source arg to list channels by their source node. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Rusty Russell	3d016e7249	getroute: allow array of channels to exclude. The pay plugin will use this, rather than the current "suppress for 90 second" hacks. Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>	6 years ago
Christian Decker	659a26ea5a	misc: Update short_channel_id representation to use 'x' separators Reported-by: Alex Bosworth <@alexbosworth> Signed-off-by: Christian Decker <decker.christian@gmail.com>	6 years ago

1 2 3

131 Commits (5d987f2dec8d4d292021945a61c69c09264f031e)