We use feerate in several places, and each one really should react
differently when it's not available (such as when bitcoind is still
catching up):
1. For general fee-enforcement, we use the broadest possible limits.
2. For closingd, we use it as our opening negotiation point: just use half
the last tx feerate.
3. For onchaind, we can use the last tx feerate as a guide for our own txs;
it might be too high, but at least we know it was sufficient to be mined.
4. For withdraw and fund_channel, we can simply refuse.
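Sketched in Python (illustrative only; the real logic lives in lightningd's C
code and these names are invented):

    def feerate_for(purpose, estimate, last_tx_feerate):
        """Pick a feerate when the estimate may be None (bitcoind behind)."""
        if estimate is not None:
            return estimate
        if purpose == 'enforcement':
            return None                     # caller uses broadest limits
        if purpose == 'closingd':
            return last_tx_feerate // 2     # opening negotiation point
        if purpose == 'onchaind':
            return last_tx_feerate          # known sufficient to be mined
        raise RuntimeError('Cannot estimate fees')  # withdraw/fund_channel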
Fixes: #1836
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Manipulate fees via fake-bitcoin-cli. It's not quite the same, as
these are pre-smoothing, so we need a restart to override that where
we really need an exact change. Or we can wait until it reaches a
certain value in cases where we don't care about exact amounts.
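The second pattern looks roughly like this (hypothetical helper; assumes
some accessor for the node's current smoothed feerate):

    import time

    def wait_feerate_at_least(node, target, timeout=60):
        # Poll until smoothing catches up; the exact value doesn't matter.
        deadline = time.time() + timeout
        while node.current_feerate() < target:   # hypothetical accessor
            if time.time() > deadline:
                raise TimeoutError('feerate never reached %d' % target)
            time.sleep(0.25)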
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We don't respond to fee changes until we're locked in: make sure we catch
up at that point.
Note that we use NORMAL fees during opening, but IMMEDIATE after, so
this often sends a fee update. For the tests this breaks, we set
those feerates to be equal.
This (sometimes) changes the behavior of test_permfail, as we now
get an immediate commit; that test is also fixed to always wait for
the commit to complete.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This is a noop if we're opening a new channel (channel_fees_can_change(channel)
is false until funding locked in), but important if we're restarting.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We don't update a channel's feerate on reestablishment: we insert a restart
in test_onchain_different_fees() (which we'll need soon anyway) to show it.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
When in this state, we send a canned error "Awaiting unilateral close".
We enter this both when we drop to chain, and when we're trying to get
them to drop to chain due to option_data_loss_protect.
As this state (unlike channel errors) is saved to the database, it means
we will *never* talk to a peer again in this state, so they can't
confuse us.
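In pseudocode (a Python sketch of the C logic; state and function names
here are illustrative):

    AWAITING_UNILATERAL = 'AWAITING_UNILATERAL'   # illustrative state name

    def on_peer_reconnect(channel, send_error):
        # The state is persisted, so this holds across restarts too.
        if channel.state == AWAITING_UNILATERAL:
            send_error(channel.id, 'Awaiting unilateral close')
            return False    # never hand this peer to a subdaemon again
        return True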
Since we set this state in channel_fail_permanent() (which is the only
place we call drop_to_chain for a unilateral close), we don't need to
save to the db: channel_set_state() does that for us.
This state change has a subtle effect: we return WIRE_UNKNOWN_NEXT_PEER
instead of WIRE_TEMPORARY_CHANNEL_FAILURE as soon as we get a failure
with a peer. To provoke a temporary failure in test_pay_disconnect we
take the node offline.
Reported-by: Christian Decker @cdecker
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This means we don't try to unilaterally close after a restart, *and*
we can tell onchaind to try to use the point to recover funds when the
peer unilaterally closes.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Firstly, if they claim to know a future value, we ask the HSM; if
they're right, we tell the master the per-commitment secret it gave
us (we have no way to validate this, though) and it will not broadcast
a unilateral (knowing it would cause them to use a penalty tx!).
Otherwise, we check the results they sent were valid. The spec says
to do this (and close the channel if it's wrong!), because otherwise they
could continually lie and give us a bad per-commitment-secret when we
actually need it.
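A sketch of both checks (Python; the real validation goes through the HSM
and shachain, and every name below is invented):

    class ChannelError(Exception):
        pass

    def check_reestablish(their_next_index, our_next_index,
                          their_claimed_secret, secret_we_sent,
                          hsm_confirms_future_secret):
        claimed = their_next_index - 1
        if claimed >= our_next_index:
            # They know more than we do: if the HSM confirms the secret,
            # we've lost state, and the master must not broadcast a
            # unilateral close, since they'd answer it with a penalty tx.
            if hsm_confirms_future_secret(claimed, their_claimed_secret):
                return 'fallen-behind'
            raise ChannelError('bad future per-commitment secret')
        # Otherwise it must match exactly what we revealed, else they
        # could lie every time and feed us garbage when we need it.
        if their_claimed_secret != secret_we_sent(claimed):
            raise ChannelError('bad per-commitment secret')
        return 'ok'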
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Tests were failing when run in the same thread after a test which set
log_all_io=True, because SIGUSR1 seemed to be turning logging *off*.
This is due to Python using references not copies for assignment.
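The gotcha in miniature:

    opts = {'log_all_io': True}
    saved = opts                  # a reference: both names share one dict
    opts['log_all_io'] = False
    assert saved['log_all_io'] is False   # "saved" changed too

    import copy
    saved = copy.deepcopy(opts)   # an actual copy is what was needed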
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
This is required for the next test, which has to log messages from channeld
as soon as it starts (so sending SIGUSR1 might be too late).
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We ignore incoming for now, but this means we advertise the option and
we send the required fields.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Peer features are only kept for connected peers (as they can change),
but we didn't update them on reconnect. The main effect was that
after a restart we displayed the features as empty, even after
reconnect.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
1. l1 update_fee -> l2
2. l1 commitment_signed -> l2 (using new feerate)
3. l1 <- revoke_and_ack l2
4. l1 <- commitment_signed l2 (using new feerate)
5. l1 -> revoke_and_ack l2
When we break the connection after #3, the reconnection causes #4 to
be retransmitted, but it turns out l1 wasn't telling the master to set
the local feerate until it received the commitment_signed, so on
reconnect it uses the old feerate, with predictable results (bad
signature).
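The fix, schematically (Python pseudocode for channeld's C; names are
invented): record the feerate as soon as it is part of a signed
commitment, so a post-reconnect retransmission is built and checked
against the new rate.

    def on_update_fee_committed(chan, new_feerate, tell_master):
        # Previously this only happened on receiving commitment_signed,
        # so a reconnect between steps 3 and 4 left the master (and the
        # restarted channeld) on the stale feerate.
        chan.local_feerate = new_feerate
        tell_master('set_feerate', chan.id, new_feerate)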
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Useful if we want to intercept bitcoin-cli first.
We move the getinfo() caching into start(), as that's when we can actually
use RPC.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We're going to use it to override specific commands. It's non-valgrinded
already since we use '--trace-children-skip=*bitcoin-cli*' so the overhead
should be minimal.
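A wrapper of this shape (hypothetical, simplified):

    #!/usr/bin/env python3
    # Hypothetical fake-bitcoin-cli: serve canned replies for selected
    # commands, pass everything else through to the real bitcoin-cli.
    import json, subprocess, sys

    def main():
        cmd = next((a for a in sys.argv[1:] if not a.startswith('-')), None)
        try:
            overrides = json.load(open('fake-bitcoin-cli-overrides.json'))
        except FileNotFoundError:
            overrides = {}
        if cmd in overrides:
            print(json.dumps(overrides[cmd]))
            return 0
        return subprocess.call(['bitcoin-cli'] + sys.argv[1:])

    if __name__ == '__main__':
        sys.exit(main())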
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We got an index error, because status had only one field (onchaind not
started yet).
> wait_for(lambda: only_one(p.rpc.listpeers(l1.info['id'])['peers'][0]['channels'])['status'][1] == 'ONCHAIN:Tracking mutual close transaction')
E IndexError: list index out of range
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Normal wallet txs get reconfirmed as blocks come in, but ones which need
closeinfo are more fragile, so we do it manually using txwatch for them.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
In one case, the channel_update which we expected to activate the channel
from l2 was suppressed as redundant. This is certainly valid, so just
check the results.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Waiting for three node_announcements isn't always enough, since l2 can
publish two of them (an independent bug). Do the more Right Thing and
just wait for 30 seconds of no input...
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
1. connect convenience variable for improved readability.
2. a comment explaining that timer is on channel, not HTLC.
3. use modern python style in test_htlc_send_timeout
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Now sending a ping makes sense: it should force the other end to send
a reply, unblocking the commitment process.
Note that rather than waiting for a reply, we're actually spinning on
a 100ms loop in this case. But it's simple and it works.
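Roughly (Python pseudocode for channeld's C; the connection API here is
illustrative):

    import time

    def unblock_commitment(conn, timeout=30):
        conn.send_ping(num_pong_bytes=1)         # forces the peer to reply
        deadline = time.time() + timeout
        while not conn.commitment_unblocked():   # illustrative predicate
            if time.time() > deadline:
                raise TimeoutError('peer never replied')
            time.sleep(0.1)                      # simple 100ms spin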
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Currently, if we don't realize a TCP connection is down, we almost
certainly don't find out until *after* we've sent the
commitment_signed message, in which case we cannot fail the incoming
HTLC.
This test demonstrates that. Note the 30 second sleep: we should really
run Travis tests in parallel!
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
If we have an address hint, we start with that, but we'll use
node_announcement information if required.
Note: we (ab)use the address hint when restoring from the database
or reconnecting, even if the connection was *incoming*. That meant
that the recipient of a connection would *never* manage to connect out.
We still don't take multiple addresses from the DNS seeds: I assume we
should, since there could be IPv4 and IPv6.
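The resulting candidate list is roughly (illustrative sketch):

    def addresses_to_try(addr_hint, announced_addrs):
        # The hint goes first, but may be useless (e.g. it recorded the
        # source of an *incoming* connection); node_announcement
        # addresses are the fallback.
        candidates = [addr_hint] if addr_hint is not None else []
        candidates.extend(a for a in announced_addrs if a != addr_hint)
        return candidates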
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
connectd tells master about every disconnection, and master knows
whether it's important to reconnect. Just get the master to invoke a new
connect command if it considers the peer important!
The only twist is timeouts: we don't want to immediately reconnect if
we've failed to connect. To solve this, connectd passes a 'delaytime'
to the master when a connection fails, and the master passes it back
when it asks for a connection.
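Schematically (illustrative Python; the real flow is wire messages
between connectd and the master, and the backoff numbers are made up):

    MAX_DELAY = 300   # seconds; illustrative cap

    def on_connect_failed(master, peer_id, delaytime):
        # connectd suggests a delay; an important peer gets retried, and
        # the master hands back a grown delay with the next request.
        if master.peer_is_important(peer_id):
            master.schedule(delaytime, lambda: master.connect(
                peer_id, delaytime=min(delaytime * 2, MAX_DELAY)))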
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
In particular, all opening_read_peer_msg() callers need to know there
was an error (presumably during negotiation) so they can stop, but we should
not exit.
This lets us reenable the final disabled test.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
We now simply maintain a pubkey set for connected peers (we only care
if there's a reconnect), not the entire peer structure.
lightningd no longer queries us for getpeers: it knows more than we do
already.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
There's now a potential race: the source peer connect returns, but in
destination peer the master hasn't read the connect message from
connectd, so the peer isn't in listpeers yet.
(Previously the connection stayed in connectd, so there was no such
window).
This is an occasional issue in a few places.
Note that we take the opportunity to speed up test_disconnectpeer too
while we're there.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Prior to this, lightningd would hand uninteresting peers back to connectd,
which would then return them to lightningd if they sent a non-gossip msg,
or if lightningd asked it to release the peer.
Now connectd hands the peer to lightningd once we've done the init
handshake, which hands it off to openingd.
This is a deep structural change, so we do the minimum here and cleanup
in the following patches.
Lightningd:
1. Remove peer_nongossip handling from connect_control and peer_control.
2. Remove the list of outstanding fundchannel commands; it was only needed to
find the race between us asking connectd to release the peer and it
reconnecting.
3. We can no longer tell if the remote end has started trying to fund a
channel (until it has succeeded): it's very transitory anyway so not
worth fixing.
4. We now always have a struct peer, and allocate an uncommitted_channel
for it, though it may never be used if neither end funds a channel.
5. We start funding on messages for openingd: we can get a funder_reply
or a fundee, or an error in response to our request to fund a channel,
so we handle all of them.
6. A new peer_start_openingd() is called after connectd hands us a peer.
7. json_fund_channel just looks through local peers; there are none
hidden in connectd any more.
8. We sometimes start a new openingd just to send an error message.
Openingd:
1. We always have information we need to accept them funding a channel (in
the init message).
2. We have to listen for three fds: peer, gossip and master, so we opencode
the poll.
3. We have an explicit message to start trying to fund a channel.
4. We can be told to send a message in our init message.
Testing:
1. We don't handle some things gracefully yet, so two tests are disabled.
2. 'hand_back_peer .*: now local again' from connectd is no longer a message;
openingd says 'Handed peer, entering loop' once it's managing it.
3. peer['state'] used to be set to 'GOSSIPING' (otherwise the field didn't
exist); 'state' is now per-channel, so it doesn't exist at all for peers now.
4. Some tests now need to turn on IO logging in openingd, not connectd.
5. There's a gap between connecting on one node and having connectd on
the peer hand over the connection to openingd. Our tests sometimes
checked getpeers() on the peer, and didn't see anything, so line_graph
needed updating.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
In particular, I found leftover lightning_openingd processes after running
tests. When we use the dev_disconnect blackhole '0' option, they
stick around until the dev_disconnect file is truncated (there is only
so much you can do with only a file descriptor), so let's do that.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
The following changes revealed this race, where expecting listchannels()
to contain two channels immediately after fund_channel() was racy.
We also derive the short_channel_id first, so we can search logs for the
exact messages.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
The next patches get better at reconnecting, so if we use dev-allow-localhost
nodes can often find each other and reconnect before shutting down; only
use that option where we actually need it.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Saw this in Travis: technically we return from the dev_set_max_scids...
cmd after sending it to gossipd, but we should wait for it to log.
Adding an internal reply message for a dev command seems overkill.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
1. If the IPv6 address was public, that changed the wireaddr and thus the ipv4 bind
would not be to a wildcard and would fail.
2. Binding two fds to the same port on both wildcard IPv4 and IPv6 succeeds; we only
fail when we try to listen, so allow error at this point.
For some reason this triggered on my Digital Ocean machine.
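The second case, mirrored with plain sockets (Python sketch; exactly
where the clash surfaces is platform-dependent):

    import socket

    def bind_wildcards(port):
        socks = []
        for family, addr in ((socket.AF_INET6, '::'),
                             (socket.AF_INET, '0.0.0.0')):
            s = socket.socket(family, socket.SOCK_STREAM)
            s.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
            try:
                s.bind((addr, port))    # both binds can succeed...
                s.listen(5)             # ...with the clash surfacing here
                socks.append(s)
            except OSError:
                s.close()               # tolerate: the other family works
        return socks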
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>