Calling DialPeer from DialPeer started goroutine deadlocks. #161

Jorropo · 2020-02-24T18:41:23Z

While implementing webrtc aside I found a deadlock (still finishing properly in a timeout), calling DialPeer on the same peers that was used to instanciate your goroutine (example: calling host.NewStream or host.Connect from the Dial function of a transport (only if peer ID is the same)).

It seems like the first Connect (starting the transport Dial) is the problem, the second one just wait on the channel as expected.
I think transport dial are somehow tested sequentially that mean for Connect to return to the transport would require Dial in the transport to return (here is the deadlocks).

Jorropo · 2020-02-24T19:43:15Z

Issue found, that was on the host, I was at one point beffore doing a Connect with a single address (the one of my transport) and so the host store that in the peerstore, and then future connect don't resolve the dht because routedhost only does if no address are avaible.

Should be solved with peerstore address origin.

Jorropo · 2020-03-01T14:13:57Z

Even with this fixed the bug is still here, while implementing #162 I've found the bug, if a dial already have been started newer address are not used to start a new dial. Gonna be fixed in #167.

Stebalien · 2020-03-02T00:11:59Z

There's no way to fix this issue inside the swarm itself. Transports need to avoid recursively trying to dial the same peer.

Stebalien · 2020-03-02T00:13:57Z

Nevermind. I'm not sure how to fix this in the swarm itself but we need to find some way to detect this.

Stebalien · 2020-03-02T00:28:04Z

I've re-reported this issue in libp2p/go-libp2p#816 to give a clear description of what's happening.

Jorropo closed this as completed Feb 24, 2020

Jorropo reopened this Mar 1, 2020

Stebalien closed this as completed Mar 2, 2020

Stebalien reopened this Mar 2, 2020

Stebalien mentioned this issue Mar 2, 2020

Recursive dialing can block libp2p/go-libp2p#816

Open

Stebalien closed this as completed Mar 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calling DialPeer from DialPeer started goroutine deadlocks. #161

Calling DialPeer from DialPeer started goroutine deadlocks. #161

Jorropo commented Feb 24, 2020 •

edited

Loading

Jorropo commented Feb 24, 2020

Jorropo commented Mar 1, 2020 •

edited

Loading

Stebalien commented Mar 2, 2020

Stebalien commented Mar 2, 2020

Stebalien commented Mar 2, 2020

Calling DialPeer from DialPeer started goroutine deadlocks. #161

Calling DialPeer from DialPeer started goroutine deadlocks. #161

Comments

Jorropo commented Feb 24, 2020 • edited Loading

Jorropo commented Feb 24, 2020

Jorropo commented Mar 1, 2020 • edited Loading

Stebalien commented Mar 2, 2020

Stebalien commented Mar 2, 2020

Stebalien commented Mar 2, 2020

Jorropo commented Feb 24, 2020 •

edited

Loading

Jorropo commented Mar 1, 2020 •

edited

Loading