
Fix: Network hiccups using Retry logic #306

Closed (wants to merge 1 commit)

Conversation

rd4398 (Contributor) commented Aug 7, 2024:

Fixes: #222
As per the discussion in #228.

@rd4398 rd4398 requested a review from dhellmann August 7, 2024 18:33
@mergify mergify bot added the ci label Aug 7, 2024
@@ -311,7 +311,7 @@ def download_wheel(
     wheel_filename = output_directory / os.path.basename(urlparse(wheel_url).path)
     if not wheel_filename.exists():
         logger.info(f"{req.name}: downloading pre-built wheel {wheel_url}")
-        wheel_filename = _download_wheel_check(output_directory, wheel_url)
+        wheel_filename = _download_wheel_check(output_directory, wheel_url, ctx)
Member:
We have the context as the first argument to functions everywhere else; could you follow that pattern here, too?

rd4398 (author):

okay, done!

@@ -165,7 +164,7 @@ def default_download_source(
     )

     source_filename = _download_source_check(
-        ctx.sdists_downloads, url, destination_filename
+        ctx.sdists_downloads, url, ctx, destination_filename
Member:

Same comment about argument ordering.

rd4398 (author):

okay, done!

@@ -213,7 +218,7 @@ def download_url(
         return outfile
     # Open the URL first in case that fails, so we don't end up with an empty file.
     logger.debug(f"reading from {url}")
-    with requests.get(url, stream=True) as r:
+    with ctx.requests.get(url, stream=True) as r:
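For context, `ctx.requests` here presumably holds a `requests.Session` configured with urllib3's `Retry` so transient failures are retried transparently. A minimal sketch of such a helper (the function name and defaults are hypothetical, not from this PR):

```python
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry


def make_retrying_session(total: int = 5, backoff_factor: float = 0.5) -> requests.Session:
    """Build a Session that retries transient network errors.

    Hypothetical helper: the PR would store something like this on the
    build context so call sites use ctx.requests instead of requests.
    """
    retry = Retry(
        total=total,
        backoff_factor=backoff_factor,
        status_forcelist=(429, 500, 502, 503, 504),
        allowed_methods=("GET", "HEAD"),
    )
    adapter = HTTPAdapter(max_retries=retry)
    session = requests.Session()
    session.mount("https://", adapter)
    session.mount("http://", adapter)
    return session
```

Note that urllib3's `Retry` only re-issues the request itself; once a streaming response body is being consumed, a mid-transfer drop is not retried by the adapter, which is what the next comment is about.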
Member:

How does retrying work with streaming? If it starts over, the caller needs to understand that so it doesn't just write duplicate copies of the data to the same file, right?

rd4398 (author):

Yeah, need to look at this

Member:

https://gist.github.com/tobiasraabe/58adee67de619ce621464c1a6511d7d9 looks potentially useful. The Range HTTP header tells the server where to start sending data, based on how much has already been downloaded. https://medium.com/@lope.ai/downloading-files-over-http-with-python-requests-e12e6b795e43 shows another example of tracking that.
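The approach in those links can be sketched roughly as follows. This is not the PR's code: `session` is assumed to be a `requests.Session` (or compatible object), and resuming only works when the server honours `Range` requests by answering 206 Partial Content; otherwise the file is rewritten from scratch rather than appended to, which avoids the duplicate-data problem raised above.

```python
import os


def download_with_resume(session, url: str, outfile: str, chunk_size: int = 65536) -> None:
    """Resume a partial download via the HTTP Range header (sketch).

    If outfile already holds N bytes, ask the server to start at byte N.
    """
    resume_from = os.path.getsize(outfile) if os.path.exists(outfile) else 0
    headers = {"Range": f"bytes={resume_from}-"} if resume_from else {}
    with session.get(url, stream=True, headers=headers) as r:
        r.raise_for_status()
        # A 200 means the server ignored the Range header: restart from scratch,
        # never append, so we cannot write duplicate copies of the data.
        mode = "ab" if r.status_code == 206 else "wb"
        with open(outfile, mode) as f:
            for chunk in r.iter_content(chunk_size=chunk_size):
                f.write(chunk)
```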

rd4398 (author):

Agreed.
I have tried to use the Range header in the latest version of this PR:

     # Open the URL first in case that fails, so we don't end up with an empty file.
     logger.debug(f"reading from {url}")
-    with requests.get(url, stream=True) as r:
+    with ctx.requests.get(url, stream=True, headers=headers) as r:
Member:

I think we're going to need a loop here to handle the retry.

Let's set this aside for now and come back when we have more time to work on it together.
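The loop the reviewer is asking for might look like this sketch (names and defaults are hypothetical, not the PR's code). Each attempt resumes from whatever is already on disk via the Range header, so a restarted transfer never appends duplicate bytes; requests' `RequestException` derives from `IOError`, so catching `OSError` covers connection errors and timeouts.

```python
import os
import time


def download_url_with_retries(session, url, outfile, max_attempts=3, backoff=1.0):
    """Retry a streaming download, resuming partial transfers (sketch)."""
    for attempt in range(1, max_attempts + 1):
        resume_from = os.path.getsize(outfile) if os.path.exists(outfile) else 0
        headers = {"Range": f"bytes={resume_from}-"} if resume_from else {}
        try:
            with session.get(url, stream=True, headers=headers) as r:
                r.raise_for_status()
                # Append only when the server actually resumed (206 Partial
                # Content); a plain 200 means start the file over.
                with open(outfile, "ab" if r.status_code == 206 else "wb") as f:
                    for chunk in r.iter_content(chunk_size=65536):
                        f.write(chunk)
            return outfile
        except OSError:
            if attempt == max_attempts:
                raise
            time.sleep(backoff * attempt)
```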

shubhbapna (Collaborator):

Converting to draft based on the last review comment.

@shubhbapna shubhbapna marked this pull request as draft August 14, 2024 17:46
@shubhbapna shubhbapna closed this Oct 11, 2024
@shubhbapna shubhbapna deleted the network-hiccups branch October 11, 2024 14:55
Linked issue that merging may close: Make fromager more resilient against network hiccups
3 participants