-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Download data #72
Download data #72
Conversation
these are intended to be run as pipenv scripts: pipenv run download_data pipenv run delete_data they can also be run from file
Updated Pipfile to specify versions. Updated Scripts definitions. Moved scripts to process_data repository. Edited header comment in download_data.
Hullo, James. Just checking in. This branch looks like it's nearly ready for a merge. Do my remarks mostly make sense? Is there any trouble with |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
All looks good, but make sure to merge changes from main and delete your other PR.
Corrected nits and did some clean-up. Simplified the download_data function with pathlib.
generate new pipfile
Manually combined pipfile and regenerated pipfile.lock, reverted requests version to try and avoid problems
tried to fix errors in pipfile
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Heya, just a couple extra comments to help get this finished! Basically it seems like just five(ish) changes (take as many commits as you need):
- Modify the download_data operations to extract the tar.gz download to the same folder, delete the original file, then extract training_set.tar into training_set/.
- Simplify clean_up_data (honestly, probably could just remove it and use just rm).
- Reorder pipfile alphabetically
- Delete pipfile.lock and run pipenv install again to regenerate it.
- Pull from main and resolve any merge conflicts.
I've fixed the merge conflict with the following commands:
The Let's merge this now and work on follow ups in #85. I've copied any remain comments into that issue and resolved them. |
Converted my fork into a branch called Download-Data