Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem Downloading the data #22

Open
marcomiglionico94 opened this issue Mar 11, 2022 · 5 comments
Open

Problem Downloading the data #22

marcomiglionico94 opened this issue Mar 11, 2022 · 5 comments

Comments

@marcomiglionico94
Copy link

marcomiglionico94 commented Mar 11, 2022

I followed the instructions on the README but when i run the script to download the file, it find the first path dataset%2Fpublic%2Fzip%2F0Kajc_nnyZ6K0cRGCQJW56.zip but then give me this error:

urllib.error.URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:852)> unzip: cannot find or open zips/*.zip, zips/*.zip.zip or zips/*.zip.ZIP. No zipfiles found.

@test77123
Copy link

Hello @marcomiglionico94,
I am using MAC OS, so I solved this problem by

  1. browsing to Applications/Python[your python version]
  2. running Install Certificates.command

I hope this will be helpful for you.

@tejaswivg
Copy link

You can also use a browser plugin such as Downthemall to download all the data after you get the email with links.

@weiXiaxvv
Copy link

I save the email as an HTML file and run sh download.sh. But this is the result:
Traceback (most recent call last):
File "download_from_email.py", line 48, in
main(args)
File "download_from_email.py", line 32, in main
links = get_all_links(open(args.source))
File "download_from_email.py", line 10, in get_all_links
soup = BeautifulSoup(html, 'lxml')
File "/home/wangqiting/.local/lib/python3.7/site-packages/bs4/init.py", line 251, in init
% ",".join(features))
bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?
unzip: cannot find or open zips/.zip, zips/.zip.zip or zips/*.zip.ZIP.

Can you help me?

@W-Q-T
Copy link

W-Q-T commented Aug 12, 2022

If cannot download through .html, you can directly download the model compressed package to /data/public_100/zips and run unzip_all.sh to decompress it.

@thedevup
Copy link

I save the email as an HTML file and run sh download.sh. But this is the result: Traceback (most recent call last): File "download_from_email.py", line 48, in main(args) File "download_from_email.py", line 32, in main links = get_all_links(open(args.source)) File "download_from_email.py", line 10, in get_all_links soup = BeautifulSoup(html, 'lxml') File "/home/wangqiting/.local/lib/python3.7/site-packages/bs4/init.py", line 251, in init % ",".join(features)) bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library? unzip: cannot find or open zips/.zip, zips/.zip.zip or zips/*.zip.ZIP.

Can you help me?

pip install lxml

After that download the email as an html file, name it 'email.html'

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants