Problem Downloading the data #22

marcomiglionico94 · 2022-03-11T21:31:16Z

I followed the instructions on the README but when i run the script to download the file, it find the first path dataset%2Fpublic%2Fzip%2F0Kajc_nnyZ6K0cRGCQJW56.zip but then give me this error:

urllib.error.URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:852)> unzip: cannot find or open zips/*.zip, zips/*.zip.zip or zips/*.zip.ZIP. No zipfiles found.

The text was updated successfully, but these errors were encountered:

test77123 · 2022-05-11T02:28:43Z

Hello @marcomiglionico94,
I am using MAC OS, so I solved this problem by

browsing to Applications/Python[your python version]
running Install Certificates.command

I hope this will be helpful for you.

tejaswivg · 2022-05-14T15:18:00Z

You can also use a browser plugin such as Downthemall to download all the data after you get the email with links.

weiXiaxvv · 2022-08-10T12:26:51Z

I save the email as an HTML file and run sh download.sh. But this is the result:
Traceback (most recent call last):
File "download_from_email.py", line 48, in
main(args)
File "download_from_email.py", line 32, in main
links = get_all_links(open(args.source))
File "download_from_email.py", line 10, in get_all_links
soup = BeautifulSoup(html, 'lxml')
File "/home/wangqiting/.local/lib/python3.7/site-packages/bs4/init.py", line 251, in init
% ",".join(features))
bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?
unzip: cannot find or open zips/.zip, zips/.zip.zip or zips/*.zip.ZIP.

Can you help me?

W-Q-T · 2022-08-12T06:07:32Z

If cannot download through .html, you can directly download the model compressed package to /data/public_100/zips and run unzip_all.sh to decompress it.

thedevup · 2023-07-12T09:27:27Z

I save the email as an HTML file and run sh download.sh. But this is the result: Traceback (most recent call last): File "download_from_email.py", line 48, in main(args) File "download_from_email.py", line 32, in main links = get_all_links(open(args.source)) File "download_from_email.py", line 10, in get_all_links soup = BeautifulSoup(html, 'lxml') File "/home/wangqiting/.local/lib/python3.7/site-packages/bs4/init.py", line 251, in init % ",".join(features)) bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library? unzip: cannot find or open zips/.zip, zips/.zip.zip or zips/*.zip.ZIP.

Can you help me?

pip install lxml

After that download the email as an html file, name it 'email.html'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem Downloading the data #22

Problem Downloading the data #22

marcomiglionico94 commented Mar 11, 2022 •

edited

Loading

test77123 commented May 11, 2022

tejaswivg commented May 14, 2022

weiXiaxvv commented Aug 10, 2022

W-Q-T commented Aug 12, 2022

thedevup commented Jul 12, 2023

Problem Downloading the data #22

Problem Downloading the data #22

Comments

marcomiglionico94 commented Mar 11, 2022 • edited Loading

test77123 commented May 11, 2022

tejaswivg commented May 14, 2022

weiXiaxvv commented Aug 10, 2022

W-Q-T commented Aug 12, 2022

thedevup commented Jul 12, 2023

marcomiglionico94 commented Mar 11, 2022 •

edited

Loading