Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Post-installation error #14

Open
Kevin-Prichard opened this issue Mar 4, 2025 · 0 comments
Open

Post-installation error #14

Kevin-Prichard opened this issue Mar 4, 2025 · 0 comments

Comments

@Kevin-Prichard
Copy link

The package requires tabulate to function, and despite it being mentioned in requirements.txt it does not get installed via pip install metawarc.

I may address this with a pull, when I get some free time. Thanks for the package, it is helpful for testing code that reads and writes WARCs.

Install:

(venv) kev@ganymede:~/projs/coisas-deliciosas$ pip install metawarc
Collecting metawarc
  Downloading metawarc-1.1.1.tar.gz (15 kB)
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Preparing metadata (pyproject.toml) ... done
Requirement already satisfied: warcio in ./venv/lib/python3.13/site-packages (from metawarc) (1.7.5)
Collecting pdfminer (from metawarc)
  Downloading pdfminer-20191125.tar.gz (4.2 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 4.2/4.2 MB 3.3 MB/s eta 0:00:00
  Installing build dependencies ... -
done
  Getting requirements to build wheel ... done
  Preparing metadata (pyproject.toml) ... done
Collecting click (from metawarc)
  Using cached click-8.1.8-py3-none-any.whl.metadata (2.3 kB)
Collecting rich (from metawarc)
  Using cached rich-13.9.4-py3-none-any.whl.metadata (18 kB)
Requirement already satisfied: lxml in ./venv/lib/python3.13/site-packages (from metawarc) (5.3.1)
Collecting hachoir (from metawarc)
  Downloading hachoir-3.3.0-py3-none-any.whl.metadata (2.9 kB)
Collecting sqlalchemy (from metawarc)
  Downloading SQLAlchemy-2.0.38-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (9.6 kB)
Collecting pycryptodome (from pdfminer->metawarc)
  Downloading pycryptodome-3.21.0-cp36-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (3.4 kB)
Collecting markdown-it-py>=2.2.0 (from rich->metawarc)
  Using cached markdown_it_py-3.0.0-py3-none-any.whl.metadata (6.9 kB)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in ./venv/lib/python3.13/site-packages (from rich->metawarc) (2.19.1)
Collecting greenlet!=0.4.17 (from sqlalchemy->metawarc)
  Using cached greenlet-3.1.1-cp313-cp313-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl.metadata (3.8 kB)
Requirement already satisfied: typing-extensions>=4.6.0 in ./venv/lib/python3.13/site-packages (from sqlalchemy->metawarc) (4.12.2)
Requirement already satisfied: six in ./venv/lib/python3.13/site-packages (from warcio->metawarc) (1.17.0)
Collecting mdurl~=0.1 (from markdown-it-py>=2.2.0->rich->metawarc)
  Using cached mdurl-0.1.2-py3-none-any.whl.metadata (1.6 kB)
Using cached click-8.1.8-py3-none-any.whl (98 kB)
Downloading hachoir-3.3.0-py3-none-any.whl (650 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 650.4/650.4 kB 2.8 MB/s eta 0:00:00
Using cached rich-13.9.4-py3-none-any.whl (242 kB)
Downloading SQLAlchemy-2.0.38-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.2 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 3.2/3.2 MB 3.1 MB/s eta 0:00:00
Using cached greenlet-3.1.1-cp313-cp313-manylinux_2_24_x86_64.manylinux_2_28_x86_64.whl (615 kB)
Using cached markdown_it_py-3.0.0-py3-none-any.whl (87 kB)
Downloading pycryptodome-3.21.0-cp36-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.3 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 2.3/2.3 MB 3.0 MB/s eta 0:00:00
Using cached mdurl-0.1.2-py3-none-any.whl (10.0 kB)
Building wheels for collected packages: metawarc, pdfminer
  Building wheel for metawarc (pyproject.toml) ... done
  Created wheel for metawarc: filename=metawarc-1.1.1-py2.py3-none-any.whl size=16703 sha256=99a787b6b0b0a80d5a8d13e665fa46625909136b9c46349bcf6942c304e3d1d5
  Stored in directory: /home/kev/.cache/pip/wheels/f8/fa/9a/d931532c443a7627d7cf9fc340354c105930a19b033b7f38ce
  Building wheel for pdfminer (pyproject.toml) ... done
  Created wheel for pdfminer: filename=pdfminer-20191125-py3-none-any.whl size=6140774 sha256=2b79fafed2638b3f2e85b9c70a2b6a5e1bdb4cfa33b9c8dc526c7278c4144d0c
  Stored in directory: /home/kev/.cache/pip/wheels/b6/45/85/3db811d181e6ac5b5ea9187c72cdd94ff1df2642d97cf3106a
Successfully built metawarc pdfminer
Installing collected packages: hachoir, pycryptodome, mdurl, greenlet, click, sqlalchemy, pdfminer, markdown-it-py, rich, metawarc
Successfully installed click-8.1.8 greenlet-3.1.1 hachoir-3.3.0 markdown-it-py-3.0.0 mdurl-0.1.2 metawarc-1.1.1 pdfminer-20191125 pycryptodome-3.21.0 rich-13.9.4 sqlalchemy-2.0.38

Run with error:

(venv) kev@ganymede:~/projs/coisas-deliciosas$ metawarc 
Traceback (most recent call last):
  File "/home/kev/projs/coisas-deliciosas/venv/bin/metawarc", line 8, in <module>
    sys.exit(main())
             ~~~~^^
  File "/home/kev/projs/coisas-deliciosas/venv/lib/python3.13/site-packages/metawarc/__main__.py", line 10, in main
    from .core import cli
  File "/home/kev/projs/coisas-deliciosas/venv/lib/python3.13/site-packages/metawarc/core.py", line 7, in <module>
    from .cmds.analyzer import Analyzer
  File "/home/kev/projs/coisas-deliciosas/venv/lib/python3.13/site-packages/metawarc/cmds/analyzer.py", line 3, in <module>
    from tabulate import tabulate
ModuleNotFoundError: No module named 'tabulate'

Install missing tabulate:

(venv) kev@ganymede:~/projs/coisas-deliciosas$ pip install tabulate
Collecting tabulate
  Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB)
Downloading tabulate-0.9.0-py3-none-any.whl (35 kB)
Installing collected packages: tabulate
Successfully installed tabulate-0.9.0

Run successfully:

(venv) kev@ganymede:~/projs/coisas-deliciosas$ metawarc 
Usage: metawarc [OPTIONS] COMMAND [ARGS]...

Options:
  --help  Show this message and exit.

Commands:
  analyze   Analysis of the WARC
  dump      Dumps content by query
  export    Exports WARC file headers or warcio index
  index     Generates WARC file index
  list      Lists urls inside WARC file
  metadata  Extracts metadata from files inside WARC file or another file...
  stats     Generates WARC file index
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant