Skip to content

Latest commit

 

History

History
20 lines (14 loc) · 1019 Bytes

README.md

File metadata and controls

20 lines (14 loc) · 1019 Bytes

german-ipa-dict

German IPA dictionary as extracted from wiktionary

Generate de.csv from en.wiktionary

  • Download enwiktionary-****-pages-meta-current.xml dump from Wikimedia
  • $ python3 ./extract_de_ipa_en.py > de_enwikt_raw.csv
  • $ sort < de_enwikt_raw.csv > de_enwikt.csv

Generate de.csv from de.wiktionary

  • Download dewiktionary-****-pages-meta-current.xml dump from Wikimedia
  • Change INFILE at the top of extract_de_ipa.py to point to the .xml.bz2 file you downloaded
  • $ python3 ./extract_de_ipa.py

Documentation