cisnlp/oscar-io

Types and IO (Reader/Writer) for GlotCC/OSCAR Corpus processing and generation.

The crate provides basic abstractions around Corpus items, along with generic readers and writers usable with GlotCC/OSCAR Corpus files. Eventually, it should replace the reader implementations in both cisnlp/Ungoliant and cisnlp/oscar-tools.

Features

cisnlp/oscar-io aims to provide readers/writers for numerous types of GlotCC/OSCAR Corpora.