Skip to content

Latest commit

 

History

History
42 lines (23 loc) · 2.61 KB

PLANNED_DATA.md

File metadata and controls

42 lines (23 loc) · 2.61 KB

Planned datasets

Data subsets are devided according to single languages or language pairs, which are labeled with the respective ISO code.

hun

Hungarian translations of Pite Saami spoken texts collected by Ignác Halász.

kpv

Komi-Zyrian. The texts were digitalized by the Fennougrica project, proofread bu FU-Lab and processed by the Izhva Komi Documentation Project.

sia

  • Akkala Saami spoken texts collected and published by Arvid Genetz. The texts were digitalized by the Kola Saami Documentation Project.
  • Gospel of Matthew by Arvid Genetz

sia-sms

Gospel of Matthew by Arvid Genetz (Akkala) and Konstantin Ščekoldin (Skolt)

sjd

  • Kildin Saami spoken texts collected and published by Arvid Genetz. The texts were digitalized by the Kola Saami Documentation Project.
  • Gospel of Matthew by Arvid Genetz.

sjd-sms

Gospel of Matthew by Arvid Genetz (Akkala) and Konstantin Ščekoldin (Skolt)

sje-hun

Pite Saami spoken texts with Hungarian translations collected and published by Ignác Halász. The texts were digitalized by the Pite Saami Documentation Project; annotations are being added in that project (and its descendants) on an on-going basis.

sms

  • Skolt Saami spoken texts collected and published by Arvid Genetz. The texts were digitalized by the Kola Saami Documentation Project.
  • Gospel of Matthew by Konstantin Ščekoldin

sjt

Ter Saami spoken texts collected and published by Arvid Genetz. The texts were digitalized by the Kola Saami Documentation Project.