Skip to content

Commit

Permalink
Add some documentation on the ontonotes/ww conversion
Browse files Browse the repository at this point in the history
  • Loading branch information
AngledLuffa committed Nov 13, 2023
1 parent b301532 commit 27d306f
Showing 1 changed file with 11 additions and 0 deletions.
11 changes: 11 additions & 0 deletions stanza/utils/datasets/ner/ontonotes_multitag.py
Original file line number Diff line number Diff line change
@@ -1,3 +1,14 @@
"""
Combines OntoNotes and WW into a single dataset with OntoNotes used for dev & test
The resulting dataset has two layers saved in the multi_ner column.
WW is kept as 9 classes, with the tag put in either the first or
second layer depending on the flags.
OntoNotes is converted to one column for 18 and one column for 9 classes.
"""

import argparse
import json
import os
Expand Down

0 comments on commit 27d306f

Please sign in to comment.