-
Notifications
You must be signed in to change notification settings - Fork 2
/
eval.log
82 lines (82 loc) · 5.97 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
Running the following version of UD tools:
commit 13e6b709a8bc643c3f902800321a7beda46feb8d
Author: Dan Zeman <[email protected]>
Date: Sun Nov 13 22:03:41 2022 +0100
Evaluating the following revision of UD_Hindi_English-HIENCS:
commit 2b182e6996a8b6bde6eed5d9babf0dba459ff840
Author: Dan Zeman <[email protected]>
Date: Sat May 14 14:30:24 2022 +0200
Size: counted 26909 of 26909 words (nodes).
Size: min(0, log((N/1000)**2)) = 6.58492160628321.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Did not find at least 10000 development words.
Split: Did not find at least 10000 test words.
Lemmas: '_' is the most frequent lemma.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 1.
Features: 0 out of 26909 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 31 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.8.
Udapi:
TOTAL 6912
Udapi: found 6912 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 26909 words.
Genres: found 1 out of 17 known.
Availability: README does not say Includes text: yes
Availability: '_' is the most frequent form.
validate.py --lang qhe --max-err=10 UD_Hindi_English-HIENCS/qhe_hiencs-ud-dev.conllu
[Line 464 Sent dev-s28 Node 10]: [L3 Syntax punct-causes-nonproj] Punctuation must not cause non-projectivity of nodes [20]
[Line 606 Sent dev-s35 Node 10]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (10:_:aux --> 7:_:obl)
[Line 618 Sent dev-s36 Node 3]: [L3 Syntax leaf-mark-case] 'mark' not expected to have children (3:_:mark --> 2:_:compound)
[Line 703 Sent dev-s40 Node 5]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 835 Sent dev-s47 Node 12]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [5, 10]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 1130 Sent dev-s62 Node 4]: [L3 Syntax right-to-left-appos] Relation 'appos' must go left-to-right.
[Line 1446 Sent dev-s79 Node 11]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 1463 Sent dev-s80 Node 10]: [L3 Syntax leaf-mark-case] 'case' not expected to have children (10:_:case --> 9:_:compound)
[Line 1560 Sent dev-s85 Node 11]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
...suppressing further errors regarding Syntax
Syntax errors: 30
*** FAILED *** with 30 errors
Exit code: 1
validate.py --lang qhe --max-err=10 UD_Hindi_English-HIENCS/qhe_hiencs-ud-test.conllu
[Line 47 Sent test-s3 Node 4]: [L3 Syntax right-to-left-appos] Relation 'appos' must go left-to-right.
[Line 76 Sent test-s4 Node 19]: [L3 Syntax punct-is-nonproj] Punctuation must not be attached non-projectively over nodes [12, 13, 14, 15, 16, 17, 18]
[Line 155 Sent test-s8 Node 6]: [L3 Syntax right-to-left-appos] Relation 'appos' must go left-to-right.
[Line 267 Sent test-s15 Node 6]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 429 Sent test-s24 Node 7]: [L3 Syntax leaf-mark-case] 'case' not expected to have children (7:_:case --> 8:_:compound)
[Line 519 Sent test-s29 Node 12]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [10, 11]. Outer subjects are allowed if a clause acts as the predicate of another clause.
[Line 545 Sent test-s31 Node 6]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 702 Sent test-s40 Node 13]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 722 Sent test-s41 Node 12]: [L3 Syntax too-many-subjects] Node has multiple subjects not subtyped as ':outer': [4, 11]. Outer subjects are allowed if a clause acts as the predicate of another clause.
...suppressing further errors regarding Syntax
Syntax errors: 42
*** FAILED *** with 42 errors
Exit code: 1
validate.py --lang qhe --max-err=10 UD_Hindi_English-HIENCS/qhe_hiencs-ud-train.conllu
[Line 209 Sent train-s11 Node 14]: [L3 Syntax punct-is-nonproj] Punctuation must not be attached non-projectively over nodes [9, 10, 11, 12, 13]
[Line 871 Sent train-s54 Node 1]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'NOUN'
[Line 1050 Sent train-s64 Node 7]: [L3 Syntax leaf-cc] 'cc' not expected to have children (7:_:cc --> 20:_:discourse)
[Line 1049 Sent train-s64 Node 19]: [L3 Syntax punct-is-nonproj] Punctuation must not be attached non-projectively over nodes [8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18]
[Line 1061 Sent train-s65 Node 1]: [L3 Syntax leaf-punct] 'punct' not expected to have children (1:_:punct --> 5:_:acl)
[Line 1134 Sent train-s70 Node 6]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 1214 Sent train-s75 Node 20]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 1241 Sent train-s77 Node 13]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 1375 Sent train-s86 Node 6]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
...suppressing further errors regarding Syntax
Syntax errors: 194
*** FAILED *** with 194 errors
Exit code: 1
Validity: 0.01
(weight=0.0769230769230769) * (score{features}=0.01) = 0.000769230769230769
(weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805
(weight=0.0769230769230769) * (score{lemmas}=0.01) = 0.000769230769230769
(weight=0.256410256410256) * (score{size}=0.476632519562383) = 0.122213466554457
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974
(weight=0.0769230769230769) * (score{tags}=1) = 0.0769230769230769
(weight=0.307692307692308) * (score{udapi}=0.01) = 0.00307692307692308
(weight=0.0769230769230769) * (score{udeprels}=0.67027027027027) = 0.0515592515592516
(TOTAL score=0.277271963965896) * (availability=0.1) * (validity=0.01) = 0.000277271963965896
STARS = 0
UD_Hindi_English-HIENCS 0.000277271963965896 0