You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
....
# loop throught the candidatesfork, vinself.candidates.items():
# get candidate document frequencycandidate_df=1# get the df for unigram onlyiflen(v.lexical_form) ==1:
candidate_df+=df.get(k, 0)
...
I do not think the paper intends to do this. Shouldn't we get DF for all candidates whether unigram or not, and assign a default value of 1 if candidate is not present in the vocabulary?
The text was updated successfully, but these errors were encountered:
In KP-Miner implementation, n-gram candidates with
n>1
are assignedcandidate_df=1
.See
pke/pke/unsupervised/statistical/kpminer.py
Line 143 in 8f1d05d
I do not think the paper intends to do this. Shouldn't we get DF for all candidates whether unigram or not, and assign a default value of 1 if candidate is not present in the vocabulary?
The text was updated successfully, but these errors were encountered: