You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
DAB can have multiple reversal indexes. The dictionary has a default or vernacular language which is the language being documented. There is also an analysis language, which is the language used to enter the notes, glosses, and definitions.
There is a reverse index of the glosses back to the head word in the dictionary. The data.sqlite database has an entries table that has id and name fields. The reversal index (which has the file name lexicon-[langid].idx) has a list of glosses in a related language (often the analysis language) with the id from the entries table. It is a tab separated values file (TSV).
Here is the top of an example file:
abon ikan 1471
abu (sisa pembakaran) 1050
ada 2345
adat-istiadat 768
adu 604
aduh 0
advokat 16
There can be multiple words in the vernacular language that map to the reversal language which will be a comma-separated list of indexes.
In the display of the index, there will be the reversal word with the list of vernacular words below it. If there is a non-zero homonym-index, it will be displayed as a superscript after the vernacular word.
There can be multiple reversal indexes.
Convert to a format that is useful to the PWA.
The naive approach would be to convert to a JSON file with a mapping of index word to array of objects with the vernacular index, name, and homonym_index. Performance might not be good with 100K entries.
The text was updated successfully, but these errors were encountered:
DAB can have multiple reversal indexes. The dictionary has a default or vernacular language which is the language being documented. There is also an analysis language, which is the language used to enter the notes, glosses, and definitions.
There is a reverse index of the glosses back to the head word in the dictionary. The data.sqlite database has an entries table that has id and name fields. The reversal index (which has the file name
lexicon-[langid].idx
) has a list of glosses in a related language (often the analysis language) with the id from the entries table. It is a tab separated values file (TSV).Here is the top of an example file:
There can be multiple words in the vernacular language that map to the reversal language which will be a comma-separated list of indexes.
In the display of the index, there will be the reversal word with the list of vernacular words below it. If there is a non-zero homonym-index, it will be displayed as a superscript after the vernacular word.
There can be multiple reversal indexes.
Convert to a format that is useful to the PWA.
The naive approach would be to convert to a JSON file with a mapping of index word to array of objects with the vernacular index, name, and homonym_index. Performance might not be good with 100K entries.
The text was updated successfully, but these errors were encountered: