Merge branch 'master' of https://github.com/quwatsunny/lexical-models
Showing 19 changed files with 50,324 additions and 70 deletions.
7 changes: 0 additions & 7 deletions
release/burushos/burusho.bsk-latn.burushaski/burusho.bsk-latn.burushaski.model_info
This file was deleted.
8 files renamed without changes.
6 changes: 6 additions & 0 deletions
release/chechen_language/chechen_language.ce-latn.chechen/HISTORY.md
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
Chechen Change History | ||
==================== | ||
|
||
1.0 (2025-01-20) | ||
---------------- | ||
* Created by Chechen Language |
21 changes: 21 additions & 0 deletions
release/chechen_language/chechen_language.ce-latn.chechen/LICENSE.md
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
The MIT License (MIT) | ||
|
||
Copyright © 2025 Chechen Language | ||
|
||
Permission is hereby granted, free of charge, to any person obtaining a copy | ||
of this software and associated documentation files (the "Software"), to deal | ||
in the Software without restriction, including without limitation the rights | ||
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell | ||
copies of the Software, and to permit persons to whom the Software is | ||
furnished to do so, subject to the following conditions: | ||
|
||
The above copyright notice and this permission notice shall be included in all | ||
copies or substantial portions of the Software. | ||
|
||
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR | ||
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, | ||
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE | ||
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER | ||
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, | ||
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE | ||
SOFTWARE. |
23 changes: 23 additions & 0 deletions
release/chechen_language/chechen_language.ce-latn.chechen/README.md
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,23 @@ | ||
Chechen lexical model | ||
=================== | ||
|
||
Description | ||
----------- | ||
Lexical model for the Chechen language using the 1992 Latin script. Word forms are sourced from the corpus at corpora.dosham.info, created by a member of our team and maintained as part of our efforts to support the Chechen language. | ||
|
||
Links | ||
----- | ||
|
||
Copyright | ||
--------- | ||
See [LICENSE.md](LICENSE.md) | ||
|
||
Supported Platforms | ||
------------------- | ||
* iPhone | ||
* iPad | ||
* Android phone | ||
* Android tablet | ||
* Mobile devices | ||
* Tablet devices | ||
|
10 changes: 10 additions & 0 deletions
...se/chechen_language/chechen_language.ce-latn.chechen/chechen_language.ce-latn.chechen.kpj
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
<?xml version="1.0" encoding="utf-8"?> | ||
<KeymanDeveloperProject> | ||
<Options> | ||
<Version>2.0</Version> | ||
<CompilerWarningsAsErrors>True</CompilerWarningsAsErrors> | ||
<CheckFilenameConventions>True</CheckFilenameConventions> | ||
<SkipMetadataFiles>True</SkipMetadataFiles> | ||
<ProjectType>lexicalmodel</ProjectType> | ||
</Options> | ||
</KeymanDeveloperProject> |
7 changes: 7 additions & 0 deletions
...hen_language/chechen_language.ce-latn.chechen/chechen_language.ce-latn.chechen.model_info
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,7 @@ | ||
{ | ||
"license": "mit", | ||
"languages": [ | ||
"ce-Latn" | ||
], | ||
"description": "Lexical model for the Chechen language using the 1992 Latin script. Word forms are sourced from the corpus at corpora.dosham.info, created by a member of our team and maintained as part of our efforts to support the Chechen language." | ||
} |
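The file names in this commit appear to follow an `author.bcp47.uniq` pattern (`chechen_language.ce-latn.chechen`), and the `languages` array above lists `ce-Latn`. The following is a minimal sketch of a consistency check between the two. The helper names `parseModelId` and `idMatchesLanguages` are hypothetical and are not part of Keyman's tooling; the three-part ID rule is an assumption drawn from the paths shown in this diff.

```typescript
// Hypothetical helpers for illustration only; not part of Keyman's tooling.
interface ModelIdParts {
  author: string;
  bcp47: string;
  uniq: string;
}

// Split an ID such as "chechen_language.ce-latn.chechen" into its three parts.
// Returns null when the ID does not have exactly three non-empty components.
function parseModelId(id: string): ModelIdParts | null {
  const parts = id.split('.');
  if (parts.length !== 3 || parts.some(p => p.length === 0)) {
    return null;
  }
  return { author: parts[0], bcp47: parts[1], uniq: parts[2] };
}

// True when the lowercased form of at least one language tag from
// .model_info equals the middle component of the model ID.
function idMatchesLanguages(id: string, languages: string[]): boolean {
  const parsed = parseModelId(id);
  if (parsed === null) {
    return false;
  }
  return languages.some(tag => tag.toLowerCase() === parsed.bcp47);
}

// Values taken from this commit's .model_info and file names.
const modelInfo = { license: 'mit', languages: ['ce-Latn'] };
console.log(idMatchesLanguages('chechen_language.ce-latn.chechen', modelInfo.languages));
```

A check like this would flag a model whose directory was copied from another project (here, the Burushaski model) without its language tag being updated.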
125 changes: 62 additions & 63 deletions
...rce/burusho.bsk-latn.burushaski.model.kps → ...hechen_language.ce-latn.chechen.model.kps
100644 → 100755
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,63 +1,62 @@ | ||
<?xml version="1.0" encoding="utf-8"?> | ||
<Package> | ||
<System> | ||
<KeymanDeveloperVersion>17.0.332.0</KeymanDeveloperVersion> | ||
<FileVersion>12.0</FileVersion> | ||
</System> | ||
<Options> | ||
<ExecuteProgram></ExecuteProgram> | ||
<ReadMeFile>readme.htm</ReadMeFile> | ||
<LicenseFile>..\LICENSE.md</LicenseFile> | ||
<WelcomeFile>welcome.htm</WelcomeFile> | ||
<MSIFileName></MSIFileName> | ||
<MSIOptions></MSIOptions> | ||
</Options> | ||
<StartMenu> | ||
<Folder></Folder> | ||
<Items/> | ||
</StartMenu> | ||
<Info> | ||
<Name URL="">Burushaski</Name> | ||
<Copyright URL="">Copyright © Burusho Quwatz</Copyright> | ||
<Author URL="">Burusho</Author> | ||
<Version URL="">1.27</Version> | ||
<Description URL="">This is lexical model for Burushaski Latin Script.</Description> | ||
</Info> | ||
<Files> | ||
<File> | ||
<Name>..\build\burusho.bsk-latn.burushaski.model.js</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.js</FileType> | ||
</File> | ||
<File> | ||
<Name>welcome.htm</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.htm</FileType> | ||
</File> | ||
<File> | ||
<Name>readme.htm</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.htm</FileType> | ||
</File> | ||
<File> | ||
<Name>..\LICENSE.md</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.md</FileType> | ||
</File> | ||
</Files> | ||
<Keyboards/> | ||
<LexicalModels> | ||
<LexicalModel> | ||
<Name>Burushaski</Name> | ||
<ID>burusho.bsk-latn.burushaski</ID> | ||
<Languages> | ||
<Language ID="bsk-Latn">Burushaski</Language> | ||
</Languages> | ||
</LexicalModel> | ||
</LexicalModels> | ||
<Strings/> | ||
</Package> | ||
<?xml version="1.0" encoding="utf-8"?> | ||
<Package> | ||
<System> | ||
<KeymanDeveloperVersion>17.0.333.0</KeymanDeveloperVersion> | ||
<FileVersion>12.0</FileVersion> | ||
</System> | ||
<Options> | ||
<ExecuteProgram></ExecuteProgram> | ||
<ReadMeFile>readme.htm</ReadMeFile> | ||
<LicenseFile>..\LICENSE.md</LicenseFile> | ||
<MSIFileName></MSIFileName> | ||
<MSIOptions></MSIOptions> | ||
</Options> | ||
<StartMenu> | ||
<Folder></Folder> | ||
<Items/> | ||
</StartMenu> | ||
<Info> | ||
<Name URL="">Chechen</Name> | ||
<Copyright URL="">Copyright © Chechen Language</Copyright> | ||
<Author URL="">Chechen Language</Author> | ||
<Version URL="">1.0</Version> | ||
<Description URL="">Lexical model for the Chechen language using the 1992 Latin script. Word forms are sourced from the corpus at corpora.dosham.info, created by a member of our team and maintained as part of our efforts to support the Chechen language.</Description> | ||
</Info> | ||
<Files> | ||
<File> | ||
<Name>..\build\chechen_language.ce-latn.chechen.model.js</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.js</FileType> | ||
</File> | ||
<File> | ||
<Name>welcome.htm</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.htm</FileType> | ||
</File> | ||
<File> | ||
<Name>readme.htm</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.htm</FileType> | ||
</File> | ||
<File> | ||
<Name>..\LICENSE.md</Name> | ||
<Description></Description> | ||
<CopyLocation>0</CopyLocation> | ||
<FileType>.md</FileType> | ||
</File> | ||
</Files> | ||
<Keyboards/> | ||
<LexicalModels> | ||
<LexicalModel> | ||
<Name>Chechen</Name> | ||
<ID>chechen_language.ce-latn.chechen</ID> | ||
<Languages> | ||
<Language ID="ce-Latn">Chechen</Language> | ||
</Languages> | ||
</LexicalModel> | ||
</LexicalModels> | ||
<Strings/> | ||
</Package> |
20 changes: 20 additions & 0 deletions
...anguage/chechen_language.ce-latn.chechen/source/chechen_language.ce-latn.chechen.model.ts
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
/* | ||
Chechen 1.0 | ||
Lexical model for the Chechen language using the 1992 Latin script. Word forms are sourced from the corpus at corpora.dosham.info, | ||
created by a member of our team and maintained as part of our efforts to support the Chechen language. | ||
*/ | ||
|
||
const source: LexicalModelSource = { | ||
format: 'trie-1.0', | ||
wordBreaker: 'default', | ||
sources: ['wordlist.tsv'], | ||
languageUsesCasing: true, | ||
punctuation: { | ||
quotesForKeepSuggestion: { | ||
open: "\"", | ||
close: "\"" | ||
} | ||
} | ||
}; | ||
export default source; |
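The model source above points at `wordlist.tsv`, whose contents are not shown in this diff. As a rough sketch of what a reader of that file might look like, the code below assumes a tab-separated layout of word, optional count, optional comment, with `#` marking comment lines. The function `parseWordlist`, the default count of 1, and the sample words are all assumptions for illustration, not the Keyman compiler's actual implementation.

```typescript
// Hypothetical reader for a wordlist TSV; the column layout is an assumption.
interface WordlistEntry {
  word: string;
  count: number;
}

function parseWordlist(tsv: string): WordlistEntry[] {
  const entries: WordlistEntry[] = [];
  for (const rawLine of tsv.split(/\r?\n/)) {
    // Drop a leading byte-order mark if the file was saved with one.
    const line = rawLine.replace(/^\uFEFF/, '');
    // Skip blank lines and comment lines.
    if (line.trim() === '' || line.startsWith('#')) {
      continue;
    }
    const fields = line.split('\t');
    const word = (fields[0] ?? '').trim();
    if (word === '') {
      continue;
    }
    // Assumption: a missing or non-numeric count is treated as 1.
    const parsed = Number.parseInt(fields[1] ?? '', 10);
    entries.push({ word, count: Number.isNaN(parsed) ? 1 : parsed });
  }
  return entries;
}

// Placeholder words, not taken from the Chechen corpus.
const sample = '# word\tcount\nalpha\t10\nbeta\n';
console.log(parseWordlist(sample));
```

Keeping the counts alongside the words matters because a frequency-weighted model ranks suggestions by them; a plain list of words would give every entry the same weight.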
24 changes: 24 additions & 0 deletions
release/chechen_language/chechen_language.ce-latn.chechen/source/readme.htm
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
<!DOCTYPE html> | ||
<html lang="en"> | ||
<head> | ||
<meta charset="utf-8"> | ||
<meta name="viewport" content="width=device-width, initial-scale=1"> | ||
<title>Chechen</title> | ||
<style type="text/css"> | ||
p { font: 10pt Tahoma; } | ||
h1 { font: bold 16pt Tahoma; color: #4444cc; margin-bottom: 2px } | ||
h2 { font: bold 12pt Tahoma; color: #4444cc; } | ||
</style> | ||
</head> | ||
<body> | ||
|
||
<h1>Chechen</h1> | ||
|
||
<p> | ||
Lexical model for the Chechen language using the 1992 Latin script. Word forms are sourced from the corpus at corpora.dosham.info, created by a member of our team and maintained as part of our efforts to support the Chechen language. | ||
</p> | ||
|
||
<p>Copyright © Chechen Language</p> | ||
|
||
</body> | ||
</html> |
28 changes: 28 additions & 0 deletions
release/chechen_language/chechen_language.ce-latn.chechen/source/welcome.htm
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
<!DOCTYPE html> | ||
<html lang="en"> | ||
<head> | ||
<meta charset="utf-8"> | ||
<meta name="viewport" content="width=device-width, initial-scale=1"> | ||
<title>Start Using Chechen</title> | ||
<style type="text/css"> | ||
p { font: 10pt Tahoma; } | ||
h1 { font: bold 16pt Tahoma; color: #4444cc; margin-bottom: 2px } | ||
h2 { font: bold 12pt Tahoma; color: #4444cc; } | ||
</style> | ||
</head> | ||
<body> | ||
|
||
<h1>Start Using Chechen</h1> | ||
|
||
<p> | ||
Lexical model for the Chechen language using the 1992 Latin script. Word forms are sourced from the corpus at corpora.dosham.info, created by a member of our team and maintained as part of our efforts to support the Chechen language. | ||
</p> | ||
|
||
<h1>Wordlist Model Documentation</h1> | ||
|
||
<!-- Insert HTML documentation here --> | ||
|
||
<p>Copyright © Chechen Language</p> | ||
|
||
</body> | ||
</html> |