File | Discription |
---|---|
groupby_parse_tokenize.ipynb | Groups the data by document. Parses the HTML text. Then it tokenize it. Creates the file: tokenized_data |
stemming.ipynb | Removes stopwords from tokenized file and stems the words. Creates the file: stemmed_data |
evaluate_query_expansion.ipynb | Computes the Normalized Discounted Cumulative Gain for the non stemmed corpus (tokenized_data) and stemmed corpus (stemmed_data) |
query_expansion.ipynb | creates the expanded queries |
results_q_expansion.ipynb | averages nDCG scores and plots them |
forked from IrProject88/IrProject
-
Notifications
You must be signed in to change notification settings - Fork 0
stanvanlier/IrProject
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published
Languages
- Jupyter Notebook 100.0%