nabihanaqvie/SANA-Project

Latest commit: e683e55 · Jan 5, 2022 (10 commits)

SANA-Project

The SANA project's goal is to create an Islam-specific database for research purposes. My contribution to this goal is a model that predicts a record's category from its Abstract and Title.

To accomplish this, I have so far split the work into two variants, one that keeps punctuation and one that removes it, to see how much overall noise punctuation creates.
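
The two preprocessing variants could look roughly like the sketch below. It is only an illustration, not the project's actual code: the function names, the lowercasing step, and the sample titles are all assumptions, and `string.punctuation` covers ASCII punctuation only, so Arabic marks such as `،` would need to be added separately.

```python
import string

# Translation table that deletes every ASCII punctuation character.
_STRIP_PUNCT = str.maketrans("", "", string.punctuation)


def keep_punctuation(text: str) -> str:
    """Variant A (hypothetical name): lowercase and collapse whitespace only."""
    return " ".join(text.lower().split())


def remove_punctuation(text: str) -> str:
    """Variant B (hypothetical name): as variant A, but drop ASCII punctuation."""
    return " ".join(text.lower().translate(_STRIP_PUNCT).split())


def vocabulary(texts, preprocess):
    """Set of whitespace-separated tokens produced by one preprocessing variant."""
    return {token for text in texts for token in preprocess(text).split()}


# Made-up sample titles, used only to compare the two variants.
sample = ["Islamic law: sources, and methods.", "Sources of Islamic law"]
vocab_with = vocabulary(sample, keep_punctuation)
vocab_without = vocabulary(sample, remove_punctuation)

print(len(vocab_with), len(vocab_without))
```

With punctuation kept, tokens such as `law:` and `law` count as different words, so the vocabulary is larger. Comparing vocabulary sizes is one simple way to gauge the noise; comparing classifier accuracy on each variant would be the more direct test.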

Next Steps:

  1. Create a dictionary or import a list of Arabic names for grouping spelling variants. Ex: Mohammed and Mohamad --> Mohammad
  2. Compare removing only some punctuation marks versus removing others
  3. Write the machine learning classifiers as a pipeline
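
As a rough sketch of how steps 1 and 2 might fit together, the snippet below chains a name-normalization step and a selective punctuation filter. Everything in it is an assumption made for illustration: `NAME_VARIANTS` is a tiny hypothetical table (a real one would come from a curated list of Arabic names), the function names are invented, and the classifier from step 3 is left out.

```python
import re
import string

# Hypothetical variant table: lowercase spelling variant -> canonical form.
NAME_VARIANTS = {
    "mohammed": "Mohammad",
    "mohamad": "Mohammad",
}

_WORD = re.compile(r"[A-Za-z]+")


def normalize_names(text, variants=NAME_VARIANTS):
    """Replace each known spelling variant with its canonical form."""
    def swap(match):
        word = match.group(0)
        return variants.get(word.lower(), word)
    return _WORD.sub(swap, text)


def remove_punctuation(text, keep=""):
    """Drop ASCII punctuation except the characters listed in `keep`."""
    drop = "".join(ch for ch in string.punctuation if ch not in keep)
    return text.translate(str.maketrans("", "", drop))


def preprocess(text, steps):
    """Apply each preprocessing step in order."""
    for step in steps:
        text = step(text)
    return text


cleaned = preprocess(
    "Mohammed and Mohamad: al-Ghazali's works.",
    [normalize_names, lambda t: remove_punctuation(t, keep="-'")],
)
print(cleaned)  # Mohammad and Mohammad al-Ghazali's works
```

Keeping the hyphen and apostrophe here is just one possible choice; the `keep` argument makes it easy to try different subsets and compare the results.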
