Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
.gitignore		.gitignore
00_data_extraction_preparation_assembly.ipynb		00_data_extraction_preparation_assembly.ipynb
LICENSE		LICENSE
README.md		README.md
pipeline_ml.ipynb		pipeline_ml.ipynb

Repository files navigation

ai-content-moderator

Author: Chris Gian Project Description:

Build an API that will determine whether content is appropriate where appropriate is defined as whether it violates a terms of service agreement you would see a typical social media company have.

Motivation / Value Proposition

Sample Terms of Service

Spotify® Support Community Terms! Spotify® Support Community Guidelines !

8. Always use an appropriate and respectful language when you post information in the Community. Avoid racist, sexist, abusive, harassing, defamatory, pornographic, threatening, obscene, condescending or otherwise offensive language that could be considered detrimental to other users, or Spotify employees or moderators.

9. Do not post information or create threads for the promotion or advertisement of commercial products or services.

From the above, types of content that violate the following will be removed unilaterally:

racist, sexist, abusive, harassing, defamatory, pornographic, threatening, obscene, condescending, offensive

From this list, I will target the most discrete of these categories:

Racism, sexism, threatening

Datasets:

Given above objectives the following data sources will be used:

General Resource:
- Catalog of Hate Speech Datasets
Sexism:
- Automatic Misogyny Identification
  - notes: need to get password from that team.
Online Bullying:
- Bullying
Fox News Hate Speech:
- Detecting Online Hate Speech Using Context Aware Models

Evaluation:

Internal Validity: Can the model moderate content within the confines of this experiment (train-test split / K-Folds)?
External Validity: Can the model moderate content outside of the experiment (real-world data)?
Measures: TBD

Project Details

TBD

TODO

identify quality datasets (one, two, three)
create method to extract csv of ids and labels and pass through twitter api

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ai-content-moderator

Motivation / Value Proposition

Sample Terms of Service

Datasets:

Evaluation:

Project Details

TODO

About

Releases

Packages

Languages

License

chrisgian/ai-content-moderator

Folders and files

Latest commit

History

Repository files navigation

ai-content-moderator

Motivation / Value Proposition

Sample Terms of Service

Datasets:

Evaluation:

Project Details

TODO

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages