ResearchPaperClassifier

A classification algorithm for identifying properties about a scientific paper from its text, particularly whether it is AI generated or not

Prework for tutorial:

Create a python virtual environment: python -m venv path/to/venv/folder
Activate the environment (dependent on shell/system). In bash on linux: source path/to/venv/folder/bin/activate
Clone and enter this repo
Install the requirements: pip install -r requirements.txt (this may take awhile because of pytorch)
Run wandb login
Follow the instructions to create a profile if you don't have one, then get and paste your api key NOTE: If you don't have a decent GPU with cuda installed, it will use CPU to train, and that will be very, very slow.
Run nvidia-smi to check

How to use

go to http://73.254.3.61:5003/ to use the classifier by providing an abstract, receiving a prediction on whether it is fake or real

How it works

The discriminator is trained using a Generative Adversarial Network architecture using a finetuned pretrained GPT2 as the Generator and an non finetuned pretrained GPT2 as the discriminator.

It does not work very well at the moment. More computational power and training time is needed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

ResearchPaperClassifier

Prework for tutorial:

How to use

How it works

Files

README.md

Latest commit

History

README.md

File metadata and controls

ResearchPaperClassifier

Prework for tutorial:

How to use

How it works