Question Classifier

Question Classifier is a machine learning model that predicts whether a given question is health-related. This repository contains the code and data to train and test the classifier.

The book "Natural Language Processing with Transformers" (O'Reilly) was consulted to understand the problem and to write the code. The corresponding Jupyter notebook is available here: https://github.com/nlp-with-transformers/notebooks/blob/main/02_classification.ipynb

Overview

The purpose of this project is to create a classifier that can accurately identify questions related to health topics. This can be useful for various applications, such as filtering non-healthcare-related queries out of a question-answering system, assisting medical chatbots, or categorizing web-scraped content.

The model is built in Python on top of the DistilBERT base model (uncased), which is a distilled version of the BERT base model.
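
As a rough illustration, DistilBERT base (uncased) could be loaded with a two-class classification head through the Hugging Face Transformers library roughly as follows. This is a minimal sketch, not the repository's notebook code: `distilbert-base-uncased` is the public Hub checkpoint name, while the label names, the label order, and the helpers `load_classifier` and `predict_label` are assumptions made for this example. The classification head is randomly initialized until the model is fine-tuned, so predictions are only meaningful after training.

```python
# Sketch only: label names and helper functions are illustrative assumptions,
# not taken from the repository's notebook.
CHECKPOINT = "distilbert-base-uncased"  # public Hugging Face Hub checkpoint
ID2LABEL = {0: "not_health", 1: "health"}  # assumed label order
LABEL2ID = {name: idx for idx, name in ID2LABEL.items()}


def load_classifier(checkpoint: str = CHECKPOINT):
    """Load the tokenizer and DistilBERT with a 2-class classification head."""
    # Imported lazily so the constants above are usable without transformers.
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(checkpoint)
    model = AutoModelForSequenceClassification.from_pretrained(
        checkpoint,
        num_labels=len(ID2LABEL),
        id2label=ID2LABEL,
        label2id=LABEL2ID,
    )
    return tokenizer, model


def predict_label(question: str, tokenizer, model) -> str:
    """Return the predicted label name for a single question."""
    import torch

    inputs = tokenizer(question, return_tensors="pt", truncation=True)
    model.eval()
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, num_labels)
    return ID2LABEL[int(logits.argmax(dim=-1))]
```

Calling `tokenizer, model = load_classifier()` downloads the checkpoint on first use; `predict_label("Is ibuprofen safe during pregnancy?", tokenizer, model)` then returns one of the two assumed label names.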

Dataset

The dataset used in this project is a self-composed, labeled, and balanced dataset. I used 1900 hand-labeled questions from the Stanford Question Answering Dataset (80 of which were health questions) and 1740 questions from healthcare Q&A datasets such as AskDocs (https://github.com/ju-resplande/askD) and others that can be found here: https://github.com/LasseRegin/medical-question-answer-data
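
The balance claim can be checked with a little arithmetic. The sketch below assumes that all 1740 questions from the healthcare datasets carry the health label and that the remaining SQuAD questions carry the non-health label; the README does not state this explicitly, and the variable names are chosen for this example.

```python
# Counts quoted in the dataset description.
squad_total = 1900   # hand-labeled SQuAD questions
squad_health = 80    # SQuAD questions labeled as health
other_health = 1740  # assumed to be health questions only

health = squad_health + other_health
non_health = squad_total - squad_health
total = squad_total + other_health

print(f"health={health}, non_health={non_health}, total={total}")
# health=1820, non_health=1820, total=3640
```

Under these assumptions both classes contain 1820 questions, which matches the description of the dataset as balanced.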

The dataset is divided into training and testing sets, which can be found in the data folder of this repository.
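
The README does not say how the split was produced. One common approach is a stratified split, which keeps the class balance in both sets. The sketch below uses only the standard library; the function name `stratified_split`, the 20% test fraction, the seed, and the toy questions are all assumptions for illustration and do not describe the files in the data folder.

```python
import random
from collections import defaultdict


def stratified_split(examples, test_fraction=0.2, seed=42):
    """Split (question, label) pairs so each label keeps its share in both sets."""
    by_label = defaultdict(list)
    for question, label in examples:
        by_label[label].append((question, label))

    rng = random.Random(seed)  # fixed seed for a reproducible split
    train, test = [], []
    for label in sorted(by_label):
        items = by_label[label]
        rng.shuffle(items)
        n_test = round(len(items) * test_fraction)
        test.extend(items[:n_test])
        train.extend(items[n_test:])

    # Shuffle again so the labels are not grouped together.
    rng.shuffle(train)
    rng.shuffle(test)
    return train, test


# Toy data standing in for the real questions (invented for illustration).
examples = [(f"health question {i}", 1) for i in range(10)]
examples += [(f"other question {i}", 0) for i in range(10)]

train, test = stratified_split(examples)
print(len(train), len(test))  # 16 4
```

With ten toy questions per class and a 20% test fraction, each class contributes two questions to the test set and eight to the training set.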

SinaRampe/question_classifier

Folders and files

Latest commit

History

Repository files navigation

Question Classifier

Table of Contents

Overview

Dataset

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages