Welcome to the Artificial Intelligence for Public Health (AI4PH) event in 2022.
The tutorial and data challenge materials can be found at: https://bookdown.org/tianyuan09/ai4ph2022/.
This online tutorial will accompany two sessions:
- Tutorial on text analytics with R
  - Data pre-processing
    - regular expressions
    - tokenization
    - stopwords
    - stemming
  - Exploratory data analysis
  - Supervised learning (classification models)
  - Unsupervised learning (topic modelling)
- Data Challenge using the N2C2 NLP Research Datasets
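
As a rough illustration of the pre-processing steps listed above, here is a minimal sketch in base R. The example tweets, the tiny stopword list, and the `toy_stem` helper are all made up for illustration and are not taken from the tutorial. In practice a dedicated stemmer such as `SnowballC::wordStem` would replace the toy suffix stripping shown here.

```r
# Two made-up example tweets (not from the repository's datasets).
tweets <- c(
  "Getting my flu shot today! https://example.org/clinic #PublicHealth",
  "@friend Vaccines are helping communities stay healthy"
)

# 1. Regular expressions: lowercase, then strip URLs, mentions, and non-letters.
clean <- tolower(tweets)
clean <- gsub("https?://\\S+", " ", clean, perl = TRUE)  # URLs
clean <- gsub("@\\w+", " ", clean, perl = TRUE)           # @mentions
clean <- gsub("[^a-z\\s]", " ", clean, perl = TRUE)       # digits, punctuation, '#'

# 2. Tokenization: split each tweet on runs of whitespace.
tokens <- strsplit(trimws(clean), "\\s+")

# 3. Stopwords: drop very common words (hypothetical, deliberately tiny list).
stopwords <- c("my", "are", "the", "a", "to")
tokens <- lapply(tokens, function(x) x[!x %in% stopwords])

# 4. Stemming: toy suffix stripping, NOT a real stemming algorithm.
toy_stem <- function(words) sub("(ing|s)$", "", words)
stems <- lapply(tokens, toy_stem)

print(stems)
```

The steps run in the order of the list: clean with regular expressions, tokenize, filter stopwords, then stem. The toy stemmer produces rough stems, which is why real analyses usually rely on an established stemming library.
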
This repository contains the Twitter datasets you will use in the tutorial session:
- TwitterDataforClassification.csv
- TwitterDataforTopicModelling.csv
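
One way to load these files into R is sketched below. It assumes the working directory is the root of this repository. The `load_dataset` helper is hypothetical, and the column layout of the files is not assumed, so the sketch only inspects the structure.

```r
# Hypothetical helper: read a CSV file if it exists, otherwise return NULL.
load_dataset <- function(path) {
  if (!file.exists(path)) {
    message("File not found: ", path)
    return(NULL)
  }
  read.csv(path, stringsAsFactors = FALSE)
}

# File names as listed in this README; paths assume the repository root.
classification_data <- load_dataset("TwitterDataforClassification.csv")
topic_data <- load_dataset("TwitterDataforTopicModelling.csv")

# Inspect the columns before doing any analysis.
if (!is.null(classification_data)) str(classification_data)
if (!is.null(topic_data)) str(topic_data)
```
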
The tutorial site was created with R Markdown and bookdown (https://github.com/rstudio/bookdown).