Tutorial on Text Analytics Using R (AI4PH 2022)

Welcome to the Artificial Intelligence for Public Health (AI4PH) event in 2022.

The tutorial and data challenge materials can be found at: https://bookdown.org/tianyuan09/ai4ph2022/.

This online tutorial will accompany two sessions:

  • Tutorial on text analytics with R
    • data pre-processing
      • regular expressions
      • tokenization
      • stopwords
      • stemming
      • exploratory data analysis
    • supervised learning (classification models)
    • unsupervised learning (topic modelling)
  • Data Challenge using the N2C2 NLP Research Datasets
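
The pre-processing steps listed above can be sketched in a few lines of base R. This is a minimal illustration, not the tutorial's own code: the example sentence, the tiny stopword list, and the naive suffix stripping (a stand-in for a real stemmer such as Porter's) are all assumptions made for this sketch.

```r
# Illustrative tweet-like text (made up for this sketch, not taken from the datasets).
text <- "Getting vaccinated at the clinic today! Visit https://example.org #PublicHealth"

# Regular expressions: drop URLs, then replace anything that is not a letter or space.
clean <- gsub("https?://\\S+", "", text, perl = TRUE)
clean <- gsub("[^A-Za-z ]", " ", clean)

# Tokenization: lowercase, trim, and split on runs of spaces.
tokens <- strsplit(tolower(trimws(clean)), " +")[[1]]

# Stopwords: a tiny hand-picked list (assumption), not a standard lexicon.
stopword_list <- c("the", "a", "an", "and", "at", "to", "of", "in", "is")
tokens <- tokens[!tokens %in% stopword_list]

# Stemming stand-in: naive suffix stripping, NOT the Porter algorithm.
stems <- sub("(ing|ed|s)$", "", tokens)

print(stems)
```

In practice a dedicated text-mining package would replace the hand-rolled stopword list and suffix rule; the sketch only shows how the four steps chain together.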

The Twitter Datasets

This repository contains the two Twitter datasets you will use in the tutorial session:

  • TwitterDataforClassification.csv
  • TwitterDataforTopicModelling.csv

The tutorial site was created with R Markdown and bookdown (https://github.com/rstudio/bookdown).