This lesson provides an introduction to R
, RStudio, and the tidyverse
. It is targeted at participants who are not familiar with R
or who use base R
for much of their productivity.
At the end of this lesson, participants should be able to:
- Create and manage projects using
R
, including withR
projects andR
notebooks - Use
readr
to import data from.csv
files - Employ
dplyr
to clean and manipulate data - Use
skimir
to calculate descriptive statistics - Use
janitor
to create frequency tables - Create basic plots of data using
ggplot2
- The
SETUP.md
file in thereferences/
directory contains a list of packages required for this lesson - The
notebook/
directory contains a sample notebook and a completed notebook that emulate the template we use during lessons - The lesson slides provide an overview of
R
, RStudio, and project organization - The
references/
directory also contains other notes on changes to the repository, key topics, terms, data sources, and software. - The
notebooks/
directory contains three notebooks on (1) data cleaning, (2) data exploration, and (3) data plotting. Completed versions of the notebooks are also included.
- The
tidyverse
website - Hadley Wickham's excellent R for Data Science provides an excellent overview of how to use the
tidyverse
for various tasks - RStudio's cheatsheets:
- Github's Mastering Markdown guide
You can download this lesson to your Desktop easily using usethis
:
usethis::use_course("https://github.com/slu-dss/atlasWeek/archive/master.zip")
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.
In this workshop, we’ll use publicly available data about St. Louis to introduce open source data analysis software (R
) that can be used to explore and respond to questions that are of concern to communities worldwide. We’ll use two local datasets to introduce participants to the art of data analysis: information on murders and lead poisoning for St. Louis.
Our goal is to showcase how to use basic data science skills to evaluate problems and meet the needs of different communities or organizations. The workshop will be hands-on and will include a brief introduction to the software and local data being used, demonstration of how to import and explore data, and discussion of the different types of analysis that can be performed and the diverse questions that can be addressed in R
.
The Sam and Marilyn Fox Atlas Week Program launched in Spring of 2001 as a way to recognize the international dimension of Saint Louis University's academic programs and to celebrate SLU's role in international education and service in light of our Jesuit tradition. One of the main goals of the Atlas Program is to increase awareness of the global issues that confront us today in an effort not only to promote discussion, but also to inspire and inform action. It focuses on what we as global citizens can do to contribute to a better life for all people now and in the future. The Atlas Program epitomizes the Jesuit mission of SLU and highlights how SLU truly is a place where "knowledge touches lives."
The SLU Data Science Seminar (DSS) is a collaborative, interdisciplinary group at Saint Louis University focused on building researchers’ data science skills using open source software. We currently host seminars focused on the programming language R. The SLU DSS is co-organized by Christina Gacia, Ph.D., Kelly Lovejoy, Ph.D., and Christopher Prener, Ph.D.. You can keep up with us here on GitHub, on our website, and on Twitter.
Founded in 1818, Saint Louis University is one of the nation’s oldest and most prestigious Catholic institutions. Rooted in Jesuit values and its pioneering history as the first university west of the Mississippi River, SLU offers nearly 13,000 students a rigorous, transformative education of the whole person. At the core of the University’s diverse community of scholars is SLU’s service-focused mission, which challenges and prepares students to make the world a better, more just place.