Skip to content

team-data-science/learning-apache-spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Learning Apache Spark Course

Hi and welcome to the repository of the Learning Spark course. You can find all the source codes of the Jupyter notebooks here.

The full course is available in our Data Engineering Academy at

Course Contents

  • Why Spark
  • How Spark Works
  • Set up your dev environment 
with Docker & Jupyter
  • Work with DataFrames (JSON & CSV)
  • Introduction into SparkSQL
  • Coding with RDDs
  • Conclusion

Source codes

  • 01_JSON_Transformations
  • 02_CSV_Schemas
  • 03_Working_with_DataFrames
  • 04_SparkSQL
  • 05_Working_With_RDDs

Interesting Links

About

Repository for Apache Spark course at Team Data Science

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published