Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chapter 6 - dataset size #646

Open
profvjreddi opened this issue Jan 23, 2025 · 0 comments
Open

Chapter 6 - dataset size #646

profvjreddi opened this issue Jan 23, 2025 · 0 comments
Assignees
Labels
improvement Improve existing content

Comments

@profvjreddi
Copy link
Contributor

profvjreddi commented Jan 23, 2025

@18jeffreyma @mmaz I think it will be interesting to include how dataset sizes have been growing in the data engineering chapter.

Could you look through https://epoch.ai/blog/trends-in-training-dataset-sizes and see what plot might make sense?

Something at the beginning would be good for students to learn just how big data is getting in big ML systems. And then, of course, we should say that edge ML and mobile ML data is even later, but just not as structured and captured.

@profvjreddi profvjreddi added the improvement Improve existing content label Jan 23, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improve existing content
Projects
None yet
Development

No branches or pull requests

3 participants