Skip to content

Optimize Pandas operations with Index Optimization, Memory Optimization, and Vectorization

License

Notifications You must be signed in to change notification settings

data-max-hq/optimize-pandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Optimize-Pandas

Jupyter Notebook showing how to optimize Pandas operations with Index Optimization, Memory Optimization, and Vectorization.

How to Benchmark

Use timeit and lineprofiler to measure performance.

Optimization Techniques

Techniques to optimize the performance of your pandas dataframe operations:

  • Use group_by instead of filter for categorical columns
  • Prefer join instead of merge for joining dataframes
  • Filter dataframes before joining them
  • Use inplace option to optimize memory usage
  • Use vectorization to improve speed of transformation operations

References

Made with ❤️ by Data Max

About

Optimize Pandas operations with Index Optimization, Memory Optimization, and Vectorization

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published