Skip to content

Latest commit

 

History

History
23 lines (19 loc) · 1.16 KB

README.md

File metadata and controls

23 lines (19 loc) · 1.16 KB

Optimize-Pandas

Jupyter Notebook showing how to optimize Pandas operations with Index Optimization, Memory Optimization, and Vectorization.

How to Benchmark

Use timeit and lineprofiler to measure performance.

Optimization Techniques

Techniques to optimize the performance of your pandas dataframe operations:

  • Use group_by instead of filter for categorical columns
  • Prefer join instead of merge for joining dataframes
  • Filter dataframes before joining them
  • Use inplace option to optimize memory usage
  • Use vectorization to improve speed of transformation operations

References

Made with ❤️ by Data Max