CHANGELOG.md

vaex 2.6.0-dev (unreleased)

vaex-core 1.5.0-dev

  • Features
    • df.evaluate_iterator for efficient parallel chunked evaluation #515
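
The chunked evaluation pattern from #515 can be illustrated without vaex. The following is a minimal, stdlib-only sketch and not vaex's implementation: `iter_chunks`, `chunk_size`, and the `(i1, i2, chunk)` tuple shape are hypothetical names chosen for this illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def iter_chunks(values, func, chunk_size=4):
    """Yield (i1, i2, chunk), where chunk is func applied to values[i1:i2].

    Hypothetical helper for illustration only. Chunks are evaluated in a
    thread pool, but they are yielded in their original order.
    """
    bounds = [
        (start, min(start + chunk_size, len(values)))
        for start in range(0, len(values), chunk_size)
    ]

    def evaluate(bound):
        i1, i2 = bound
        return [func(v) for v in values[i1:i2]]

    with ThreadPoolExecutor() as pool:
        # pool.map preserves the order of its inputs.
        for (i1, i2), chunk in zip(bounds, pool.map(evaluate, bounds)):
            yield i1, i2, chunk


data = list(range(10))
total = 0
for i1, i2, chunk in iter_chunks(data, lambda v: v * v, chunk_size=4):
    # Each chunk can be consumed and discarded, so memory use stays bounded.
    total += sum(chunk)
```

The consumer only ever holds one chunk of results at a time, which is the point of iterating instead of evaluating the whole column at once.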

vaex-ml 0.7.1-dev

  • Performance
    • IncrementalPredictor uses parallel chunked support (2x speedup possible) #515
  • Fixes
    • IncrementalPredictor: epochs now iterate over the whole DataFrame instead of over each batch #523
    • Rename vaex.ml.sklearn.SKLearnPredictor to vaex.ml.sklearn.Predictor #524
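
The difference between batch-level and DataFrame-level epochs in #523 comes down to loop order. The sketch below is one reading of that entry, under stated assumptions: `RecordingModel` is a hypothetical stand-in for a model with a `partial_fit`-style method, and neither function is taken from vaex-ml.

```python
class RecordingModel:
    """Hypothetical stand-in that records the batches it is trained on."""

    def __init__(self):
        self.seen = []

    def partial_fit(self, batch):
        self.seen.append(batch)


def batches(rows, batch_size):
    """Split rows into consecutive tuples of at most batch_size items."""
    for start in range(0, len(rows), batch_size):
        yield tuple(rows[start:start + batch_size])


def fit_epochs_per_batch(model, rows, batch_size, epochs):
    # Batch-level epochs: each batch is repeated before moving to the next.
    for batch in batches(rows, batch_size):
        for _ in range(epochs):
            model.partial_fit(batch)


def fit_epochs_over_data(model, rows, batch_size, epochs):
    # Data-level epochs: every epoch is one full pass over all batches.
    for _ in range(epochs):
        for batch in batches(rows, batch_size):
            model.partial_fit(batch)


rows = [1, 2, 3, 4]
old, new = RecordingModel(), RecordingModel()
fit_epochs_per_batch(old, rows, batch_size=2, epochs=2)
fit_epochs_over_data(new, rows, batch_size=2, epochs=2)
```

Both variants call `partial_fit` the same number of times, but only the second one lets the model see all of the data before any batch is repeated.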

vaex 2.5.0 (2019-12-16)

vaex-core 1.4.0

  • Performance
    • DataFrames are always truthy (DataFrame implements __bool__), which avoids calling __len__ #496
  • Fixes
    • Do not duplicate column when joining DataFrames on a column with the same name #480
    • Better error messages/stack traces, and work better with debugger. #488
    • Accept numpy scalars in expressions. #462
    • Expression.astype can create datetime64 columns out of (arrow) strings arrays. #440
    • Invalid mask access triggered when memory-mapped read only for strings. #459
  • Features
    • Expressions are not evaluated for filtered data #483 #496 and selections #505
    • Filtering (using df.filter) allows more flexible (and growing/expanding!) filters. #489
    • Filtering and selections allow for booleans (True or False) to auto 'broadcast', to allow 'conditional filtering'. #489
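
The `__bool__` performance entry for #496 relies on standard Python behavior: when a class defines `__len__` but not `__bool__`, truth testing calls `__len__`. A minimal sketch, assuming a hypothetical `LazyTable` class whose length is expensive to compute (this is not vaex code):

```python
class LazyTable:
    """Hypothetical container whose length requires scanning all rows."""

    def __init__(self, rows, keep):
        self.rows = rows
        self.keep = keep
        self.len_calls = 0

    def __len__(self):
        # Counting rows that pass the filter is the expensive part.
        self.len_calls += 1
        return sum(1 for row in self.rows if self.keep(row))


class AlwaysTrueTable(LazyTable):
    def __bool__(self):
        # Defining __bool__ means truth testing never falls back to __len__.
        return True


def is_even(row):
    return row % 2 == 0


plain = LazyTable(range(1000), is_even)
fast = AlwaysTrueTable(range(1000), is_even)

if plain:  # falls back to __len__, which scans every row
    pass
if fast:  # uses __bool__, no scan
    pass
```

Idioms such as `if df:` or `df or default` therefore become cheap once `__bool__` is defined, at the cost that an empty object is also truthy.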

vaex-ml 0.7.0

  • Features
    • IncrementalPredictor for scikit-learn models that support the .partial_fit method #497
  • Fixes
    • Adding unique function names to dataframes to enable adding a predictor twice #492

vaex-arrow 0.4.2

  • Compatibility with vaex-core 1.4.0

vaex 2.4.0 (2019-11-26)

vaex-core 1.3.0

  • Performance
    • Parallel df.evaluate #474
    • Avoid calling df.get_column_names (1000x for 1 billion rows per column use) #473
    • Slicing, e.g. df[1:-1], is much faster for filtered DataFrames #471
    • DataFrame copying and expression rewriting were slow #470
    • Double indices columns were not using index cache since empty dict is falsy #439
  • Features
    • multi-key sorting of a DataFrame #463
    • vaex expression to pandas.Series support #456
    • Dask array support #449 #476 example
    • isin() method for expressions #441 docs
    • Existing expressions are rewritten, to make them behave like arrays #450
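
The index cache entry for #439 names a common Python pitfall: an empty dict is falsy, so a truthiness check can silently discard a cache that simply has nothing in it yet. The sketch below reproduces the pattern in isolation. `lookup_buggy` and `lookup_fixed` are hypothetical functions, and the actual fix in vaex may look different.

```python
def lookup_buggy(key, cache=None):
    # Bug: an empty dict is falsy, so a caller-supplied empty cache is
    # replaced by a fresh local dict and never gets populated.
    cache = cache or {}
    if key not in cache:
        cache[key] = key * 2  # stands in for an expensive computation
    return cache[key]


def lookup_fixed(key, cache=None):
    # Fix: compare against None so an empty cache is still used.
    if cache is None:
        cache = {}
    if key not in cache:
        cache[key] = key * 2
    return cache[key]


shared = {}
lookup_buggy(21, shared)
size_after_buggy = len(shared)  # the shared cache was bypassed

lookup_fixed(21, shared)
size_after_fixed = len(shared)  # the shared cache now holds the result
```

Both functions return the same value, which is why this kind of bug tends to show up as a slowdown rather than as a wrong result.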

vaex-hdf5 0.5.6

  • requires vaex-core >=1.3,<2 for parallel evaluate

vaex-jupyter 0.4.1

  • Fixes
    • bqplot 0.12 revealed a bug/inconsistency with heatmap #465

vaex-arrow 0.4.1

  • Fixes
    • Support for Apache Arrow >= 0.15

vaex-ml 0.6.2

  • Fixes
    • Docstrings and minor improvements

vaex-graphql 0.0.1 (2019-10-15)

  • initial release 0.1

vaex 2.3.0 (2019-10-15)

vaex-core 1.2.0

  • feature: auto upcasting for sum #435
  • fix: selection/filtering fix when using masked values #431
  • fix: masked string array fixes #434
  • fix: memory usage fix for joins #439

vaex-arrow 0.4.1

  • fix: support for Apache Arrow >= 0.15