Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minify Kernel #7186

Open
tustvold opened this issue Feb 24, 2025 · 0 comments
Open

Minify Kernel #7186

tustvold opened this issue Feb 24, 2025 · 0 comments
Labels
enhancement Any new improvement worthy of a entry in the changelog

Comments

@tustvold
Copy link
Contributor

Is your feature request related to a problem or challenge? Please describe what you are trying to do.

Often people want to store arrow data in memory, and in some cases are potentially willing to pay a premium for reduced memory usage.

Currently various array types can have "bloated" memory footprints, for example:

  • Arrays can be sliced, with potentially unreferenced data buffers and child arrays
  • Dictionaries can contain duplicate entries
  • View arrays can contain unreferenced data

Describe the solution you'd like

I would like to propose a minify kernel in arrow-select that allows performing this minification. It should take a non-exhaustive/build-pattern MinifyOptions struct to allow controlling how this logic is performed. Ideally it would be possible to move the logic that currently resides in the IPCWriter over to use this kernel.

Describe alternatives you've considered

Additional context

@tustvold tustvold added the enhancement Any new improvement worthy of a entry in the changelog label Feb 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement Any new improvement worthy of a entry in the changelog
Projects
None yet
Development

No branches or pull requests

1 participant