Support for xarray? #947

Open
joaquinvanschoren opened this issue Sep 30, 2020 · 2 comments

Comments

@joaquinvanschoren
Contributor

I have heard from a growing number of people that it would be good to support xarray:
http://xarray.pydata.org/en/stable/

It supports multi-dimensional data (tensors) whereas pandas only supports single tables.
Images can for instance be stored more easily.

It seems possible to convert back and forth between xarray and pandas. What I don't know is how extensive this is. Can it 'flatten' tensors and store them as vectors, and vice versa?
If so, maybe this is an easy extension. If not, this may require updates to the backend as well.
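For reference, a minimal sketch of what the round trip between xarray and pandas can look like. The image data, dimension names, and the `pixel` name are made up for illustration; only `DataArray.to_series`, `DataArray.from_series`, and pandas' `Series.unstack` are relied on.

```python
import numpy as np
import xarray as xr

# Hypothetical data: 2 tiny grayscale "images" of 2x3 pixels.
images = xr.DataArray(
    np.arange(12).reshape(2, 2, 3),
    dims=("sample", "y", "x"),
    coords={"sample": [0, 1], "y": [0, 1], "x": [0, 1, 2]},
    name="pixel",
)

# Tensor -> long pandas Series: one entry per (sample, y, x) cell,
# indexed by a MultiIndex built from the dimension coordinates.
series = images.to_series()

# Long Series -> wide table: one row per sample, one column per (y, x) pixel.
wide = series.unstack(["y", "x"])

# Long Series -> tensor again: the MultiIndex levels become dimensions.
restored = xr.DataArray.from_series(series)
```

The wide table is the "flattened" form that fits a single-table format. Going back from it to a tensor works here only because the column MultiIndex still carries the `y` and `x` labels.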

I just wanted to start some discussion about this :)

Thanks!

@mfeurer
Collaborator

mfeurer commented Sep 30, 2020

I guess the most important questions right now would be:

  1. how to convert between this and arff in a useful manner?
  2. what would be a practical use case that cannot be easily tackled right now?
  3. do any consuming ML libraries support it?
  4. who would implement this? Supporting yet another library asks for generalizing the data format a bit more...

@PGijsbers
Collaborator

For reference, the xarray to/from dataframe methods are documented here. My initial response would be that it provides little to no benefit as long as our datasets are ultimately flat arff tables with no meta-data to convert them to tensors. Very interested to see some of Matthias' questions answered.
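To illustrate the meta-data point, a small sketch under stated assumptions: the table contents and the `metadata` dictionary are hypothetical, and nothing here reflects an existing dataset format. A flat table with 6 feature columns is compatible with several tensor layouts, so the shape and dimension names would have to be stored somewhere alongside the data.

```python
import numpy as np
import xarray as xr

# Hypothetical flat table: 4 rows and 6 feature columns, as a flat arff table would load.
table = np.arange(24).reshape(4, 6)

# The same 6 columns fit several per-row layouts; the table alone cannot say which.
candidate_shapes = [(2, 3), (3, 2), (1, 6)]

# Assumed meta-data that would need to accompany the dataset (not an existing field).
metadata = {"dims": ("y", "x"), "shape": (2, 3)}

# With that meta-data, each row can be reshaped into a labelled tensor.
tensor = xr.DataArray(
    table.reshape((table.shape[0], *metadata["shape"])),
    dims=("sample", *metadata["dims"]),
)
```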
