How to perform tests with other datasets that are not listed? #4

PoliteApps · 2022-08-25T20:38:39Z

I looked through the repository and did not find an easy way to use your code on my own datasets. I would expect an interface similar to the regressors in scikit-learn. That would make it much easier for other researchers and students to use this new network. Is there any script, change, or material available that helps use the repository this way?

E.g. hopular.fit(x,y)

Thanks!

YuvalRom · 2022-09-11T08:59:35Z

The sklearn API would also be valuable for hyperopt and other research. For example, if I want to benchmark the model with the datasets and code from "Why do tree-based models still outperform deep learning on tabular data?" (Léo Grinsztajn et al., 2022), I currently have to implement the interface myself.

akash-isu · 2022-11-09T16:52:19Z

Were you able to figure out how to use this on custom datasets?

PoliteApps · 2022-11-10T21:58:05Z

I just gave up after reading this: https://medium.com/@tunguz/trouble-with-hopular-6649f22fa2d3

YuvalRom · 2022-11-11T08:41:17Z

I was able to run it on a custom dataset.
You need to create a class that conforms to the API of the dataset classes defined here (https://github.com/ml-jku/hopular/blob/main/hopular/auxiliary/data.py); then you can run it by following the instructions in the README file.
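
As a rough illustration of the kind of information such a class usually has to carry, here is a minimal sketch that loads a CSV and records features, targets, and which columns look discrete. This is not Hopular's actual base class: `TabularDataset`, `load_csv_dataset`, and the "non-numeric means discrete" rule are my own assumptions for illustration, so check data.py for the real attribute names and types before adapting it.

```python
import csv
import io
from dataclasses import dataclass
from typing import List


@dataclass
class TabularDataset:
    """Hypothetical container; field names are NOT taken from Hopular."""
    feature_names: List[str]
    features: List[List[str]]   # raw cell values, one inner list per sample
    targets: List[str]          # raw target values, one per sample
    is_discrete: List[bool]     # one flag per feature column


def _is_float(value: str) -> bool:
    try:
        float(value)
        return True
    except ValueError:
        return False


def load_csv_dataset(text: str, target_column: str) -> TabularDataset:
    """Parse CSV text with a header row into a TabularDataset."""
    rows = list(csv.DictReader(io.StringIO(text)))
    if not rows:
        raise ValueError("CSV contains no data rows")
    if target_column not in rows[0]:
        raise KeyError(f"Missing target column: {target_column}")
    names = [name for name in rows[0] if name != target_column]
    features = [[row[name] for name in names] for row in rows]
    targets = [row[target_column] for row in rows]
    # Assumption: a column with any non-numeric value is treated as discrete.
    is_discrete = [
        not all(_is_float(row[name]) for row in rows) for name in names
    ]
    return TabularDataset(names, features, targets, is_discrete)
```

For example, `load_csv_dataset(open("my_data.csv").read(), "price")` would give one object holding everything a custom dataset class needs to pass on; the file name and column name here are made up.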

I ran some tests comparing the model to XGBoost and CatBoost on the same datasets as the paper, tuned the way the authors say they did, but I reached much better results than reported.

I also ran the model on a bigger dataset (58K samples) and was amazed at how much memory it uses.
I believe this is a very interesting idea, but not yet a usable model.

Still a great paper and repo, though.
