Benchmark using TPC-H, TPC-DS #718

Open
philippemnoel opened this issue Jan 10, 2024 · 3 comments
Labels
ci/tests (Issue related to our CI and/or testing frameworks), good first issue (Good for newcomers), priority-medium (Medium priority issue)

Comments

@philippemnoel
Collaborator

What
Alongside ClickBench, TPC-H is probably the most popular OLAP benchmark. It's much older, and established enterprises trust it.

DataFusion already has native support for TPC-H, so I think we could add support for it quickly. It's fine to ship without this, but adding it eventually should help larger enterprises trust ParadeDB and feel confident in the maturity of our product.

There is a benchmarking tool for it called HammerDB. Perhaps it would be useful?

Why
^

How
^

philippemnoel transferred this issue from another repository Jan 24, 2024
philippemnoel added the ci/tests and priority-medium labels Feb 14, 2024
philippemnoel self-assigned this Feb 14, 2024
@philippemnoel
Collaborator Author

I have started on this. I just need to finish generating the dataset.

@philippemnoel
Collaborator Author

We'll implement these two TODOs in a separate PR:

  • Test with various combinations of heap vs parquet tables -- EDIT: Configured, but there's a bug in datafusion-federation, which we need to fix
  • Test on the 100GB generator on a c6a.4xlarge instance
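
For the first TODO, the set of heap vs parquet combinations could be enumerated up front so each one can be run as its own benchmark case. The following is only a minimal sketch under stated assumptions: the table names are the eight tables of the standard TPC-H schema, while `storage_layouts`, `mixed_layouts`, and the `"heap"` / `"parquet"` labels are hypothetical names chosen for illustration, not ParadeDB identifiers.

```python
from itertools import product

# The eight tables defined by the TPC-H schema.
TPCH_TABLES = [
    "part", "supplier", "partsupp", "customer",
    "orders", "lineitem", "nation", "region",
]

# Illustrative storage labels (assumption: not real ParadeDB identifiers).
STORAGE_KINDS = ("heap", "parquet")


def storage_layouts(tables=TPCH_TABLES, kinds=STORAGE_KINDS):
    """Yield one {table: storage kind} mapping per possible combination."""
    for combo in product(kinds, repeat=len(tables)):
        yield dict(zip(tables, combo))


def mixed_layouts(tables=TPCH_TABLES, kinds=STORAGE_KINDS):
    """Yield only the layouts that use more than one storage kind."""
    for layout in storage_layouts(tables, kinds):
        if len(set(layout.values())) > 1:
            yield layout


if __name__ == "__main__":
    # Two storage kinds over eight tables gives 2**8 layouts in total.
    print(sum(1 for _ in storage_layouts()))
    print(sum(1 for _ in mixed_layouts()))
```

Running all 256 layouts is probably more than a CI job needs. Sampling from `mixed_layouts()`, plus the two uniform layouts as baselines, would be one way to keep the matrix small.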

philippemnoel changed the title from "Benchmark using TPC-H" to "Benchmark using TPC-H, TPC-DS" Apr 30, 2024
@philippemnoel
Collaborator Author

philippemnoel removed their assignment Jun 26, 2024
philippemnoel added the good first issue label Oct 20, 2024
2 participants