Dear Authors,
Congratulations on the great work, and thank you for your effort in open-sourcing the related artifacts!
Would you consider releasing the original user queries used to build the released MixEval test data (i.e., the 4K data points in MixEval and the 1K data points in MixEval-hard)? I understand you mentioned in #36 that you are not open-sourcing the exact web query data or pipeline because doing so would make it easier for others to hack the resampled benchmark versions, and that you have another ongoing project. However, such risks should be low for the original queries behind the already released test data. These queries would be very interesting to study, and they would be a valuable resource for researchers!
Thank you!
Thank you for your kind words!
We're sorry, but we didn't keep the separate query batches for each dynamic version (the sampling was done on the fly). The web query pool is not being open-sourced due to the issues mentioned in #36.
For your experiments, you may consider other real-world user query datasets, which are quite similar to our web queries (as shown in Figure 2 of the paper).
Sorry for the inconvenience.