Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evaluations without human blind evaluation #78

Open
ADiko1997 opened this issue Sep 3, 2024 · 3 comments
Open

Evaluations without human blind evaluation #78

ADiko1997 opened this issue Sep 3, 2024 · 3 comments

Comments

@ADiko1997
Copy link

Hi, have you evaluated the model using only GPT3.5/Claude without HBR? This is important for the research community to compare against your work.

@Espere-1119-Song
Copy link
Collaborator

Sure, you can refer to Table K10 and Table K11 in the Appendix of our paper

@ADiko1997
Copy link
Author

ADiko1997 commented Sep 3, 2024 via email

@lan-lw
Copy link

lan-lw commented Nov 11, 2024

Hi can you provide the moviechat+ model using GPT3.5/Claude without HBR ? I didn't find the results in moviechat+ paper.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants