Data sampling of InternVL2.5-MPO #923

Open
manglu097 opened this issue Feb 24, 2025 · 2 comments
Comments

@manglu097

Section 3.1 of the paper uses the phrase "For instructions with clear ground truths". How do the authors check whether a generated response matches the ground truth? Thanks.

@yuecao0119
Collaborator

Hello,

As stated in the paper, we require the model to end its response with the final answer in the form 'Final Answer: xxx'. We can then extract that final answer and compare it with the ground truth using matching rules to determine whether the response is correct.
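
For readers following along, here is a minimal sketch of that extract-and-compare step. It is not the script used for InternVL2.5-MPO, which is not shown in this thread. The helper names `extract_final_answer` and `is_correct` are hypothetical, and the comparison rule (case-insensitive string equality) is an assumption.

```python
import re
from typing import Optional

# Assumption: the answer is everything after the last "Final Answer:" marker
# on that line. This pattern is illustrative, not taken from the repository.
FINAL_ANSWER_RE = re.compile(r"Final Answer:\s*(.+)", re.IGNORECASE)


def extract_final_answer(response: str) -> Optional[str]:
    """Return the text after the last 'Final Answer:' marker, or None."""
    matches = FINAL_ANSWER_RE.findall(response)
    return matches[-1].strip() if matches else None


def is_correct(response: str, ground_truth: str) -> bool:
    """Hypothetical rule: case-insensitive equality with the ground truth."""
    answer = extract_final_answer(response)
    if answer is None:
        # A response without the required marker is counted as incorrect.
        return False
    return answer.lower() == ground_truth.strip().lower()
```

For example, `is_correct("The area is larger.\nFinal Answer: B", "b")` evaluates to `True`, while a response with no 'Final Answer:' line is treated as incorrect.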

@manglu097
Author

Thank you for your response! Do you mean exact matching? The model's final answer might have the same meaning as the ground truth but a different form, such as 0.3 versus 3/10, or 'yes' versus 'yep'. How do you handle matching in cases like this? Is there a script for this in the repository? (I couldn't find one.)
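
To make the question concrete, a lenient matcher for these two cases might look like the sketch below. This is only an illustration of the kind of normalization being asked about, not something found in the repository. The function names and the synonym sets are hypothetical.

```python
from fractions import Fraction
from typing import Union

# Hypothetical synonym sets; a real pipeline would need a longer list.
YES_WORDS = {"yes", "yep", "yeah", "true"}
NO_WORDS = {"no", "nope", "false"}


def canonical(text: str) -> Union[Fraction, str]:
    """Map an answer string to a canonical value for comparison."""
    cleaned = text.strip().lower().rstrip(".")
    if cleaned in YES_WORDS:
        return "yes"
    if cleaned in NO_WORDS:
        return "no"
    try:
        # Fraction parses both decimals ("0.3") and ratios ("3/10") exactly.
        return Fraction(cleaned)
    except (ValueError, ZeroDivisionError):
        # Not numeric: fall back to the cleaned string itself.
        return cleaned


def answers_match(prediction: str, ground_truth: str) -> bool:
    """True when both answers normalize to the same canonical value."""
    return canonical(prediction) == canonical(ground_truth)
```

With this sketch, `answers_match("0.3", "3/10")` and `answers_match("Yep", "yes")` both evaluate to `True`, whereas plain string equality would reject both pairs.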
