Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Hotfix] Qwen 실행 및 성능에 대한 버그 해결 #38

Merged
merged 2 commits into from
Nov 22, 2024

Conversation

jagaldol
Copy link
Contributor

📝 Summary

Qwen에서 성능이 안좋게 나오며 버그가 있던 부분을 수정하였습니다.

validation dataset에 이미 정답이 포함되어 있는데 추론을 시켰음
- qwen의 경우 처음부터 model과 tokenizer의 길이가 달라서 오류를 발생시킴
- 불필요한 로직이자, 버그 유발로 보여 삭제
@jagaldol jagaldol added Priority: High 우선적으로 처리해야 할 중요한 작업 Type: Bug 오류나 버그 등 문제 labels Nov 22, 2024
@jagaldol jagaldol self-assigned this Nov 22, 2024
@jagaldol jagaldol merged commit 2cfaf95 into main Nov 22, 2024
3 checks passed
@jagaldol jagaldol deleted the hotfix-qwen-bug-fix branch November 22, 2024 08:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Priority: High 우선적으로 처리해야 할 중요한 작업 Type: Bug 오류나 버그 등 문제
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants