We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Qwen2.5
Qwen2.5-72B-Instruct,Qwen2.5-32B-Instruct
vllm
您好,请问一下为什么Qwen2.5官方发布的榜单中72B和32B的LiveCodeBench分数跟LiveCodeBench作者公开72B和32B的分数差异这么大?
The text was updated successfully, but these errors were encountered:
还有个疑问就是:LiveCodeBench基准中有代码生成、自我修复、代码执行、测试输出预测 4个场景,请问Qwen2.5官方发布的榜单中的LiveCodeBench分数使用的是哪个场景的分数?
Sorry, something went wrong.
你需要拖动具体日期,与我们表格中的日期对应。
选择2023-5~2024-9,Qwen2.5-72B-Instruct分数时50,和Qwen2.5官网发布的55.5差距还是很大;
huybery
No branches or pull requests
Model Series
Qwen2.5
What are the models used?
Qwen2.5-72B-Instruct,Qwen2.5-32B-Instruct
What is the scenario where the problem happened?
vllm
Is this a known issue?
Information about environment
您好,请问一下为什么Qwen2.5官方发布的榜单中72B和32B的LiveCodeBench分数跟LiveCodeBench作者公开72B和32B的分数差异这么大?
Log output
Description
您好,请问一下为什么Qwen2.5官方发布的榜单中72B和32B的LiveCodeBench分数跟LiveCodeBench作者公开72B和32B的分数差异这么大?
The text was updated successfully, but these errors were encountered: