Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use int4 version of Qwen-VL? #6

Open
kexul opened this issue Apr 11, 2024 · 2 comments
Open

Use int4 version of Qwen-VL? #6

kexul opened this issue Apr 11, 2024 · 2 comments
Labels
good first issue Good for newcomers

Comments

@kexul
Copy link

kexul commented Apr 11, 2024

Hi, since int4 version of Qwen-vl is avaialble and more friendly for low end gpu, is it a plug and play model for clot?

@kexul
Copy link
Author

kexul commented Apr 11, 2024

Result with int4 version of QwenVL:
图片
Not as funny as your example.

@zhongshsh
Copy link
Collaborator

Our model is based on Qwen/Qwen-VL-Chat, and using the int4 version of Qwen-VL directly might compromise the quality of responses. Humor is subjective. You can try generating a few more times to find responses that match your sense of humor.

@dedekinds dedekinds added the good first issue Good for newcomers label Apr 12, 2024
@zhongshsh zhongshsh pinned this issue Apr 12, 2024
@dedekinds dedekinds unpinned this issue Apr 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

3 participants