Use int4 version of Qwen-VL? #6

kexul · 2024-04-11T04:18:40Z

Hi, since int4 version of Qwen-vl is avaialble and more friendly for low end gpu, is it a plug and play model for clot?

kexul · 2024-04-11T06:18:28Z

Result with int4 version of QwenVL:

Not as funny as your example.

zhongshsh · 2024-04-12T08:28:26Z

Our model is based on Qwen/Qwen-VL-Chat, and using the int4 version of Qwen-VL directly might compromise the quality of responses. Humor is subjective. You can try generating a few more times to find responses that match your sense of humor.

dedekinds added the good first issue Good for newcomers label Apr 12, 2024

zhongshsh pinned this issue Apr 12, 2024

dedekinds unpinned this issue Apr 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use int4 version of Qwen-VL? #6

Use int4 version of Qwen-VL? #6

kexul commented Apr 11, 2024 •

edited

Loading

kexul commented Apr 11, 2024

zhongshsh commented Apr 12, 2024

Use int4 version of Qwen-VL? #6

Use int4 version of Qwen-VL? #6

Comments

kexul commented Apr 11, 2024 • edited Loading

kexul commented Apr 11, 2024

zhongshsh commented Apr 12, 2024

kexul commented Apr 11, 2024 •

edited

Loading