Can I use the same model (Codestral) for both completion and chat? #3343

abceleung · 2024-10-30T09:46:54Z

abceleung
Oct 30, 2024

I see that Codestral is recommended for both chat and completion. Can Tabby use the same instance of Codestral for both of these tasks?

Also, can I use a quantized model (like this one)?

Answered by zwpaper

Oct 30, 2024

Hi @abceleung,

Thank you for trying Tabby!

Yes, you can use the same instance when you specify the same configuration. A simple example would be running Tabby with the same model designated for both completion and chat:

tabby serve --model Codestral-22B --chat-model Codestral-22B

By default, Tabby utilizes the Q8 quantized model. However, if you need additional quantized models, you can fork the repository at https://github.com/tabbyml/registry-tabby to create your own registry.

View full answer

zwpaper · 2024-10-30T16:52:16Z

zwpaper
Oct 30, 2024
Collaborator

Hi @abceleung,

Thank you for trying Tabby!

Yes, you can use the same instance when you specify the same configuration. A simple example would be running Tabby with the same model designated for both completion and chat:

tabby serve --model Codestral-22B --chat-model Codestral-22B

By default, Tabby utilizes the Q8 quantized model. However, if you need additional quantized models, you can fork the repository at https://github.com/tabbyml/registry-tabby to create your own registry.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can I use the same model (Codestral) for both completion and chat? #3343

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Can I use the same model (Codestral) for both completion and chat? #3343

abceleung Oct 30, 2024

Replies: 1 comment

zwpaper Oct 30, 2024 Collaborator

abceleung
Oct 30, 2024

zwpaper
Oct 30, 2024
Collaborator