🚀 The feature, motivation and pitch
The torchchat framework provides an excellent platform for embedding models in many different edge-centric environments.
The Granite Code models, specifically the 3B-128k and 8B-128k variants, are a family of models from IBM that support a wide variety of code-related tasks. The models are released under the Apache-2.0 license and are therefore well-suited to embedded use-cases where code intelligence is needed.
The request here is to extend torchchat's model support to run the 3B and 8B long-context variants of Granite Code, enabling the use of these models across embedded use-cases.
Alternatives
Depending on the goals of the torchchat framework, extending support to non-llama models may or may not be in scope. There are other embedded frameworks out there (notably llama.cpp and the many projects that wrap it) that can be used to run Granite Code in embedded environments. Our goal at IBM is to give users as many choices as possible for how to run all of our Granite family models, and our hope is that torchchat can be a strong piece of this story!
Additional context
The 3B and 8B models use the llama architecture in transformers, so they are close to fully supported as-is. There are a few crucial pieces that are present in the transformers implementation but missing in torchchat:

- tokenizers: tokenizers #1251
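As a quick sanity check (a sketch only, not torchchat code; the ibm-granite model IDs on the Hugging Face Hub are assumed), both the architecture claim and the tokenizer gap can be verified from Python:

```python
# Sanity-check sketch: verify that the Granite Code checkpoints report the
# llama architecture, and that their tokenizers load via the Hugging Face
# `tokenizers` library rather than SentencePiece.
from tokenizers import Tokenizer
from transformers import AutoConfig

for model_id in (
    "ibm-granite/granite-3b-code-instruct-128k",  # assumed Hub IDs
    "ibm-granite/granite-8b-code-instruct-128k",
):
    cfg = AutoConfig.from_pretrained(model_id)
    # Expected: model_type == "llama", architectures == ["LlamaForCausalLM"]
    print(model_id, cfg.model_type, cfg.architectures)

# The tokenizer ships as a BPE tokenizer.json, which torchchat would need
# to handle (the gap referenced by tokenizers #1251 above).
tok = Tokenizer.from_pretrained("ibm-granite/granite-3b-code-instruct-128k")
enc = tok.encode("def fibonacci(n):")
print(enc.ids)
print(tok.decode(enc.ids))
```

If both checks pass, the remaining work is on the torchchat side rather than in the models themselves.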
RFC (Optional)

I've worked through the initial steps of solving all of these outstanding issues (see the corresponding issues). Once these are solved, the addition of these Granite Code models should consist of the following steps: