Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Granite Code 3B/8B #1262

Open
gabe-l-hart opened this issue Oct 3, 2024 · 0 comments · May be fixed by #1336
Open

Support Granite Code 3B/8B #1262

gabe-l-hart opened this issue Oct 3, 2024 · 0 comments · May be fixed by #1336

Comments

@gabe-l-hart
Copy link
Contributor

🚀 The feature, motivation and pitch

The torchchat framework provides an excellent platform for embedding models into many different edge-centric platforms.

The Granite Code models, specifically the 3B-128k and 8B-128k variants, are a family of models from IBM that support a wide variety of code-related tasks. The models are released under the Apache-3 license and are therefore well-suited to embedded use-cases where code intelligence is needed.

The request here is to extend the model support in torchchat to support running the 3B and 8B long-context variants of Granite Code in order to enable usage of these models across embedded use-cases.

Alternatives

Depending on the goals of the torchchat framework, extending support to non-llama models may or may not be a project goal. There are other embedded frameworks out there (notably llama.cpp and the many projects that wrap it), so these can be used to run Granite Code in embedded environments. Our goal at IBM is to provide users with as many choices as possible on how to run all of our Granite family models, so our hope is that torchchat can be a strong piece of this story!

Additional context

The 3B and 8B models use the llama architecture in transformers, so they are close to fully supported as-is. There are a few crucial pieces that are present in the transformers implementation that are missing in torchchat:

RFC (Optional)

I've worked through the initial steps of solving all of these outstanding issues (see the corresponding issues). Once these are solved, the addition of these Granite Code models should consist of the following steps:

@gabe-l-hart gabe-l-hart linked a pull request Oct 31, 2024 that will close this issue
6 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant