Feature Request: Add OLMoE #9317
Comments
Voted! It is ideal for mobile solutions. A quantized version will be even better :)
I may try this in my free time, but I'm kind of chaotic these days with research + writing, so I'm not too optimistic on the timeline. I'll also try to get the OLMoE lead on it after vLLM.
Any updates?
#9462 is doing this. I have also implemented this in ChatLLM.cpp.
Looks like this is done?
Indeed. It should have been completed by this PR: GGUFs are already available on HF.
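For anyone who wants to try the published GGUFs, here is a minimal sketch using the llama-cpp-python bindings. The repo id and quant filename below are assumptions for illustration; check the model page on HF for the files that actually exist.

```python
# Minimal sketch: load an OLMoE GGUF via llama-cpp-python and run a prompt.
# The repo id and filename are assumptions; check HF for the actual files.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="allenai/OLMoE-1B-7B-0924-Instruct-GGUF",  # assumed repo id
    filename="*Q4_K_M.gguf",  # assumed quant; glob patterns are supported
    n_ctx=4096,
)

out = llm("Q: What is a mixture-of-experts model? A:", max_tokens=128)
print(out["choices"][0]["text"])
```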
Does this mean that it will be easier to implement compatibility with https://allenai.org/blog/olmo2 ?
Already there: #10500 :) |
Prerequisites
Feature Description
Add this model (and its other variants): https://huggingface.co/allenai/OLMoE-1B-7B-0924-Instruct
Motivation
We recently released the OLMoE model at Ai2: a 1.3B-active / 6.9B-total-parameter MoE model. It seems solid, and we'd love for people to use it.
Possible Implementation
It should be possible to quickly combine the existing OLMo implementation with the Transformers version: https://github.com/huggingface/transformers/blob/main/src/transformers/models/olmoe/modeling_olmoe.py (a condensed sketch of the MoE block is below).
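For reference, here is a condensed sketch of the OLMoE-style sparse MoE feed-forward block (softmax routing, top-k expert selection, SwiGLU experts), paraphrased from the Transformers implementation linked above. The dimensions follow the published OLMoE config (2048 hidden, 64 experts, 8 active per token) but should be treated as illustrative rather than the exact HF code.

```python
# Condensed sketch of an OLMoE-style sparse MoE block: softmax over router
# logits, top-k expert selection, SwiGLU experts. Paraphrased from the
# Transformers implementation; names and dimensions are illustrative.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoEBlock(nn.Module):
    def __init__(self, hidden=2048, ffn=1024, n_experts=64, top_k=8):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(hidden, n_experts, bias=False)
        # Each expert is a SwiGLU MLP: down(silu(gate(x)) * up(x))
        self.gate = nn.ModuleList(nn.Linear(hidden, ffn, bias=False) for _ in range(n_experts))
        self.up   = nn.ModuleList(nn.Linear(hidden, ffn, bias=False) for _ in range(n_experts))
        self.down = nn.ModuleList(nn.Linear(ffn, hidden, bias=False) for _ in range(n_experts))

    def forward(self, x):                                  # x: (tokens, hidden)
        probs = F.softmax(self.router(x), dim=-1)          # (tokens, n_experts)
        weights, experts = probs.topk(self.top_k, dim=-1)  # (tokens, top_k) each
        out = torch.zeros_like(x)
        for k in range(self.top_k):                        # accumulate the k-th expert choice
            for e in experts[:, k].unique().tolist():
                rows = (experts[:, k] == e).nonzero(as_tuple=True)[0]
                h = x[rows]
                h = self.down[e](F.silu(self.gate[e](h)) * self.up[e](h))
                out[rows] += weights[rows, k].unsqueeze(-1) * h
        return out
```

As far as I can tell from the config, OLMoE does not renormalize the top-k routing weights after selection; the sketch mirrors that by weighting each expert's output with the raw softmax probabilities.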