
Add Sophia optimizer #852

Open · PeanutButterRat opened this issue Nov 11, 2024 · 0 comments · May be fixed by #844

Labels: enhancement (New feature or request)

Comments

@PeanutButterRat
Contributor

Is your feature request related to a problem? Please describe:
Sophia is a fairly new optimization algorithm for training language models that reports substantial gains over Adam (roughly a 2x wall-clock speed-up in GPT-2 pre-training runs). Sophia seems like a nice addition to fairseq2.
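
For reference, the core SophiaG update from the paper looks roughly like the sketch below, written as a plain PyTorch optimizer. This is only an illustration of the algorithm being requested, not existing fairseq2 code; the class name, hyperparameter defaults, and the simplified update_hessian hook (an EMA of squared gradients standing in for the paper's Gauss-Newton-Bartlett estimator) are assumptions made for the sketch.

import torch

class SophiaG(torch.optim.Optimizer):
    """Illustrative sketch of the SophiaG update rule (Liu et al., 2023)."""

    def __init__(self, params, lr=1e-4, betas=(0.965, 0.99), rho=0.04,
                 weight_decay=0.1, eps=1e-12):
        defaults = dict(lr=lr, betas=betas, rho=rho,
                        weight_decay=weight_decay, eps=eps)
        super().__init__(params, defaults)

    @torch.no_grad()
    def update_hessian(self):
        # Meant to run every k steps after a backward pass on the Hessian
        # estimator loss; keeps an EMA of grad * grad as a stand-in for the
        # diagonal Gauss-Newton-Bartlett estimate.
        for group in self.param_groups:
            _, beta2 = group["betas"]
            for p in group["params"]:
                if p.grad is None:
                    continue
                hess = self.state[p].setdefault("hessian", torch.zeros_like(p))
                hess.mul_(beta2).addcmul_(p.grad, p.grad, value=1 - beta2)

    @torch.no_grad()
    def step(self, closure=None):
        loss = None
        if closure is not None:
            with torch.enable_grad():
                loss = closure()
        for group in self.param_groups:
            beta1, _ = group["betas"]
            for p in group["params"]:
                if p.grad is None:
                    continue
                state = self.state[p]
                exp_avg = state.setdefault("exp_avg", torch.zeros_like(p))
                hess = state.setdefault("hessian", torch.zeros_like(p))
                # Decoupled weight decay, as in AdamW.
                p.mul_(1 - group["lr"] * group["weight_decay"])
                # EMA of the gradient.
                exp_avg.mul_(beta1).add_(p.grad, alpha=1 - beta1)
                # Pre-conditioned step, clipped element-wise to [-1, 1].
                denom = (group["rho"] * hess).clamp_(min=group["eps"])
                p.add_((exp_avg / denom).clamp_(-1.0, 1.0), alpha=-group["lr"])
        return loss

In a training loop, step() would run every iteration just like AdamW does today, while update_hessian() would be called every k steps after backpropagating the estimator loss; wiring that schedule into the recipe would be the main integration work.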

Describe the solution you would like:
It would be nice to offer Sophia in a fine-tuning recipe, invoked with something like:

fairseq2 lm instruction_finetune --preset llama3_2_1b_instruct_sophiag

Describe the alternatives you have considered:
AdamW is already available in fairseq2 as the default optimizer.

Additional Context:
None

PeanutButterRat added the enhancement label on Nov 11, 2024
PeanutButterRat linked a pull request on Nov 11, 2024 that will close this issue