Ozawa333 changed the title from "What is the min GPU memory required to train the model?" to "What is the min GPU memory required to fine-tune the model?" on May 10, 2024
First of all, thank you very much for your work.
I am trying to train Gemma-2B with a 32K sequence length and a 2K segment size on a single A6000 Ada (48 GB).
But even when I adjust the parameters in train.gemma.infini.noclm.sh like the following, training still fails because GPU memory is exceeded.
Is this normal?