oscillations in the loss #822
Comments
Did you shuffle the dataset?
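For context on the shuffling question: streaming, shard-based loaders usually cannot shuffle globally, so they tend to combine a per-epoch shuffle of the shard order with a bounded in-memory sample buffer. If shards are always read in the same order, or the buffer is small relative to a shard, each epoch replays roughly the same sequence of batches, which is one plausible source of a pattern that repeats every epoch. Below is a minimal pure-Python sketch of that two-level scheme. It is not the webdataset implementation; `buffered_shuffle`, `epoch_stream`, and all their parameters are hypothetical names used only for illustration.

```python
import random
from typing import Iterable, Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def buffered_shuffle(stream: Iterable[T], buffer_size: int, rng: random.Random) -> Iterator[T]:
    """Approximately shuffle a stream using a bounded in-memory buffer."""
    if buffer_size < 1:
        raise ValueError("buffer_size must be at least 1")
    buffer: List[T] = []
    for item in stream:
        if len(buffer) < buffer_size:
            # Fill the buffer before emitting anything.
            buffer.append(item)
            continue
        # Buffer is full: emit a random element and reuse its slot.
        idx = rng.randrange(buffer_size)
        yield buffer[idx]
        buffer[idx] = item
    # Drain the remaining items in random order.
    rng.shuffle(buffer)
    yield from buffer


def epoch_stream(
    shards: Sequence[Sequence[T]], buffer_size: int, seed: int, epoch: int
) -> Iterator[T]:
    """Yield one epoch of samples: shuffle shard order, then buffer-shuffle samples."""
    # Assumption: mixing the epoch into the seed gives a new order every epoch.
    rng = random.Random(seed + epoch)
    order = list(range(len(shards)))
    rng.shuffle(order)
    samples = (sample for i in order for sample in shards[i])
    return buffered_shuffle(samples, buffer_size, rng)
```

With `buffer_size=1` and a fixed shard order this degenerates to reading the data sequentially, which is the situation the question above is probing for. A larger buffer and a per-epoch shard shuffle both reduce how similar consecutive epochs look.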
On Mon, Feb 19, 2024, 7:16 PM nicolas-dufour wrote:
Hi!
I see oscillations in the loss curve for the cc3m model.
When using the same webdataset framework, I get oscillations as well.
In my case it's more annoying because the oscillation amplitude is greater
than the decrease of the loss per epoch.
Do you know what could cause this behaviour?
From my experiments, it seems to be linked to webdataset, since a
traditional dataloader doesn't suffer from this issue. In my case the period
of the oscillation equals the number of steps per epoch (on cc12m).
Thanks for the help!
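One way to check the claim that the oscillation period matches the number of steps per epoch is to remove the slow downward trend from the logged loss and look for the lag with the highest autocorrelation. The sketch below does this in plain Python. The function names and the linear-trend assumption are illustrative choices, not something taken from this thread or from any library.

```python
import math
from typing import List, Sequence


def remove_linear_trend(values: Sequence[float]) -> List[float]:
    """Subtract a least-squares line so only the oscillation remains."""
    n = len(values)
    if n < 2:
        return [0.0] * n
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    sxx = sum((i - mean_x) ** 2 for i in range(n))
    sxy = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    slope = sxy / sxx
    return [v - (mean_y + slope * (i - mean_x)) for i, v in enumerate(values)]


def autocorrelation(values: Sequence[float], lag: int) -> float:
    """Biased autocorrelation estimate at the given lag."""
    n = len(values)
    mean = sum(values) / n
    denom = sum((v - mean) ** 2 for v in values)
    if denom == 0.0:
        return 0.0  # constant series: no oscillation to measure
    num = sum((values[i] - mean) * (values[i + lag] - mean) for i in range(n - lag))
    return num / denom


def dominant_period(losses: Sequence[float], min_lag: int, max_lag: int) -> int:
    """Return the lag in [min_lag, max_lag] with the highest autocorrelation."""
    residual = remove_linear_trend(losses)
    return max(range(min_lag, max_lag + 1), key=lambda lag: autocorrelation(residual, lag))


if __name__ == "__main__":
    # Synthetic stand-in for a logged loss curve (hypothetical numbers).
    steps_per_epoch = 50
    losses = [
        5.0 - 0.001 * i + 0.5 * math.sin(2 * math.pi * i / steps_per_epoch)
        for i in range(500)
    ]
    print(dominant_period(losses, min_lag=10, max_lag=120))
```

If the lag found on real training logs sits close to the steps per epoch, that supports the idea that the data order repeats from one epoch to the next. A real loss curve does not decay linearly, so a different detrending step may be needed.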
Hey @rom1504
Thanks for the help!