It seems that we use the averaged Hessian for all of the params. I want to know how to group my params differently: for example, take each output channel as one block and use the average over that block as the diagonal estimate for its params.
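To make the question concrete, here is a minimal sketch of the grouping I have in mind. It is written in plain NumPy and does not use this repo's code. It assumes a per-element Hessian diagonal estimate is already available as an array with the same shape as the weight, and that axis 0 is the output-channel axis. The function name `average_per_output_channel` is made up for illustration.

```python
import numpy as np

def average_per_output_channel(hess_diag):
    """Replace every entry by the mean over its output-channel block.

    hess_diag: per-element Hessian diagonal estimate, same shape as the
    weight. Axis 0 is ASSUMED to be the output-channel axis, e.g.
    (out_channels, in_channels, kH, kW) for a conv weight.
    """
    h = np.asarray(hess_diag, dtype=float)
    if h.ndim <= 1:
        # Biases and other 1-D params: each element is its own block.
        return h.copy()
    # Average over every axis except the output-channel axis.
    reduce_axes = tuple(range(1, h.ndim))
    block_mean = h.mean(axis=reduce_axes, keepdims=True)
    # Broadcast back so the result matches the parameter's shape.
    return np.broadcast_to(block_mean, h.shape).copy()
```

With a weight of shape `(2, 3, 2, 2)`, this gives two blocks of 12 elements each, and every element in a block gets that block's mean. Is there a supported way to get this kind of grouping in the optimizer?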