
Change TransformState to NamedTuple #106

Merged: 4 commits merged into main on Jul 24, 2024

Conversation

SamDuffield (Contributor)

This is quite a large PR, unfortunately, but it fixes #83 and also cleans up the TransformState code by moving from a mutable dataclass to an immutable NamedTuple, which is also better encapsulation practice for functional code.

One thing that's nice is that NamedTuple is already registered in the pytree registry for both optree and torch (fixing #83), so we no longer need to do that manually as before.
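A minimal sketch of what the new convention looks like (the field names here are illustrative, not the exact posteriors definitions):

```python
from typing import Any, NamedTuple
import torch

# Illustrative only: each posteriors algorithm defines its own state fields.
class TransformState(NamedTuple):
    params: torch.Tensor          # or a TensorTree of parameters
    log_posterior: torch.Tensor
    aux: Any = None               # arbitrary auxiliary output, not necessarily a TensorTree

state = TransformState(params=torch.zeros(3), log_posterior=torch.tensor(0.0))
# NamedTuples are immutable; updates go through _replace, which returns a new tuple.
new_state = state._replace(log_posterior=torch.tensor(-1.2))
```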

There was an issue with the aux handling: modifying aux in place is not possible because we don't know the structure of aux before log_posterior is called, and aux is not guaranteed to be a TensorTree (it could contain strings etc.).

The proposed fix is to return a new state with all other attributes modified in place (i.e. still pointing to the old state's tensors) but with aux replaced.
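As a rough, hedged sketch of that fix, reusing the TransformState sketch above (the update signature is an assumption for illustration, not the actual posteriors code):

```python
def update(state, log_posterior, inplace=False):
    # Hypothetical update step, illustrating only the aux handling.
    new_log_post, aux = log_posterior(state.params)  # aux structure is unknown in advance

    if inplace:
        # Mutate the existing tensor so the returned state's field aliases the old memory...
        state.log_posterior.copy_(new_log_post.detach())
        # ...but aux cannot be written in place (it may not even be a TensorTree),
        # so it is simply swapped onto a fresh NamedTuple via _replace.
        return state._replace(aux=aux)

    # Out-of-place: every field on the returned state is a new object.
    return state._replace(log_posterior=new_log_post.detach(), aux=aux)
```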

KaelanDt (Contributor) left a comment

Looks good to me, but in my opinion it would be good to check how this affects memory consumption and allocation, since you allocate a new state whenever you update it, which may be quite inefficient.

posteriors/types.py (review thread: outdated, resolved)
posteriors/laplace/dense_fisher.py (review thread: resolved)
SamDuffield (Author)

> Looks good to me, but in my opinion it would be good to check how this affects memory consumption and allocation, since you allocate a new state whenever you update it, which may be quite inefficient.

This PR shouldn't affect memory consumption; it just changes the handling of the algorithm states to a better convention.

It would be good to have some numerics on memory consumption, the pros and cons of using inplace, and even whether we should continue to support it.

There is also an element of horses-for-courses: for MCMC-style algorithms, where you want to collect samples along a trajectory, you need inplace=False, whereas for optimization or deep-ensemble-style algorithms, where you only care about the final result, you may prefer inplace=True if it's faster (which I'm not sure about).
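For concreteness, the two usage patterns might look roughly like this toy stand-in (State and update here are assumptions for illustration, not the posteriors API):

```python
import torch
from typing import NamedTuple

class State(NamedTuple):
    params: torch.Tensor

def update(state: State, batch: torch.Tensor, inplace: bool = False) -> State:
    # Toy update: nudge params by the batch mean.
    if inplace:
        state.params.add_(batch.mean())                 # reuse the old tensor's memory
        return state._replace()                          # new tuple, same tensors
    return State(params=state.params + batch.mean())    # freshly allocated tensor

batches = [torch.randn(8) for _ in range(5)]

# MCMC-style: keep the whole trajectory, so each state must own its tensors.
state = State(params=torch.zeros(3))
trajectory = []
for b in batches:
    state = update(state, b, inplace=False)
    trajectory.append(state.params)

# Optimization / deep-ensemble-style: only the final state matters,
# so in-place reuse is fine (and possibly faster).
state = State(params=torch.zeros(3))
for b in batches:
    state = update(state, b, inplace=True)
final_params = state.params
```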

KaelanDt (Contributor)

> Looks good to me, but in my opinion it would be good to check how this affects memory consumption and allocation, since you allocate a new state whenever you update it, which may be quite inefficient.

> This PR shouldn't affect memory consumption; it just changes the handling of the algorithm states to a better convention.
>
> It would be good to have some numerics on memory consumption, the pros and cons of using inplace, and even whether we should continue to support it.
>
> There is also an element of horses-for-courses: for MCMC-style algorithms, where you want to collect samples along a trajectory, you need inplace=False, whereas for optimization or deep-ensemble-style algorithms, where you only care about the final result, you may prefer inplace=True if it's faster (which I'm not sure about).

I'm not sure about this: the previous inplace behaviour would change the elements of a previously allocated object, whereas now you re-allocate a new object every time.

KaelanDt (Contributor) left a comment

Looks good to me, apart from a docstring to change.

SamDuffield (Author) commented Jul 24, 2024

> Looks good to me, but in my opinion it would be good to check how this affects memory consumption and allocation, since you allocate a new state whenever you update it, which may be quite inefficient.

> This PR shouldn't affect memory consumption; it just changes the handling of the algorithm states to a better convention.
>
> It would be good to have some numerics on memory consumption, the pros and cons of using inplace, and even whether we should continue to support it.
>
> There is also an element of horses-for-courses: for MCMC-style algorithms, where you want to collect samples along a trajectory, you need inplace=False, whereas for optimization or deep-ensemble-style algorithms, where you only care about the final result, you may prefer inplace=True if it's faster (which I'm not sure about).

> I'm not sure about this: the previous inplace behaviour would change the elements of a previously allocated object, whereas now you re-allocate a new object every time.

We define a new NamedTuple, but if inplace=True all Tensors are pointers to the same memory as the previous ones (aside from aux, which is not guaranteed to be a TensorTree). This is also checked in the tests.
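A hedged illustration of that aliasing behaviour and the kind of check described (a toy example, not the actual posteriors tests):

```python
import torch
from typing import Any, NamedTuple

class State(NamedTuple):
    params: torch.Tensor
    aux: Any = None

def update_inplace(state: State) -> State:
    state.params.mul_(0.9)                  # mutate the existing tensor in place
    return state._replace(aux="new aux")    # only aux is swapped for a new object

state = State(params=torch.ones(3))
new_state = update_inplace(state)

# The params on the new NamedTuple alias the old tensor's memory...
assert new_state.params.data_ptr() == state.params.data_ptr()
# ...while aux is a genuinely new object on the returned state.
assert new_state.aux == "new aux" and state.aux is None
```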

SamDuffield merged commit e08e729 into main on Jul 24, 2024 (2 checks passed).
SamDuffield deleted the named-tuple-transform-state branch on July 24, 2024 at 10:57.

Successfully merging this pull request may close these issues.

TransformState doesn't work with torch.vmap
2 participants