-
-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New layer architecture #159
Open
hweom
wants to merge
5
commits into
fff-rs:master
Choose a base branch
from
hweom:new_layers
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Commits on Mar 12, 2022
-
1. Static network graph is separated from invocation context. a) Static graph captures layers, connections between them and shapes of the units of data. b) Invocation context specifies the batch size and stores all data associated with an invocation (data, gradients). 2. Batch size is now explicit in the context instead of being implicitly extracted by layers from incoming data. 3. Separation into Layer and ILayer is now gone, everything is now handled in layer implementations (with "leaf" layers focusing on data manipulations while container layers focusing on network composition). This is still a very early prototype not intended for mergin: 1. Solver architecture not changed and just crudely hacked to support new network architecture. 2. Shared weights not supported. 3. Serialization not supported.
Mikhail Balakhno authored and Mikhail Balakhno committedMar 12, 2022 Configuration menu - View commit details
-
Copy full SHA for 76b9899 - Browse repository at this point
Copy the full SHA 76b9899View commit details
Commits on Apr 28, 2022
-
1. Static network graph is separated from invocation context. a) Static graph captures layers, connections between them and shapes of the units of data. b) Invocation context specifies the batch size and stores all data associated with an invocation (data, gradients). 2. Batch size is now explicit in the context instead of being implicitly extracted by layers from incoming data. 3. Separation into Layer and ILayer is now gone, everything is now handled in layer implementations (with "leaf" layers focusing on data manipulations while container layers focusing on network composition). 4. Solvers replaced by a more linear architecture of a top-level Trainer and different Optimizers (although only SGD with momentum is currently supported since both RMSprop and Adam require squaring backend support). This is still a very early prototype not intended for mergin: 1. Shared weights not supported. 2. Serialization not supported. 3. Not all layers are migrated.
Mikhail Balakhno authored and Mikhail Balakhno committedApr 28, 2022 Configuration menu - View commit details
-
Copy full SHA for cdc5d6f - Browse repository at this point
Copy the full SHA cdc5d6fView commit details
Commits on May 7, 2022
-
1. Static network graph is separated from invocation context. a) Static graph captures layers, connections between them and shapes of the units of data. b) Invocation context specifies the batch size and stores all data associated with an invocation (data, gradients). 2. Batch size is now explicit in the context instead of being implicitly extracted by layers from incoming data. 3. Separation into Layer and ILayer is now gone, everything is now handled in layer implementations (with "leaf" layers focusing on data manipulations while container layers focusing on network composition). 4. Solvers replaced by a more linear architecture of a top-level Trainer and different Optimizers (SGD with momentum and Adam are currently supported). This is still a very early prototype not intended for mergin: 1. Shared weights not supported. 2. Serialization not supported. 3. Not all layers are migrated.
Mikhail Balakhno authored and Mikhail Balakhno committedMay 7, 2022 Configuration menu - View commit details
-
Copy full SHA for 9344920 - Browse repository at this point
Copy the full SHA 9344920View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7aaa0df - Browse repository at this point
Copy the full SHA 7aaa0dfView commit details
Commits on Jul 2, 2022
-
Mikhail Balakhno authored and Mikhail Balakhno committed
Jul 2, 2022 Configuration menu - View commit details
-
Copy full SHA for eeb0bea - Browse repository at this point
Copy the full SHA eeb0beaView commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.