Releases: EnzymeAD/Reactant.jl
v0.2.7
v0.2.6
Reactant v0.2.6
Merged pull requests:
- feat: functionalities for supporting NeuralOperators.jl (#217) (@avik-pal)
- CompatHelper: add new compat entry for AbstractFFTs in [weakdeps] at version 1, (keep existing compat) (#229) (@github-actions[bot])
- feat: overload `mul!` (#230) (@avik-pal)
- feat: extend `mul!` work for subarrays (#231) (@avik-pal)
- fix abs of complex number (#239) (@Pangoraw)
- feat: support setindex with views (#240) (@avik-pal)
- fix: bypass segfault with fill complex (#245) (@avik-pal)
- feat: add support for `clamp` and `clamp!` (#247) (@avik-pal)
- fix: reduction of complex arrays (#248) (@avik-pal)
- feat: add support for NNlib make causal mask (#249) (@avik-pal)
- feat: partial NNlib.gather support + better indexing support (#252) (@avik-pal)
- Add 32bit CI (#254) (@wsmoses)
- implement `@trace` for (#255) (@Pangoraw)
- Format code of branch "main" (#257) (@github-actions[bot])
- feat: more coverage for NNlib functions (#258) (@avik-pal)
- feat: inherit scalar indexing functionality from GPUArraysCore (#268) (@avik-pal)
- Disable verifier in main pass manager pipeline (#269) (@Pangoraw)
- chore: bump ReactantCore version (#270) (@avik-pal)
- fix: add GPU backend as default if available (#272) (@avik-pal)
- Use gensym-ed variable in `@compile` and `@jit` (#274) (@jumerckx)
Closed issues:
- `YaoBlocks.cunmat` has problems during tracing (#174)
- Lux example MLP with TrainState API throws JIT session errors (#220)
- Use `GPUArraysCore` for scalar indexing flags (#232)
- Reactant fails to compile a function accepting complex parameters (#235)
- Name clash when calling `Reactant.@compile` over a function that is named `f` (#237)
- `getindex` assumes static indexing (#243)
- `mlirDenseElementsAttrSplatGet` with ComplexNumbers lead to a SegFault (#244)
- conv! (#251)
- 32-Bit CI (#253)
v0.2.5
Reactant v0.2.5
Merged pull requests:
v0.2.4
Reactant v0.2.4
Merged pull requests:
- Move Julia bindings to MLIR dialects out of JLL (#166) (@mofeing)
- Implement `conj`, `conj!` for `TracedRArray` (#169) (@mofeing)
- feat: allow passing an optimize option to MLIR (#171) (@avik-pal)
- Format code of branch "main" (#175) (@github-actions[bot])
- Regenerate MLIR Dialects (#177) (@github-actions[bot])
- Update Project.toml (#179) (@wsmoses)
- Fix `similar` on `TracedRArray` (#184) (@mofeing)
- feat: support conditionals (#185) (@avik-pal)
- Improve GitHub Actions settings (#186) (@giordano)
- build(deps): bump codecov/codecov-action from 1 to 4 (#187) (@dependabot[bot])
- build(deps): bump peter-evans/create-pull-request from 6 to 7 (#188) (@dependabot[bot])
- Add needs primal for forward mode (#192) (@wsmoses)
- fix primitive_type for complex (#193) (@Pangoraw)
- Add jit macro for simplified testing (#194) (@wsmoses)
- complex DenseElementsAttribute (#195) (@Pangoraw)
- Make libMLIR_h automatically regenerate (#197) (@Pangoraw)
- prevent docs warning (#199) (@Pangoraw)
- update libMLIR_h bindings and fix bindings update step n°2 (#202) (@Pangoraw)
- Refactor regenerate-mlir-bindings.yml workflow (#204) (@mofeing)
- feat: support tracing scalars (#205) (@avik-pal)
- Add C-API for constructing Complex Attributes (#208) (@mofeing)
- feat: allow type-casting numbers to tracednumbers (#209) (@avik-pal)
- feat: specialize dispatches for faster concrete array generation (#213) (@avik-pal)
- feat: add `sync` to synchronize all result buffers (#216) (@avik-pal)
- fix complex attr c-api (#218) (@mofeing)
- fix resource exhaustion bug #190 (#219) (@mofeing)
- Refactor `mlirComplexAttrDoubleGet`, `mlirComplexAttrDoubleGetChecked` (#221) (@mofeing)
- feat: allow conversion to numbers (#222) (@avik-pal)
- Regenerate MLIR Bindings (#223) (@github-actions[bot])
- refactor: use `@jit` in the tests (#224) (@avik-pal)
- feat: make compile and code_hlo macros work with broadcasting (#225) (@avik-pal)
Closed issues:
- Shortcuts for `to_rarray` (#138)
- Infinite recursion when compiling a function (#168)
- `ConcreteRArray` on `Complex` array zeroes the buffer (#170)
- `XlaRuntimeError` when passing a `Complex` buffer to a compiled function (#172)
- Forward-mode differentiation is broken (#183)
- Resource exhaustion when compiling many functions (#190)
- Error on regenerating "libMLIR_h.jl" on GitHub runner (#201)
v0.2.3
Reactant v0.2.3
Merged pull requests:
- Export `GetDefaultTargetTriple` symbol (#160) (@mofeing)
- feat: implement a separate `TracedRNumber` (#161) (@avik-pal)
- Generalize `Base._cat` to non-`Val`, typed `Base._cat_t` and implement `typed_hcat`, `typed_vcat`, `typed_hvcat`, `typed_hvncat` (#163) (@mofeing)
- Bypass identical construction on `TracedRArray` (#165) (@mofeing)
- test: separate out test groups (#167) (@avik-pal)
Closed issues:
v0.2.2
Reactant v0.2.2
Merged pull requests:
- feat: compiling vision transformers (#130) (@avik-pal)
- feat: robust handling of wrapped arrays of reactant arrays (#133) (@avik-pal)
- Replace `getfield` for `traced_getfield` in `set!` (#134) (@mofeing)
- perf: benchmarking CI (#136) (@avik-pal)
- Add fallback gpu setindex (#137) (@wsmoses)
- Bump dependencies (#143) (@wsmoses)
- Build (#145) (@wsmoses)
- feat: generalize indexing to all wrappers (#146) (@avik-pal)
- Arm64 (#147) (@wsmoses)
- Format code of branch "main" (#153) (@github-actions[bot])
- fix: broadcasted type casting (#156) (@avik-pal)
- Fix vararg (#158) (@wsmoses)
Closed issues:
- Handling exponents (#27)
- Inconsistent semantics of element-wise application of array functions compared to Julia (#51)
- NNlib activations cannot be compiled without manual dispatches (#54)
- Linear algebra on `TracedRArray` (#131)
- scalar broadcasting not functional (#135)
- `libdevice` not found (#149)
- Vararg compilation broken (#151)
- Can I compile in-place functions? (#152)
- `Optimisers.jl` fails to compile (#154)
- Type-casting doesn't compile (#155)
- Unbreak Reactant benchmark CI (#157)
v0.2.1
Reactant v0.2.1
Merged pull requests:
- Format code of branch "main" (#101) (@github-actions[bot])
- Compile NNlib.maxpool and NNlib.meanpool (#102) (@Pangoraw)
- feat: handle cat/hcat/vcat (#103) (@avik-pal)
- feat: implementing view/getindex/setindex (#104) (@avik-pal)
- perf: test compile times for neural networks -- VGG16 (#105) (@avik-pal)
- fix: local JLL build script (#108) (@avik-pal)
- NNlib: forward feature group counts to convolution (#109) (@Pangoraw)
- tests: more comprehensive NN testing (#110) (@avik-pal)
- ci(github-actions): update the formatter action (#111) (@avik-pal)
- Format code of branch "main" (#112) (@github-actions[bot])
- Format code of branch "main" (#113) (@github-actions[bot])
- feat: allow building for CPU target on CUDA-enabled machines (#114) (@avik-pal)
- NNlib: handle kernel flip with reverse instead of window_reversal (#115) (@Pangoraw)
- Format code of branch "main" (#116) (@github-actions[bot])
- Configure CompatHelper workflow (#118) (@mofeing)
- chore: testing Enzyme 0.13 (#119) (@avik-pal)
- Fix non-used argument (#123) (@wsmoses)
- Fix typo in `mlirIsNull` call (#124) (@mofeing)
- Format code of branch "main" (#125) (@github-actions[bot])
- Refactor `compile` and test buffer donation (#126) (@mofeing)
- Replace `IdDict` for `OrderedIdDict` (#128) (@mofeing)
- CompatHelper: add new compat entry for OrderedCollections at version 1, (keep existing compat) (#129) (@github-actions[bot])
Closed issues:
v0.2.0
Reactant v0.2.0
Merged pull requests:
- Move shape information of `RArray` to field (#58) (@mofeing)
- Refactor `traced_type` to allow customization for external types (#59) (@mofeing)
- Despecialize some methods to lower excessive codegen (#61) (@mofeing)
- Add benchmarking infrastructure (#65) (@mofeing)
- Tpu (#66) (@wsmoses)
- Refactor compilation (#68) (@mofeing)
- Format code of branch "main" (#69) (@github-actions[bot])
- Format code of branch "main" (#70) (@github-actions[bot])
- ci(buildkite): fix scripts (#71) (@avik-pal)
- Fix `permutedims` on `TracedRArray` (#72) (@mofeing)
- Fix broadcasting on `ConcreteRArray` (#73) (@mofeing)
- Small fixes on abstract interpreter (#74) (@mofeing)
- Try hermetic cuda (#75) (@wsmoses)
- Implement `create_result` method for `AbstractDict` (#76) (@mofeing)
- Implement `only` for `TracedRArray` (#77) (@mofeing)
- Reuse code cache in abstract interpretation to minimize `compile` time (#78) (@mofeing)
- TPU index setup (#81) (@wsmoses)
- Format code of branch "main" (#82) (@github-actions[bot])
- Export `LoadPjrtPlugin`, `InitializePjrtPlugin`, `GetCApiClient` functions (#84) (@mofeing)
- Compile NNlib convolution (#87) (@Pangoraw)
- Format code of branch "main" (#89) (@github-actions[bot])
- Cleanup api (#90) (@wsmoses)
- Improve FFI (#91) (@wsmoses)
- Format code of branch "main" (#92) (@github-actions[bot])
- Fix libtpu auto download (#93) (@wsmoses)
- CUDA build local (#94) (@wsmoses)
- Format code of branch "main" (#95) (@github-actions[bot])
- Add back support for buffer donation (#96) (@mofeing)
Closed issues: