Yes, TensorRT is supported out of the box. However, it adds its own overhead and is not always the best choice in my tests. Kernl runs on top of PyTorch 2.0. For now, PyTorch 2.0 mostly targets training (not inference).
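A minimal sketch of what running a model through PyTorch 2.0's `torch.compile` looks like (the model here is just a stand-in, any `nn.Module` works the same way):

```python
import torch
import torch.nn as nn

# Stand-in model; substitute your own nn.Module.
model = nn.Sequential(nn.Linear(512, 512), nn.GELU(), nn.Linear(512, 512)).cuda().eval()

# torch.compile captures the graph with TorchDynamo and lowers it through the
# default TorchInductor backend; the first call triggers compilation.
compiled = torch.compile(model)

x = torch.randn(8, 512, device="cuda")
with torch.inference_mode():
    out = compiled(x)
```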
TVM was best for non-GPU workloads. Recently they started improving GPU support through CUTLASS, plus adding the possibility to program at the thread-block level (CTAs), but IMO Triton is a better choice for now when Nvidia hardware is your target.
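To show what programming at the thread-block level means in Triton: each program instance corresponds to one CTA, and you index your tile with the program id. A minimal vector-add sketch:

```python
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance (one CTA) handles one BLOCK_SIZE-wide tile.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements  # guard the tail of the tensor
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

x = torch.randn(98432, device="cuda")
y = torch.randn(98432, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)  # one program per tile
add_kernel[grid](x, y, out, x.numel(), BLOCK_SIZE=1024)
```

Triton handles the intra-block thread mapping and memory coalescing for you, which is the main ergonomic win over writing raw CUDA.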
Is there a way to leverage torch 2.0's torch.compile with TensorRT as a backend directly, without all of the current tedious process? https://pytorch.org/docs/stable/dynamo/get-started.html
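Something like this is what I have in mind (assuming torch-tensorrt registers a Dynamo backend when imported; the exact backend name is a guess on my part):

```python
import torch
import torch.nn as nn
import torch_tensorrt  # assumption: importing this registers a TensorRT Dynamo backend

# Check which backend names are actually registered in your install.
print(torch._dynamo.list_backends())

model = nn.Linear(512, 512).cuda().eval()
# The backend name may be "tensorrt" or "torch_tensorrt" depending on the
# torch-tensorrt version; adjust to whatever list_backends() reports.
compiled = torch.compile(model, backend="tensorrt")
out = compiled(torch.randn(8, 512, device="cuda"))
```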
And any thoughts on torch 2.0 in general? Has anyone tried it out?
I've tried it out for a few of the transformer models; there doesn't seem to be any improvement.
@pommedeterresautee @ayoub-louati
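For reference, this is roughly how I measured it; a sketch with a stock nn.TransformerEncoder standing in for the actual models:

```python
import time
import torch
import torch.nn as nn

# Stand-in for the transformer models tested; swap in your own model.
layer = nn.TransformerEncoderLayer(d_model=512, nhead=8, batch_first=True)
model = nn.TransformerEncoder(layer, num_layers=6).cuda().eval()
compiled = torch.compile(model)

x = torch.randn(16, 128, 512, device="cuda")

def bench(fn, iters=50):
    # Warm up (also triggers compilation for the compiled variant).
    for _ in range(5):
        fn(x)
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(iters):
        fn(x)
    torch.cuda.synchronize()
    return (time.perf_counter() - start) / iters

with torch.inference_mode():
    print(f"eager:    {bench(model) * 1e3:.2f} ms/iter")
    print(f"compiled: {bench(compiled) * 1e3:.2f} ms/iter")
```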