Replies: 2 comments
-
@narendasan to review
-
Seems good to me if it's just a converter.
-
PyTorch quantization toolkit support in the FX backend
Goal(s)
Support the PyTorch quantization toolkit based QAT sample using the FX backend of Torch-TensorRT.
Usecases
Proposed APIs / UX
The usage is similar to the TorchScript QAT accuracy test here: https://github.com/pytorch/TensorRT/blob/main/tests/py/qat/test_qat_trt_accuracy.py
Limitations
No known limitations at this time
Internal Implementation
Design
We need to write converters for the quantize and dequantize ops in FX. For reference, the equivalent TorchScript converters are here: https://github.com/pytorch/TensorRT/blob/main/core/conversion/converters/impl/quantization.cpp
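For context, here is a minimal sketch of the per-tensor quantize/dequantize arithmetic these converters have to map onto TensorRT layers. The scale value is a made-up example, and the symmetric int8 range is an illustrative assumption; this is not the Torch-TensorRT implementation:

```python
# Illustrative sketch (not the Torch-TRT implementation): the per-tensor
# symmetric int8 quantize/dequantize math the new FX converters would map
# onto TensorRT's quantize/dequantize layers. The scale below is made up.

def quantize_per_tensor(x, scale):
    # round to nearest, then clamp to the int8 range; with symmetric
    # quantization the zero point is 0
    q = []
    for v in x:
        r = round(v / scale)
        q.append(max(-128, min(127, r)))
    return q

def dequantize_per_tensor(q, scale):
    # recover an approximation of the original values
    return [v * scale for v in q]

scale = 0.1
x = [0.07, 1.0, -3.2, 100.0]          # 100.0 saturates at the int8 max
q = quantize_per_tensor(x, scale)
xr = dequantize_per_tensor(q, scale)
```

Values within range round-trip to within scale/2; out-of-range values saturate, which is why the calibration that picks the scale matters for accuracy.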
Extensions Required to Core API implementations
N/A
Data Structures
N/A
Details specific for TorchScript Support
N/A
Details specific for FX support
Write new converters for quantization ops in FX.
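The FX lowering path dispatches to converters looked up by node target. The decorator name, converter signature, and op strings below are hypothetical simplifications, not the real torch_tensorrt.fx API; the sketch only illustrates the registration pattern the new quantization converters would follow:

```python
# Simplified, hypothetical sketch of a converter registry: each converter is
# registered against the op it handles, and lowering looks converters up by
# node target. Real torch_tensorrt.fx converters receive a TensorRT network
# and return TensorRT layer outputs; here we only record which op was handled.

CONVERTERS = {}

def tensorrt_converter(target):
    # hypothetical stand-in for the registry decorator
    def register(fn):
        CONVERTERS[target] = fn
        return fn
    return register

@tensorrt_converter("quantized_ops.quantize_per_tensor")
def convert_quantize(args):
    # a real converter would add a TensorRT quantize layer here
    return ("quantize", args)

@tensorrt_converter("quantized_ops.dequantize")
def convert_dequantize(args):
    # a real converter would add a TensorRT dequantize layer here
    return ("dequantize", args)

def lower(node_target, args):
    # dispatch a graph node to its registered converter
    if node_target not in CONVERTERS:
        raise RuntimeError(f"No converter for {node_target}")
    return CONVERTERS[node_target](args)
```

This is why the TorchScript converters cannot be reused directly: they are registered against TorchScript op schemas, whereas the FX backend needs converters keyed to the targets produced by FX tracing.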
Implementation Phases
Prototype - S
Implement converters for the quantization ops. Once this is done, run the QAT training sample to get a trained nn.Module and compile it with the FX backend.
MVP
(<1.4.0>) - S
Same as the prototype: implement converters for the quantization ops, then run the QAT training sample to get a trained nn.Module and compile it with the FX backend.
Both the prototype and the MVP are the same for this feature; the TorchScript converters cannot be reused.
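As a rough illustration of the acceptance check (the linked QAT test compares the compiled quantized model's results against the reference model), the toy "model", scale, and tolerance below are all made up for the example:

```python
# Illustrative acceptance check: run the same inputs through a reference
# function and a fake-quantized counterpart and require the outputs to stay
# within a tolerance. The "model" here is a made-up scaled sum, not the real
# QAT sample; the real test measures classification accuracy instead.

def reference_model(xs):
    return sum(0.5 * v for v in xs)

def quantized_model(xs, scale=0.05):
    # simulate an int8 quantize/dequantize round trip on each input
    def qdq(v):
        q = max(-128, min(127, round(v / scale)))
        return q * scale
    return sum(0.5 * qdq(v) for v in xs)

inputs = [0.11, 0.42, -0.93, 1.7]
err = abs(reference_model(inputs) - quantized_model(inputs))
# each in-range round trip is off by at most scale/2, so with 4 inputs and
# the 0.5 weight the total error is bounded by 4 * 0.5 * 0.025 = 0.05
```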