In line 465 I set quantize to true, so all values are quantized. In line 481 I simply replace the FP32 record with the results from the quantized model.
Thanks for replying. Yes, quantize is set to True. But in the record that is used for optimization, only the first batch is updated (the replacement you mentioned).
This is because cached_input_output is organized as [layer1, layer2, ...] and each layer is [batch1, batch2, ...]. So in line 481, only the first batch's FP32 record is replaced with the first batch of quantized values.
CalibTIP/main.py, line 481 (commit 69077c9): the [0] indexes into the first batch.
Isn't sequential AdaQuant supposed to update the input cache of all batches with the quantized values?
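To make the concern concrete, here is a minimal sketch of the structure being described (all names are illustrative placeholders, not the repository's actual variables or values):

```python
# cached_input_output: one entry per layer; each entry holds
# per-batch FP32 records (illustrative placeholder strings).
cached_input_output = [
    ["layer1_batch1_fp32", "layer1_batch2_fp32"],  # layer 1
    ["layer2_batch1_fp32", "layer2_batch2_fp32"],  # layer 2
]

# Quantized outputs of layer 1, assumed available for every batch.
quantized_batches = ["layer1_batch1_q", "layer1_batch2_q"]

layer_idx = 0

# What the issue describes happening: only index [0] (the first
# batch) of the layer's record is overwritten with quantized values.
cached_input_output[layer_idx][0] = quantized_batches[0]

# What the question expects sequential AdaQuant to do: overwrite
# every batch, so subsequent layers see fully quantized inputs.
for b, q in enumerate(quantized_batches):
    cached_input_output[layer_idx][b] = q
```

Under the first behavior, batch 2 of layer 1 would still feed FP32 activations to layer 2 during its calibration, which is the discrepancy being asked about.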