
Inconsistent quantization format (PAMS, DAQ, CADYQ) #4

@makhovds

Description


Good day, and thanks for the ablation study presented in the paper. I wonder, though, why the quantization format differs between the PAMS, DAQ and CaDYQ repos.
From Fig. 1 of the PAMS paper we expect the following quantization format in a ResBlock:
[Figure: quantized ResBlock layout from the PAMS paper, Fig. 1]
From the PAMS repo we have the following code for the forward method:

def forward(self, x):
    residual = self.quant_act1(self.shortcut(x))
    body = self.body(x).mul(self.res_scale)
    res = self.quant_act3(body)
    res += residual

    return res
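For concreteness, here is how I read the two orderings, as a small standalone sketch. `fake_quant`, `resblock_repo` and `resblock_fig1` are hypothetical names, and the max-based quantizer is only a stand-in for the repo's `quant_act*` layers, which learn a clipping range:

```python
import numpy as np

def fake_quant(x, bits=8):
    # Naive symmetric uniform fake-quantizer; a stand-in for quant_act*,
    # which learns its clipping range rather than using max(|x|).
    m = float(np.max(np.abs(x)))
    if m == 0.0:
        return x
    scale = m / (2 ** (bits - 1) - 1)
    return np.round(x / scale) * scale

def resblock_repo(x, conv1, conv2, res_scale=1.0):
    # Ordering in the quoted forward(): the body sees the *unquantized*
    # input; only the shortcut and the body output are quantized.
    residual = fake_quant(x)                              # quant_act1
    body = conv2(np.maximum(conv1(x), 0.0)) * res_scale   # conv-relu-conv
    return fake_quant(body) + residual                    # quant_act3

def resblock_fig1(x, conv1, conv2, res_scale=1.0):
    # Ordering implied by Fig. 1 as I read it: the input to the first
    # conv is quantized as well.
    xq = fake_quant(x)
    body = conv2(np.maximum(conv1(xq), 0.0)) * res_scale
    return fake_quant(body) + xq
```

With identity "convs" the two functions differ only in whether `conv1` consumes `x` or `fake_quant(x)`, which is exactly the discrepancy I am asking about.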

From DAQ repo:

def forward(self, x):
    if self.a_bit != 32:
        out = self.quant1(x)
    else:
        out = x

    out = self.conv1(out)
    # if self.bn:
    #     out = self.BN1(out)

    out1 = self.act(out)

    if self.a_bit != 32:
        out1 = self.quant2(out1)

    res = self.conv2(out1)
    # if self.bn:
    #     res = self.BN2(res)
    res = res.mul(self.res_scale)

    res += x
    return res

From CaDYQ repo:

def forward_ori(self, x):
    weighted_bits = x[4]
    f = x[3]
    bits = x[2]
    grad = x[0]
    x = x[1]

    x = self.shortcut(x)
    grad, x, bits, weighted_bits = self.bitsel1([grad, x, bits, weighted_bits])  # cadyq
    residual = x
    x = self.body[0:2](x)  # conv-relu
    grad, x, bits, weighted_bits = self.body[2]([grad, x, bits, weighted_bits])  # cadyq
    out = self.body[3](x)  # conv
    f1 = out
    out = out.mul(self.res_scale)
    out = self.quant_act3(out)
    out += residual
    if self.loss_kdf:
        if f is None:
            f = f1.unsqueeze(0)
        else:
            f = torch.cat([f, f1.unsqueeze(0)], dim=0)
    else:
        f = None

    return [grad, out, bits, f, weighted_bits]
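Stripped of the real conv layers and learned quantizers, my reading of the three residual-addition schemes quoted above boils down to the following sketch. `q` and `branch` are hypothetical stand-ins; the actual quantizers are `quant_act*`, `quant1`/`quant2` and `bitsel1`:

```python
import numpy as np

def q(x, bits=8):
    # Toy symmetric fake-quantizer; the real layers learn their ranges.
    m = float(np.max(np.abs(x)))
    if m == 0.0:
        return x
    s = m / (2 ** (bits - 1) - 1)
    return np.round(x / s) * s

def branch(x):
    # Placeholder for the conv-relu-conv body (identity convs here).
    return np.maximum(x, 0.0)

x = np.linspace(-1.0, 1.0, 16)

pams_out  = q(branch(x)) + q(x)     # body on FP x; quantized skip
daq_out   = branch(q(x)) + x        # body on quantized x; FP skip
cadyq_out = q(branch(q(x))) + q(x)  # quantized body input, output and skip
```

This is the core of questions 2-4 below: each repo quantizes a different subset of {body input, body output, skip connection} before the sum.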

So I have the following questions:

  1. Why do we add a shortcut module when it has no effect on the input?
  2. Am I right that the PAMS repo doesn't quantize the input to the first conv, even though Fig. 1 in their paper implies that quantization, and that you therefore added it in DAQ and CaDYQ?
  3. Why do you quantize the output of the final conv? Again, Fig. 1 in the PAMS paper doesn't imply it, but it does seem more hardware-friendly (summing two quantized values instead of an FP value and a quantized one, since the latter requires casting to a common data type).
  4. In the DAQ paper you don't quantize the skip connection before the sum (you add the original input x), while in CaDYQ you do (the residual variable is the quantized version of x). Is that correct, and if so, what is the reason for the difference?
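The data-type point in question 3 can be seen directly in NumPy's type-promotion rules (a generic illustration, not code from any of the repos):

```python
import numpy as np

# Summing two quantized (int8) tensors stays in int8, while summing an
# int8 tensor with an FP32 one forces the int8 operand to be upcast.
a = np.array([10, -20, 30], dtype=np.int8)
b = np.array([1, 2, 3], dtype=np.int8)
c = np.array([0.5, 1.5, 2.5], dtype=np.float32)

print((a + b).dtype)  # int8
print((a + c).dtype)  # float32
```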

Thank you very much in advance for your answers!
