q,k,v have different shape but torch.stack works? #202

junsukha · 2023-08-08T01:14:53Z

In labml_nn/diffusion/stable_diffusion/model/unet_attention.py,
def flash_attention(self, q: torch.Tensor, k: torch.Tensor, v: torch.Tensor): has torch.stack((q,k,v), dim=2) where, I believe, q is of different shape from k and v.

How does torch.stack work then?

When I run text_to_image.py,
q, k, v are of shape ([8, 1024, 640]), ([8, 77, 640]), ([8, 77, 640]) respectively.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

q,k,v have different shape but torch.stack works? #202

q,k,v have different shape but torch.stack works? #202

junsukha commented Aug 8, 2023 •

edited

Loading

q,k,v have different shape but torch.stack works? #202

q,k,v have different shape but torch.stack works? #202

Comments

junsukha commented Aug 8, 2023 • edited Loading

junsukha commented Aug 8, 2023 •

edited

Loading