Thanks for the great work!
https://github.com/cuda-mode/ring-attention/blob/main/ring-llama/test.ipynb
I can load the model with `LlamaRingFlashAttention` and move it to the device, but when I run `y = model.generate` I see:

RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.

What did I miss? Thanks in advance!
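For context, a minimal sketch of what I believe the error is asking for: ring attention shards the sequence across `torch.distributed` ranks, so its collectives need a default process group even for a local test. The `MASTER_ADDR`/`MASTER_PORT` values and `world_size=1` below are placeholder assumptions for a single-process smoke test, not something taken from the notebook:

```python
# Sketch (assumption): initialize a default process group before generate.
# A real ring-attention run would normally be launched with torchrun
# across several GPUs; this only creates a trivial single-rank group.
import os
import torch.distributed as dist

os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")

if not dist.is_initialized():
    dist.init_process_group(backend="nccl", rank=0, world_size=1)

# y = model.generate can then be retried with the process group in place.
```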
Also, I was seeing

RuntimeError: Sizes of tensors must match except in dimension 2. Expected size 32 but got size 4 for tensor number 1 in the list.

with another Llama-based model, `01-ai/Yi-6B-200K`, when using `LlamaRingFlashAttention`; it worked fine with `LlamaFlashAttention2`.
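A hedged guess at this second error: the 32-vs-4 mismatch looks like grouped-query attention, where the model has fewer key/value heads than query heads, and the ring attention path may not be repeating the KV heads the way `LlamaFlashAttention2` does. One quick way to check the head counts (the comments reflect my expectation, not verified output):

```python
# Hypothetical check (not from the issue): inspect the head counts of the
# failing model to see whether grouped-query attention explains the
# size-32-vs-size-4 mismatch.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("01-ai/Yi-6B-200K")
print("num_attention_heads:", cfg.num_attention_heads)  # query heads
print("num_key_value_heads:", cfg.num_key_value_heads)  # key/value heads
```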