Pinned Loading
-
pytorch_tzw
pytorch_tzw PublicForked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python
-
Megatron-DeepSpeed
Megatron-DeepSpeed PublicForked from microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python
-
xDiT
xDiT PublicForked from xdit-project/xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters
Python
-
NVIDIA/Megatron-LM
NVIDIA/Megatron-LM PublicOngoing research training transformer models at scale
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.