Code blocks for writing Triton kernels
Forked from pytorch/ao
Native PyTorch library for quantization and sparsity
Python
Forked from gpu-mode/ring-attention
Optimized kernels for ring-attention [WIP]
Jupyter Notebook 2
Ring attention implementation with flash attention
Python 1k 99
Forked from karpathy/llm.c
LLM training in simple, raw C/CUDA
Cuda
Puffing up reinforcement learning
C 6.1k 515