We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.
You must be logged in to block users.
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Flash Attention in ~100 lines of CUDA (forward pass only)
Cuda 1.2k 114
My CUDA solution to the 1BRC
Cuda 11 3
Mixed precision training from scratch with Tensors and CUDA
Python 30 4
a minimal cache manager for PagedAttention, on top of llama3.
Python 146 12
DIY Instagram Chat Automation with Google Sheets
HTML 240 29
Tile primitives for speedy kernels
Cuda 3.5k 300
There was an error while loading. Please reload this page.