Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ 5.7k 896
The Artifact Evaluation Version of SOSP Paper #19
C++ 54 13
SGLang is a high-performance serving framework for large language models and multimodal models.
Python 29.8k 6.8k
This is the code repository for the paper "LightDSA: Enabling Efficient DSA Through Hardware-Aware Transparent Optimization".
C++ 5
Checkpoint-engine is a simple middleware for updating model weights in LLM inference engines.
Python 966 88
A workload for deploying LLM inference services on Kubernetes.
Go 251 66