waltforme

Jun Duan waltforme

Achievements

vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 84.3k 18.5k
llm-d-incubation/llm-d-fast-model-actuation llm-d-incubation/llm-d-fast-model-actuation Public

Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping

Go 16 16
kubestellar/kubestellar kubestellar/kubestellar Public

KubeStellar - a flexible solution for multi-cluster configuration management for edge, multi-cloud, and hybrid cloud

Go 691 297
random random Public

A collection of random stuff that is worth documenting and sharing

Python