Pull requests · SemiAnalysisAI/InferenceX · GitHub
Skip to content

Pull requests: SemiAnalysisAI/InferenceX

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Update DSV4 GB300 Dynamo vLLM Recipes
#2010 opened Jul 3, 2026 by hjjq Collaborator Loading…
[WIP] M3 B300 indexer cache full-sweep-enabled
#2009 opened Jul 3, 2026 by wzhao18 Collaborator Loading…
[WIP] Update Minimax M3 FP4 B200 Eagle full-sweep-enabled
#2007 opened Jul 3, 2026 by wzhao18 Collaborator Loading…
[WIP] Update Minimax M3 FP4 B300 Eagle full-sweep-enabled
#2006 opened Jul 3, 2026 by wzhao18 Collaborator Loading…
CollectiveX v1: cross-vendor EP benchmark suite
#2004 opened Jul 3, 2026 by Oseltamivir Collaborator Loading…
[AMD] MiniMax-M3 FP4/FP8 MI355X ATOMESH (disagg): refactor config & add MTP recipes AMD evals-only Suppress throughput and run only eval jobs; combine with all-evals to expand selection full-sweep-enabled
#2000 opened Jul 3, 2026 by seungrokj Collaborator Loading…
8 tasks
[WIP] Test Kimi 2.5 B300 Agg full-sweep-enabled
#1998 opened Jul 3, 2026 by wzhao18 Collaborator Loading…
[DNM][AMD] agentX benchmark (v1.0)
#1996 opened Jul 3, 2026 by seungrokj Collaborator Loading…
chore(deps): bump the github-actions group across 1 directory with 3 updates dependencies Pull requests that update a dependency file github_actions Pull requests that update GitHub Actions code
#1995 opened Jul 3, 2026 by dependabot Bot Loading…
Update Minimax M3 B300 FP4 vllm full-sweep-enabled
#1994 opened Jul 2, 2026 by wzhao18 Collaborator Loading…
[WIP] [do not merge] Add MiniMax-M3 FP4 B200 Dynamo-vLLM disagg config full-sweep-fail-fast-no-canary Full sweep, no canary gate; first failure in a matrix cancels that matrix
#1982 opened Jul 2, 2026 by jasonlizhengjian Collaborator Loading…
test the GB300 cluster after the node patch full-sweep-enabled
#1961 opened Jun 30, 2026 by richardhuo-nv Collaborator Loading…
Update Qwen3.5 FP4 MI355X MTP recipe with tuned env/flags
#1957 opened Jun 29, 2026 by amd-fuyuajin Collaborator Loading…
[AMD] Enable AITER MoE for MiniMax-M3 MI355X vLLM MTP benchmarks
#1955 opened Jun 29, 2026 by Fangzhou-Ai Collaborator Draft
2 of 3 tasks
Add MTP evaluation
#1953 opened Jun 29, 2026 by hjjq Collaborator Loading…
ProTip! Follow long discussions with comments:>50.