Tags · apaz-cli/llama.cpp · GitHub
Skip to content

Tags: apaz-cli/llama.cpp

Tags

b6123

Toggle b6123's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
cuda: refactored ssm_scan and use CUB (ggml-org#13291)

* cuda: refactored ssm_scan to use CUB

* fixed compilation error when when not using CUB

* assign L to constant and use size_t instead of int

* deduplicated functions

* change min blocks per mp to 1

* Use cub load and store warp transpose

* suppress clang warning