Author | SHA1 Message | Date |
---|---|---|
|
fc37c192ae Refine CPU load behavior with system memory visibility | 11 months ago |
|
8727a9c140 Record more GPU information | 1 year ago |
|
34b9db5afc Request and model concurrency | 1 year ago |
|
c336693f07 calculate overhead based number of gpu devices (#1875) | 1 year ago |
|
a2ad952440 Fix windows system memory lookup | 1 year ago |
|
35934b2e05 Adapted rocm support to cgo based llama.cpp | 1 year ago |