Daniel Hiltgen
|
8727a9c140
Record more GPU information
|
1 سال پیش |
Daniel Hiltgen
|
34b9db5afc
Request and model concurrency
|
1 سال پیش |
Michael Yang
|
7e33a017c0
partial offloading
|
1 سال پیش |
Michael Yang
|
91b3e4d282
update memory calcualtions
|
1 سال پیش |
Daniel Hiltgen
|
6d84f07505
Detect AMD GPU info via sysfs and block old cards
|
1 سال پیش |
Daniel Hiltgen
|
8da7bef05f
Support multiple variants for a given llm lib type
|
1 سال پیش |
Jeffrey Morgan
|
c336693f07
calculate overhead based number of gpu devices (#1875)
|
1 سال پیش |
Daniel Hiltgen
|
a2ad952440
Fix windows system memory lookup
|
1 سال پیش |
Daniel Hiltgen
|
d966b730ac
Switch windows build to fully dynamic
|
1 سال پیش |
Daniel Hiltgen
|
7555ea44f8
Revamp the dynamic library shim
|
1 سال پیش |
Daniel Hiltgen
|
35934b2e05
Adapted rocm support to cgo based llama.cpp
|
1 سال پیش |