Jeffrey Morgan
|
f6cb0a553c
update cuda overhead to 15% or 400MiB
|
1 vuosi sitten |
Jeffrey Morgan
|
2680078c13
fix build on linux
|
1 vuosi sitten |
Jeffrey Morgan
|
f1b7e5f560
update overhead to 15%
|
1 vuosi sitten |
Jeffrey Morgan
|
cb534e6ac2
use 10% vram overhead for cuda
|
1 vuosi sitten |
Jeffrey Morgan
|
08f1e18965
Offload layers to GPU based on new model size estimates (#1850)
|
1 vuosi sitten |
Daniel Hiltgen
|
d74ce6bd4f
Detect very old CUDA GPUs and fall back to CPU
|
1 vuosi sitten |
Daniel Hiltgen
|
a2ad952440
Fix windows system memory lookup
|
1 vuosi sitten |
Daniel Hiltgen
|
d966b730ac
Switch windows build to fully dynamic
|
1 vuosi sitten |
Daniel Hiltgen
|
7555ea44f8
Revamp the dynamic library shim
|
1 vuosi sitten |
Daniel Hiltgen
|
1b991d0ba9
Refine build to support CPU only
|
1 vuosi sitten |
Daniel Hiltgen
|
35934b2e05
Adapted rocm support to cgo based llama.cpp
|
1 vuosi sitten |