.. |
ext_server
|
730dcfcc7a
Refine debug logging for llm
|
1 year ago |
generate
|
0f5b843319
Refine Accelerate usage on mac
|
1 year ago |
llama.cpp @ 011e8ec577
|
ffaf52e1e9
update submodule to `011e8ec577fd135cbc02993d3ea9840c516d6a1c`
|
1 year ago |
dyn_ext_server.c
|
6a042438af
Switch to local dlopen symbols
|
1 year ago |
dyn_ext_server.go
|
3bc28736cd
Merge pull request #2143 from dhiltgen/llm_verbosity
|
1 year ago |
dyn_ext_server.h
|
39928a42e8
Always dynamically load the llm server library
|
1 year ago |
ggml.go
|
eaed6f8c45
add max context length check
|
1 year ago |
gguf.go
|
cd22855ef8
refactor tensor read
|
1 year ago |
llama.go
|
4a33cede20
remove unused fields and functions
|
1 year ago |
llm.go
|
4458efb73a
Load all layers on `arm64` macOS if model is small enough (#2149)
|
1 year ago |
payload_common.go
|
dc88cc3981
use `gzip` for runner embedding (#2067)
|
1 year ago |
payload_darwin_amd64.go
|
1b249748ab
Add multiple CPU variants for Intel Mac
|
1 year ago |
payload_darwin_arm64.go
|
1b249748ab
Add multiple CPU variants for Intel Mac
|
1 year ago |
payload_linux.go
|
1b249748ab
Add multiple CPU variants for Intel Mac
|
1 year ago |
payload_test.go
|
7427fa1387
Fix up the CPU fallback selection
|
1 year ago |
payload_windows.go
|
1b249748ab
Add multiple CPU variants for Intel Mac
|
1 year ago |
utils.go
|
fccf8d179f
partial decode ggml bin for more info
|
1 year ago |