.. |
ext_server
|
de781b37c8
rm unused infill
|
11 months ago |
generate
|
7ca9605f54
speed up tests by only building static lib (#4740)
|
11 months ago |
llama.cpp @ 74f33adf5f
|
95af97b9f3
server: try github.com/minio/sha256-simd
|
11 months ago |
patches
|
22f5c12ced
Update llama.cpp submodule to `5921b8f0` (#4731)
|
11 months ago |
filetype.go
|
d6f692ad1a
Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322)
|
11 months ago |
ggla.go
|
171eb040fc
simplify safetensors reading
|
11 months ago |
ggml.go
|
d51f15257c
Update llm/ggml.go
|
11 months ago |
gguf.go
|
171eb040fc
simplify safetensors reading
|
11 months ago |
llm.go
|
763bb65dbb
use `int32_t` for call to tokenize (#4738)
|
11 months ago |
llm_darwin_amd64.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
llm_darwin_arm64.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
llm_linux.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
llm_windows.go
|
058f6cd2cc
Move nested payloads to installer and zip file on windows
|
1 year ago |
memory.go
|
4cc3be3035
Move envconfig and consolidate env vars (#4608)
|
11 months ago |
payload.go
|
058f6cd2cc
Move nested payloads to installer and zip file on windows
|
1 year ago |
server.go
|
a50a87a7b8
partial offloading: allow flash attention and disable mmap (#4734)
|
11 months ago |
status.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |