Blake Mizerany 95af97b9f3 server: try github.com/minio/sha256-simd 11 月之前
..
ext_server de781b37c8 rm unused infill 11 月之前
generate 7ca9605f54 speed up tests by only building static lib (#4740) 11 月之前
llama.cpp @ 74f33adf5f 95af97b9f3 server: try github.com/minio/sha256-simd 11 月之前
patches 22f5c12ced Update llama.cpp submodule to `5921b8f0` (#4731) 11 月之前
filetype.go d6f692ad1a Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) 11 月之前
ggla.go 171eb040fc simplify safetensors reading 11 月之前
ggml.go d51f15257c Update llm/ggml.go 11 月之前
gguf.go 171eb040fc simplify safetensors reading 11 月之前
llm.go 763bb65dbb use `int32_t` for call to tokenize (#4738) 11 月之前
llm_darwin_amd64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前
llm_darwin_arm64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前
llm_linux.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前
llm_windows.go 058f6cd2cc Move nested payloads to installer and zip file on windows 1 年之前
memory.go 4cc3be3035 Move envconfig and consolidate env vars (#4608) 11 月之前
payload.go 058f6cd2cc Move nested payloads to installer and zip file on windows 1 年之前
server.go a50a87a7b8 partial offloading: allow flash attention and disable mmap (#4734) 11 月之前
status.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前