Daniel Hiltgen 5784c05397 Merge pull request #5854 from dhiltgen/win_exit_status 10 月之前
..
ext_server b9f5e16c80 Introduce `/api/embed` endpoint supporting batch embedding (#5127) 10 月之前
generate 283948c83b Adjust windows ROCm discovery 10 月之前
llama.cpp @ d94c6e0ccb f8fedbda20 Update llama.cpp submodule commit to `d94c6e0c` (#5805) 10 月之前
patches f8fedbda20 Update llama.cpp submodule commit to `d94c6e0c` (#5805) 10 月之前
filetype.go d6f692ad1a Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) 1 年之前
ggla.go cb42e607c5 llm: speed up gguf decoding by a lot (#5246) 11 月之前
ggml.go 5a739ff4cb chatglm graph 10 月之前
ggml_test.go cb42e607c5 llm: speed up gguf decoding by a lot (#5246) 11 月之前
gguf.go 4a565cbf94 add chat and generate tests with mock runner 10 月之前
llm.go 10e768826c fix: quant err message (#5616) 10 月之前
llm_darwin_amd64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前
llm_darwin_arm64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前
llm_linux.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 年之前
llm_windows.go 058f6cd2cc Move nested payloads to installer and zip file on windows 1 年之前
memory.go 8e0641a9bf handle asymmetric embedding KVs 11 月之前
memory_test.go cb42e607c5 llm: speed up gguf decoding by a lot (#5246) 11 月之前
payload.go 0e982bc1f4 Fix corner cases on tmp cleaner on mac 10 月之前
server.go a3c20e3f18 Refine error reporting for subprocess crash 10 月之前
status.go 4d71c559b2 fix error detection by limiting model loading error parsing (#5472) 10 月之前