Jeffrey Morgan ce0dc33cb8 llm: patch to fix qwen 2 temporarily on nvidia (#4897) 10 months ago
..
ext_server 829ff87bd1 revert tokenize ffi (#4761) 11 months ago
generate 7ca9605f54 speed up tests by only building static lib (#4740) 11 months ago
llama.cpp @ 5921b8f089 22f5c12ced Update llama.cpp submodule to `5921b8f0` (#4731) 11 months ago
patches ce0dc33cb8 llm: patch to fix qwen 2 temporarily on nvidia (#4897) 10 months ago
filetype.go d6f692ad1a Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) 11 months ago
ggla.go 171eb040fc simplify safetensors reading 11 months ago
ggml.go 9b6c2e6eb6 detect chat template from KV 10 months ago
gguf.go e40145a39d lint 11 months ago
llm.go 829ff87bd1 revert tokenize ffi (#4761) 11 months ago
llm_darwin_amd64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
llm_darwin_arm64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
llm_linux.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
llm_windows.go 058f6cd2cc Move nested payloads to installer and zip file on windows 1 year ago
memory.go 6297f85606 gofmt, goimports 11 months ago
payload.go 04f3c12bb7 replace x/exp/slices with slices 11 months ago
server.go e40145a39d lint 11 months ago
status.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago