Michael Yang 7e33a017c0 partial offloading 1 year ago
..
ext_server 0a0e9f3e0f Apply 01-cache.diff 1 year ago
generate 1524f323a3 Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564) 1 year ago
llama.cpp @ 1b67731e18 5ec12cec6c update llama.cpp submodule to `1b67731` (#3561) 1 year ago
patches 0035e31af8 Bump to b2581 1 year ago
ggla.go 8b2c10061c refactor tensor query 1 year ago
ggml.go 7e33a017c0 partial offloading 1 year ago
gguf.go 8b2c10061c refactor tensor query 1 year ago
llm.go 9502e5661f cgo quantize 1 year ago
llm_darwin_amd64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
llm_darwin_arm64.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
llm_linux.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
llm_windows.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
payload.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago
server.go 7e33a017c0 partial offloading 1 year ago
status.go 58d95cc9bd Switch back to subprocessing for llama.cpp 1 year ago