OpenSource/ollama @ e873841cbb38d9d8f1b058e1338d88eaffbf9afa

Michael Yang e873841cbb deepseek v2 graph	11 months ago
ext_server	fb9cdfa723 Fix server.cpp for the new cuda build macros	11 months ago
generate	b0930626c5 Add back lower level parallel flags	11 months ago
llama.cpp @ 7c26775adb	152fc202f5 llm: update llama.cpp commit to `7c26775` (#4896)	11 months ago
patches	152fc202f5 llm: update llama.cpp commit to `7c26775` (#4896)	11 months ago
filetype.go	d6f692ad1a Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322)	1 year ago
ggla.go	171eb040fc simplify safetensors reading	1 year ago
ggml.go	e873841cbb deepseek v2 graph	11 months ago
gguf.go	7bdcd1da94 Revert "Merge pull request #4938 from ollama/mxyng/fix-byte-order"	11 months ago
llm.go	829ff87bd1 revert tokenize ffi (#4761)	11 months ago
llm_darwin_amd64.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago
llm_darwin_arm64.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago
llm_linux.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago
llm_windows.go	058f6cd2cc Move nested payloads to installer and zip file on windows	1 year ago
memory.go	359b15a597 Handle models with divergent layer sizes	11 months ago
memory_test.go	6f351bf586 review comments and coverage	11 months ago
payload.go	6f351bf586 review comments and coverage	11 months ago
server.go	7784ca33ce Tighten up memory prediction logging	11 months ago
status.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago