OpenSource/ollama @ jmorganca/llama-cpp-7c26775

jmorganca 9b5b69c00f llm: update llama.cpp submodule to `7c26775`		10 months ago
..
ext_server	fb9cdfa723 Fix server.cpp for the new cuda build macros	10 months ago
generate	0577af98f4 More parallelism on windows generate	10 months ago
llama.cpp @ 7c26775adb	9b5b69c00f llm: update llama.cpp submodule to `7c26775`	10 months ago
patches	ce0dc33cb8 llm: patch to fix qwen 2 temporarily on nvidia (#4897)	11 months ago
filetype.go	d6f692ad1a Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322)	11 months ago
ggla.go	171eb040fc simplify safetensors reading	11 months ago
ggml.go	6fd04ca922 Improve multi-gpu handling at the limit	10 months ago
gguf.go	7bdcd1da94 Revert "Merge pull request #4938 from ollama/mxyng/fix-byte-order"	10 months ago
llm.go	829ff87bd1 revert tokenize ffi (#4761)	11 months ago
llm_darwin_amd64.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago
llm_darwin_arm64.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago
llm_linux.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago
llm_windows.go	058f6cd2cc Move nested payloads to installer and zip file on windows	1 year ago
memory.go	17df6520c8 Remove mmap related output calc logic	10 months ago
memory_test.go	6f351bf586 review comments and coverage	10 months ago
payload.go	6f351bf586 review comments and coverage	10 months ago
server.go	da3bf23354 Workaround gfx900 SDMA bugs	10 months ago
status.go	58d95cc9bd Switch back to subprocessing for llama.cpp	1 year ago