.. |
ext_server
|
0a0e9f3e0f
Apply 01-cache.diff
|
1 year ago |
generate
|
1524f323a3
Revert "build.go: introduce a friendlier way to build Ollama (#3548)" (#3564)
|
1 year ago |
llama.cpp @ 7593639ce3
|
f335722275
update llama.cpp submodule to `7593639` (#3665)
|
1 year ago |
patches
|
0035e31af8
Bump to b2581
|
1 year ago |
ggla.go
|
8b2c10061c
refactor tensor query
|
1 year ago |
ggml.go
|
3397eff0cd
mixtral mem
|
1 year ago |
gguf.go
|
6d53b67c2c
Merge pull request #3663 from ollama/mxyng/fix-padding
|
1 year ago |
llm.go
|
9502e5661f
cgo quantize
|
1 year ago |
llm_darwin_amd64.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
llm_darwin_arm64.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
llm_linux.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
llm_windows.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
payload.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |
server.go
|
41a272de9f
darwin: no partial offloading if required memory greater than system
|
1 year ago |
status.go
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 year ago |