frob
|
4df98f3eb5
Move cgroups fix out of AMD section. (#9072)
|
hai 2 meses |
Blake Mizerany
|
348b3e0983
server/internal: copy bmizerany/ollama-go to internal package (#9294)
|
hai 2 meses |
Parth Sareen
|
0b7e1676eb
sample: add sampling package for new engine (#8410)
|
hai 2 meses |
Parth Sareen
|
314573bfe8
config: allow setting context length through env var (#8938)
|
hai 2 meses |
Blake Mizerany
|
4604b10306
go.mod: bump to go1.24 (#9242)
|
hai 2 meses |
Jeffrey Morgan
|
8c13cfa4dd
ml/backend/ggml: fix crash on windows paths with wide characters (#9305)
|
hai 2 meses |
Jeffrey Morgan
|
7cfd4aee4d
docs: add additional ROCm docs for building (#9066)
|
hai 2 meses |
Blake Mizerany
|
68bac1e0a6
server: group routes by category and purpose (#9270)
|
hai 2 meses |
Jesse Gross
|
f53f4198c3
ml: Abstract attention out of model definitions
|
hai 2 meses |
Michael Yang
|
2192a28eed
ml/backend/ggml: fix rms norm
|
hai 2 meses |
Junyan Qin (Chin)
|
5d81c1a184
docs: add `RockChinQ/LangBot` to integrations list (#9272)
|
hai 2 meses |
Jesse Gross
|
5c5535c064
models: Prune unused outputs earlier in the forward pass
|
hai 2 meses |
Jesse Gross
|
e5bcc51ae1
ggml-backend: Don't recreate the scheduler for each context
|
hai 2 meses |
Jesse Gross
|
bd6a7d5e64
ollamarunner: Pass runner performance parameters to backends
|
hai 2 meses |
Bruce MacDonald
|
14b5a9a150
api: document client stream behavior with a test (#8996)
|
hai 2 meses |
Michael Yang
|
ba9ec3d05e
ci: use clang for windows cpu builds
|
hai 2 meses |
frob
|
7c168b08c9
server: add missing function parens to debug log (#9255)
|
hai 2 meses |
danielekp
|
3d4cc7833c
docs: Add yla to community integrations
|
hai 2 meses |
Lucas Hahn
|
351a85d9ea
openai: add 'timeout' to allowable x-stainless headers (#9237)
|
hai 2 meses |
Michael Yang
|
bda4ef6c56
reorder patches
|
hai 2 meses |
Michael Yang
|
1e438b237c
Merge pull request #9203 from ollama/mxyng/sapphirerapids
|
hai 2 meses |
yuiseki
|
d721a02e7d
test: add test cases for ListHandler (#9146)
|
hai 2 meses |
zyxucp
|
778603a818
docs: Add AntSK to Community Integrations (#9214)
|
hai 2 meses |
maninhill
|
3c874df46e
docs: Add MaxKB to Community Integrations (#9212)
|
hai 2 meses |
Jeffrey Morgan
|
d2eb226c91
llama: add patch to fix ggml backend reg on Linux with utf-8 characters in the path (#9159)
|
hai 2 meses |
Michael Yang
|
e13e7c8d94
Merge pull request #9079 from jeremyschlatter/main
|
hai 2 meses |
Jeremy Schlatter
|
78f403ff45
address code review comments
|
hai 2 meses |
Michael Yang
|
5f8c03189e
build: remove backend build for sapphirerapids
|
hai 2 meses |
Michael Yang
|
08a299e1d0
cmake: avoid building intel backends on linux
|
hai 2 meses |
Michael Yang
|
7b5d916a9a
ci: set owner/group in tarball
|
hai 2 meses |