Blake Mizerany
|
76e903cf9d
.github/workflows: swap order of go test and golangci-lint (#9389)
|
2 ヶ月 前 |
Jeffrey Morgan
|
a5272130c4
ml/backend/ggml: follow on fixes after updating vendored code (#9388)
|
2 ヶ月 前 |
Jeffrey Morgan
|
d7d7e99662
llama: update llama.cpp vendor code to commit d7cfe1ff (#9356)
|
2 ヶ月 前 |
Gordon Kamer
|
2db96c18e7
readme: add Nichey to community integrations (#9370)
|
2 ヶ月 前 |
Daniel Hiltgen
|
e12af460ed
Add cuda Blackwell architecture for v12 (#9350)
|
2 ヶ月 前 |
Jeffrey Morgan
|
3ad4bc8afe
llama: removed unused 'vendoring' file (#9351)
|
2 ヶ月 前 |
Blake Mizerany
|
0d694793f2
.github: always run tests, and other helpful fixes (#9348)
|
2 ヶ月 前 |
Daniel Hiltgen
|
e91ae3d47d
Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304)
|
2 ヶ月 前 |
José Pekkarinen
|
6ecd7f64ba
docker: upgrade rocm to 6.3.3 (#8211)
|
2 ヶ月 前 |
Chuanhui Liu
|
888855675e
docs: rocm install link (#9346)
|
2 ヶ月 前 |
Michael Yang
|
b16367b4b2
fix: add back bf16 support
|
2 ヶ月 前 |
Pavol Rusnak
|
a499390648
build: support Compute Capability 5.0, 5.2 and 5.3 for CUDA 12.x (#8567)
|
2 ヶ月 前 |
frob
|
4df98f3eb5
Move cgroups fix out of AMD section. (#9072)
|
2 ヶ月 前 |
Blake Mizerany
|
348b3e0983
server/internal: copy bmizerany/ollama-go to internal package (#9294)
|
2 ヶ月 前 |
Parth Sareen
|
0b7e1676eb
sample: add sampling package for new engine (#8410)
|
2 ヶ月 前 |
Parth Sareen
|
314573bfe8
config: allow setting context length through env var (#8938)
|
2 ヶ月 前 |
Blake Mizerany
|
4604b10306
go.mod: bump to go1.24 (#9242)
|
2 ヶ月 前 |
Jeffrey Morgan
|
8c13cfa4dd
ml/backend/ggml: fix crash on windows paths with wide characters (#9305)
|
2 ヶ月 前 |
Jeffrey Morgan
|
7cfd4aee4d
docs: add additional ROCm docs for building (#9066)
|
2 ヶ月 前 |
Blake Mizerany
|
68bac1e0a6
server: group routes by category and purpose (#9270)
|
2 ヶ月 前 |
Jesse Gross
|
f53f4198c3
ml: Abstract attention out of model definitions
|
2 ヶ月 前 |
Michael Yang
|
2192a28eed
ml/backend/ggml: fix rms norm
|
2 ヶ月 前 |
Junyan Qin (Chin)
|
5d81c1a184
docs: add `RockChinQ/LangBot` to integrations list (#9272)
|
2 ヶ月 前 |
Jesse Gross
|
5c5535c064
models: Prune unused outputs earlier in the forward pass
|
2 ヶ月 前 |
Jesse Gross
|
e5bcc51ae1
ggml-backend: Don't recreate the scheduler for each context
|
2 ヶ月 前 |
Jesse Gross
|
bd6a7d5e64
ollamarunner: Pass runner performance parameters to backends
|
2 ヶ月 前 |
Bruce MacDonald
|
14b5a9a150
api: document client stream behavior with a test (#8996)
|
2 ヶ月 前 |
Michael Yang
|
ba9ec3d05e
ci: use clang for windows cpu builds
|
2 ヶ月 前 |
frob
|
7c168b08c9
server: add missing function parens to debug log (#9255)
|
2 ヶ月 前 |
danielekp
|
3d4cc7833c
docs: Add yla to community integrations
|
2 ヶ月 前 |