Commit History

Author SHA1 Message Date
  Blake Mizerany 2099e2d267 CONTRIBUTING: provide clarity on good commit messages, and bad (#9405) 2 months ago
  Bruce MacDonald 0c1041ad85 runner: default to greedy sampler for performance (#9407) 2 months ago
  Parth Sareen c245b0406f sample: remove transforms from greedy sampling (#9377) 2 months ago
  Michael Yang 8b194b7520 kvcache: update tests 2 months ago
  Michael Yang 3e8b8a1933 ml: update Context.Forward interface 2 months ago
  Blake Mizerany 41dc280491 server/internal/registry: implement CloseNotify and Flush (for now) (#9402) 2 months ago
  Michael Yang 53d2990d9b model: add bos token if configured 2 months ago
  Jesse Gross e185c08ad9 go.mod: Use full version for go 1.24.0 2 months ago
  Blake Mizerany 2412adf42b server/internal: replace model delete API with new registry handler. (#9347) 2 months ago
  Steven Hartland be2ac1ed93 docs: fix api examples link (#9360) 2 months ago
  Eries Trisnadi dc13813a03 server: allow vscode-file origins (#9313) 2 months ago
  Michael Yang d6af13efed runner: simplify tensor split parsing 2 months ago
  Michael Yang a59f665235 ml/backend/ggml: fix debug logging 2 months ago
  Daniel Hiltgen 688925aca9 Windows ARM build (#9120) 2 months ago
  Blake Mizerany 76e903cf9d .github/workflows: swap order of go test and golangci-lint (#9389) 2 months ago
  Jeffrey Morgan a5272130c4 ml/backend/ggml: follow on fixes after updating vendored code (#9388) 2 months ago
  Jeffrey Morgan d7d7e99662 llama: update llama.cpp vendor code to commit d7cfe1ff (#9356) 2 months ago
  Gordon Kamer 2db96c18e7 readme: add Nichey to community integrations (#9370) 2 months ago
  Daniel Hiltgen e12af460ed Add cuda Blackwell architecture for v12 (#9350) 2 months ago
  Jeffrey Morgan 3ad4bc8afe llama: removed unused 'vendoring' file (#9351) 2 months ago
  Blake Mizerany 0d694793f2 .github: always run tests, and other helpful fixes (#9348) 2 months ago
  Daniel Hiltgen e91ae3d47d Update ROCm (6.3 linux, 6.2 windows) and CUDA v12.8 (#9304) 2 months ago
  José Pekkarinen 6ecd7f64ba docker: upgrade rocm to 6.3.3 (#8211) 2 months ago
  Chuanhui Liu 888855675e docs: rocm install link (#9346) 2 months ago
  Michael Yang b16367b4b2 fix: add back bf16 support 2 months ago
  Pavol Rusnak a499390648 build: support Compute Capability 5.0, 5.2 and 5.3 for CUDA 12.x (#8567) 2 months ago
  frob 4df98f3eb5 Move cgroups fix out of AMD section. (#9072) 2 months ago
  Blake Mizerany 348b3e0983 server/internal: copy bmizerany/ollama-go to internal package (#9294) 2 months ago
  Parth Sareen 0b7e1676eb sample: add sampling package for new engine (#8410) 2 months ago
  Parth Sareen 314573bfe8 config: allow setting context length through env var (#8938) 2 months ago