Commit History

Author SHA1 Message Date
  ParthSareen a4265c278a wip 2 months ago
  Jeffrey Morgan 1deafd8254 llama: update vendored code to commit 46e3556 (#8308) 3 months ago
  Jesse Gross 08a832b482 llama: Ensure KV cache is fully defragmented. 4 months ago
  Jeffrey Morgan 527cc97899 llama: update vendored code to commit 40c6d79f (#7875) 4 months ago
  Daniel Hiltgen 4879a234c4 build: Make target improvements (#7499) 4 months ago
  Sam 1bdab9fdb1 llm: introduce k/v context quantization (vRAM improvements) (#6279) 5 months ago
  ItzCrazyKns e3936d4fb3 Support Multiple LoRa Adapters (#7667) 5 months ago
  Jesse Gross 71e6a0d0d1 runner.go: Don't try to extract image tags for text models 5 months ago
  Jesse Gross 2cd11ae365 runner.go: Add unit tests for context shifting 5 months ago
  Jesse Gross 3478b2cf14 runner.go: Fix deadlock with many concurrent requests 5 months ago
  Daniel Hiltgen b85520bfb9 logs: explain client aborts better (#7783) 5 months ago
  Jesse Gross c4b34f2a2a runner.go: Truncate inputs that exceed context rather than shifting 5 months ago
  Jesse Gross c3ff916431 runner.go: Don't add inputs to cache view until actually processed 5 months ago
  Jesse Gross 3fc1dc0e6f runner.go: Hard fail on errors rather than potentially infinite looping 5 months ago
  Jesse Gross 7121dfa309 runner.go: Retry decoding after defragmentation if needed 5 months ago
  Jesse Gross 5f68fcab12 runner.go: Use correct index when retrieving embedding results 5 months ago
  Jesse Gross d875e99e46 runner.go: Propagate panics back to the user. 5 months ago
  Jesse Gross 8a35bb926e runner.go: Increase survivability of main processing loop 5 months ago
  Jesse Gross c25ffde91d runner.go: Don't trim whitespace from inputs 5 months ago
  Jesse Gross 17b386a891 runner.go: Enforce NUM_PARALLEL directly in the runner 5 months ago
  Michael Yang 549c2bdfcf Merge pull request #7657 from ollama/mxyng/sync 5 months ago
  Michael Yang 5b3393b6a2 fix(mllama): sync backend between batches 5 months ago
  Jesse Gross d7eb05b936 runner.go: Fix off-by-one for num predicted 5 months ago
  Jesse Gross 65973ceb64 runner.go: Make KV entry accounting more robust 5 months ago
  Jesse Gross a909417602 runner.go: Remove unused arguments 6 months ago
  Jesse Gross 312d9de1d1 llama: Improve error handling 6 months ago
  Jesse Gross a103dae01e runner.go: Only allocate 1 element embedding batches for mllama 6 months ago
  Jesse Gross 26acdcf44e runner.go: Don't set cross attention before sending embeddings 6 months ago
  Jesse Gross c826e57475 runner.go: Better abstract vision model integration 6 months ago
  Daniel Hiltgen 712e99d477 Soften windows clang requirement (#7428) 6 months ago