Historial de Commits

Autor SHA1 Mensaje Fecha
  Jeffrey Morgan 1deafd8254 llama: update vendored code to commit 46e3556 (#8308) hace 3 meses
  Jesse Gross 08a832b482 llama: Ensure KV cache is fully defragmented. hace 4 meses
  Jeffrey Morgan 527cc97899 llama: update vendored code to commit 40c6d79f (#7875) hace 4 meses
  Daniel Hiltgen 4879a234c4 build: Make target improvements (#7499) hace 4 meses
  Sam 1bdab9fdb1 llm: introduce k/v context quantization (vRAM improvements) (#6279) hace 5 meses
  ItzCrazyKns e3936d4fb3 Support Multiple LoRa Adapters (#7667) hace 5 meses
  Jesse Gross 71e6a0d0d1 runner.go: Don't try to extract image tags for text models hace 5 meses
  Jesse Gross 2cd11ae365 runner.go: Add unit tests for context shifting hace 5 meses
  Jesse Gross 3478b2cf14 runner.go: Fix deadlock with many concurrent requests hace 5 meses
  Daniel Hiltgen b85520bfb9 logs: explain client aborts better (#7783) hace 5 meses
  Jesse Gross c4b34f2a2a runner.go: Truncate inputs that exceed context rather than shifting hace 5 meses
  Jesse Gross c3ff916431 runner.go: Don't add inputs to cache view until actually processed hace 5 meses
  Jesse Gross 3fc1dc0e6f runner.go: Hard fail on errors rather than potentially infinite looping hace 5 meses
  Jesse Gross 7121dfa309 runner.go: Retry decoding after defragmentation if needed hace 5 meses
  Jesse Gross 5f68fcab12 runner.go: Use correct index when retrieving embedding results hace 5 meses
  Jesse Gross d875e99e46 runner.go: Propagate panics back to the user. hace 5 meses
  Jesse Gross 8a35bb926e runner.go: Increase survivability of main processing loop hace 5 meses
  Jesse Gross c25ffde91d runner.go: Don't trim whitespace from inputs hace 5 meses
  Jesse Gross 17b386a891 runner.go: Enforce NUM_PARALLEL directly in the runner hace 5 meses
  Michael Yang 549c2bdfcf Merge pull request #7657 from ollama/mxyng/sync hace 5 meses
  Michael Yang 5b3393b6a2 fix(mllama): sync backend between batches hace 5 meses
  Jesse Gross d7eb05b936 runner.go: Fix off-by-one for num predicted hace 5 meses
  Jesse Gross 65973ceb64 runner.go: Make KV entry accounting more robust hace 5 meses
  Jesse Gross a909417602 runner.go: Remove unused arguments hace 6 meses
  Jesse Gross 312d9de1d1 llama: Improve error handling hace 6 meses
  Jesse Gross a103dae01e runner.go: Only allocate 1 element embedding batches for mllama hace 6 meses
  Jesse Gross 26acdcf44e runner.go: Don't set cross attention before sending embeddings hace 6 meses
  Jesse Gross c826e57475 runner.go: Better abstract vision model integration hace 6 meses
  Daniel Hiltgen 712e99d477 Soften windows clang requirement (#7428) hace 6 meses
  Jesse Gross de1557a0dc runner.go: Better handle return NULL values from llama.cpp hace 6 meses