Commit history

Author SHA1 Message Date
  Jeffrey Morgan 1deafd8254 llama: update vendored code to commit 46e3556 (#8308) 3 months ago
  Jesse Gross 08a832b482 llama: Ensure KV cache is fully defragmented. 4 months ago
  Jeffrey Morgan 527cc97899 llama: update vendored code to commit 40c6d79f (#7875) 4 months ago
  Daniel Hiltgen 4879a234c4 build: Make target improvements (#7499) 4 months ago
  Sam 1bdab9fdb1 llm: introduce k/v context quantization (vRAM improvements) (#6279) 5 months ago
  ItzCrazyKns e3936d4fb3 Support Multiple LoRa Adapters (#7667) 5 months ago
  Jesse Gross 71e6a0d0d1 runner.go: Don't try to extract image tags for text models 5 months ago
  Jesse Gross 2cd11ae365 runner.go: Add unit tests for context shifting 5 months ago
  Jesse Gross 3478b2cf14 runner.go: Fix deadlock with many concurrent requests 5 months ago
  Daniel Hiltgen b85520bfb9 logs: explain client aborts better (#7783) 5 months ago
  Jesse Gross c4b34f2a2a runner.go: Truncate inputs that exceed context rather than shifting 5 months ago
  Jesse Gross c3ff916431 runner.go: Don't add inputs to cache view until actually processed 5 months ago
  Jesse Gross 3fc1dc0e6f runner.go: Hard fail on errors rather than potentially infinite looping 5 months ago
  Jesse Gross 7121dfa309 runner.go: Retry decoding after defragmentation if needed 5 months ago
  Jesse Gross 5f68fcab12 runner.go: Use correct index when retrieving embedding results 5 months ago
  Jesse Gross d875e99e46 runner.go: Propagate panics back to the user. 5 months ago
  Jesse Gross 8a35bb926e runner.go: Increase survivability of main processing loop 5 months ago
  Jesse Gross c25ffde91d runner.go: Don't trim whitespace from inputs 5 months ago
  Jesse Gross 17b386a891 runner.go: Enforce NUM_PARALLEL directly in the runner 5 months ago
  Michael Yang 549c2bdfcf Merge pull request #7657 from ollama/mxyng/sync 5 months ago
  Michael Yang 5b3393b6a2 fix(mllama): sync backend between batches 5 months ago
  Jesse Gross d7eb05b936 runner.go: Fix off-by-one for num predicted 5 months ago
  Jesse Gross 65973ceb64 runner.go: Make KV entry accounting more robust 5 months ago
  Jesse Gross a909417602 runner.go: Remove unused arguments 6 months ago
  Jesse Gross 312d9de1d1 llama: Improve error handling 6 months ago
  Jesse Gross a103dae01e runner.go: Only allocate 1 element embedding batches for mllama 6 months ago
  Jesse Gross 26acdcf44e runner.go: Don't set cross attention before sending embeddings 6 months ago
  Jesse Gross c826e57475 runner.go: Better abstract vision model integration 6 months ago
  Daniel Hiltgen 712e99d477 Soften windows clang requirement (#7428) 6 months ago
  Jesse Gross de1557a0dc runner.go: Better handle return NULL values from llama.cpp 6 months ago