Commit History

Author SHA1 Message Date
  Parth Sareen 630e7dc6ff api: structured outputs - chat endpoint (#7900) 5 months ago
  Sam 539be43640 llm: normalise kvct parameter handling (#7926) 5 months ago
  Sam 1bdab9fdb1 llm: introduce k/v context quantization (vRAM improvements) (#6279) 5 months ago
  ItzCrazyKns e3936d4fb3 Support Multiple LoRa Adapters (#7667) 5 months ago
  Daniel Hiltgen b85520bfb9 logs: explain client aborts better (#7783) 5 months ago
  Daniel Hiltgen 909a88c5c0 Improve crash reporting (#7728) 5 months ago
  Daniel Hiltgen 81d55d3e4d fix index out of range on zero layer metal load (#7696) 5 months ago
  Daniel Hiltgen df011054fa Jetpack support for Go server (#7217) 5 months ago
  Jesse Gross a909417602 runner.go: Remove unused arguments 6 months ago
  Jesse Gross de1557a0dc runner.go: Better handle return NULL values from llama.cpp 6 months ago
  Patrick Devine c7cb0f0602 image processing for llama3.2 (#6963) 6 months ago
  Gabe Goodhart f2890a4494 IBM granite/granitemoe architecture support (#6760) 6 months ago
  Daniel Hiltgen 05cd82ef94 Rename gpu package discover (#7143) 6 months ago
  Daniel Hiltgen 24636dfa87 Discovery CPU details for default thread selection (#6264) 6 months ago
  Jesse Gross 03408f3437 server: Don't clear cmd when closing a server 6 months ago
  Jeffrey Morgan 96efd9052f Re-introduce the `llama` package (#5034) 6 months ago
  Daniel Hiltgen cd5c8f6471 Optimize container images for startup (#6547) 7 months ago
  Daniel Hiltgen 4a8069f9c4 Quiet down dockers new lint warnings (#6716) 7 months ago
  Daniel Hiltgen 6719097649 llm: make load time stall duration configurable via OLLAMA_LOAD_TIMEOUT 8 months ago
  Daniel Hiltgen 037a4d103e Log system memory at info (#6617) 8 months ago
  Sean Khatiri 397cae7962 llm: fix typo in comment (#6530) 8 months ago
  Daniel Hiltgen 0f92b19bec Only enable numa on CPUs (#6484) 8 months ago
  Daniel Hiltgen 74d45f0102 Refactor linux packaging 9 months ago
  Jeffrey Morgan 15c2d8fe14 server: parallelize embeddings in API web handler instead of in subprocess runner (#6220) 8 months ago
  Daniel Hiltgen 25906d72d1 llm: prevent loading too large models on windows (#5926) 8 months ago
  Jeffrey Morgan de4fc29773 llm: reserve required number of slots for embeddings (#6219) 9 months ago
  Daniel Hiltgen f457d63400 Implement linux NUMA detection 9 months ago
  Michael Yang b732beba6a lint 9 months ago
  Michael Yang 5c1912769e Merge pull request #5473 from ollama/mxyng/environ 9 months ago
  royjhan 1b44d873e7 Add Metrics to `api\embed` response (#5709) 9 months ago