Commit History

Автор SHA1 Съобщение Дата
  Jeffrey Morgan 15c2d8fe14 server: parallelize embeddings in API web handler instead of in subprocess runner (#6220) преди 9 месеца
  Daniel Hiltgen 25906d72d1 llm: prevent loading too large models on windows (#5926) преди 9 месеца
  Jeffrey Morgan de4fc29773 llm: reserve required number of slots for embeddings (#6219) преди 9 месеца
  Daniel Hiltgen f457d63400 Implement linux NUMA detection преди 9 месеца
  Michael Yang b732beba6a lint преди 9 месеца
  Michael Yang 5c1912769e Merge pull request #5473 from ollama/mxyng/environ преди 9 месеца
  royjhan 1b44d873e7 Add Metrics to `api\embed` response (#5709) преди 9 месеца
  Tibor Schmidt f3d7a481b7 feat: add support for min_p (resolve #1142) (#1825) преди 9 месеца
  Daniel Hiltgen e12fff8810 Enable windows error dialog for subprocess startup преди 10 месеца
  Michael Yang e2c3f6b3e2 string преди 10 месеца
  Michael Yang 55cd3ddcca bool преди 10 месеца
  Michael Yang 35b89b2eab rfc: dynamic environ lookup преди 10 месеца
  Daniel Hiltgen a3c20e3f18 Refine error reporting for subprocess crash преди 9 месеца
  Daniel Hiltgen 283948c83b Adjust windows ROCm discovery преди 9 месеца
  royjhan b9f5e16c80 Introduce `/api/embed` endpoint supporting batch embedding (#5127) преди 10 месеца
  Jeffrey Morgan ef98803d63 llm: looser checks for minimum memory (#5677) преди 10 месеца
  Jeffrey Morgan c4cf8ad559 llm: avoid loading model if system memory is too small (#5637) преди 10 месеца
  Jeffrey Morgan 791650ddef sched: only error when over-allocating system memory (#5626) преди 10 месеца
  Daniel Hiltgen 22c81f62ec Remove duplicate merge glitch преди 10 месеца
  Michael Yang 9bbddc37a7 Merge pull request #5126 from ollama/mxyng/messages преди 10 месеца
  Jeffrey Morgan 53da2c6965 llm: remove ambiguous comment when putting upper limit on predictions to avoid infinite generation (#5535) преди 10 месеца
  Michael Yang ac7a842e55 fix model reloading преди 10 месеца
  Daniel Hiltgen ccd7785859 Merge pull request #5243 from dhiltgen/modelfile_use_mmap преди 10 месеца
  Daniel Hiltgen 0e982bc1f4 Fix corner cases on tmp cleaner on mac преди 10 месеца
  Josh Yan 33a65e3ba3 error преди 10 месеца
  Daniel Hiltgen 97c9e11768 Switch use_mmap to a pointer type преди 10 месеца
  Daniel Hiltgen 3518aaef33 Merge pull request #4218 from dhiltgen/auto_parallel преди 10 месеца
  Blake Mizerany cb42e607c5 llm: speed up gguf decoding by a lot (#5246) преди 10 месеца
  Daniel Hiltgen 17b7186cd7 Enable concurrency by default преди 1 година
  Daniel Hiltgen 5bf5aeec01 Refine mmap default logic on linux преди 10 месеца