Historique des commits

Auteur SHA1 Message Date
  Michael Yang 58245413f4 next ollama runner (#7913) il y a 2 mois
  Stefan Weil abfdc4710f all: fix typos in documentation, code, and comments (#7021) il y a 4 mois
  Daniel Hiltgen 05cd82ef94 Rename gpu package discover (#7143) il y a 6 mois
  Daniel Hiltgen d632e23fba Add Windows arm64 support to official builds (#5712) il y a 7 mois
  Patrick Devine abed273de3 add "stop" command (#6739) il y a 7 mois
  Michael Yang 77903ab8b4 llama3.1 il y a 9 mois
  Jeffrey Morgan 15c2d8fe14 server: parallelize embeddings in API web handler instead of in subprocess runner (#6220) il y a 8 mois
  Michael Yang b732beba6a lint il y a 9 mois
  Michael Yang df993fa37b comments il y a 9 mois
  Michael Yang 5e9db9fb0b refactor convert il y a 11 mois
  Michael Yang 5c1912769e Merge pull request #5473 from ollama/mxyng/environ il y a 9 mois
  royjhan 1b44d873e7 Add Metrics to `api\embed` response (#5709) il y a 9 mois
  Daniel Hiltgen 345420998e Prevent partial loading on mixed GPU brands il y a 9 mois
  Michael Yang 0f1910129f int il y a 10 mois
  Jeffrey Morgan 80ee9b5e47 Remove out of space test temporarily (#5825) il y a 9 mois
  Daniel Hiltgen 06e5d74e34 Merge pull request #5506 from dhiltgen/sched_tests il y a 9 mois
  royjhan b9f5e16c80 Introduce `/api/embed` endpoint supporting batch embedding (#5127) il y a 9 mois
  Daniel Hiltgen f4408219e9 Refine scheduler unit tests for reliability il y a 10 mois
  Daniel Hiltgen af28b94533 Merge pull request #5469 from dhiltgen/prevent_system_oom il y a 10 mois
  Daniel Hiltgen 955f2a4e03 Only set default keep_alive on initial model load il y a 10 mois
  Daniel Hiltgen 3c75113e37 Prevent loading models larger than total memory il y a 10 mois
  Daniel Hiltgen 3518aaef33 Merge pull request #4218 from dhiltgen/auto_parallel il y a 10 mois
  Blake Mizerany cb42e607c5 llm: speed up gguf decoding by a lot (#5246) il y a 10 mois
  Daniel Hiltgen 17b7186cd7 Enable concurrency by default il y a 1 an
  Daniel Hiltgen 45cacbaf05 Merge pull request #4517 from dhiltgen/gpu_incremental il y a 10 mois
  Daniel Hiltgen 6f351bf586 review comments and coverage il y a 11 mois
  Daniel Hiltgen fc37c192ae Refine CPU load behavior with system memory visibility il y a 11 mois
  Daniel Hiltgen 6fd04ca922 Improve multi-gpu handling at the limit il y a 11 mois
  Jeffrey Morgan dd7c9ebeaf server: longer timeout in `TestRequests` (#5046) il y a 10 mois
  Michael Yang e40145a39d lint il y a 11 mois