Commit History

Author SHA1 Message Date
  Stefan Weil abfdc4710f all: fix typos in documentation, code, and comments (#7021) 4 months ago
  Sam 1bdab9fdb1 llm: introduce k/v context quantization (vRAM improvements) (#6279) 5 months ago
  Daniel Hiltgen 05cd82ef94 Rename gpu package discover (#7143) 6 months ago
  Michael Yang 77903ab8b4 llama3.1 9 months ago
  Michael Yang b732beba6a lint 9 months ago
  Michael Yang df993fa37b comments 9 months ago
  Michael Yang 5e9db9fb0b refactor convert 11 months ago
  Michael Yang 35b89b2eab rfc: dynamic environ lookup 10 months ago
  Blake Mizerany cb42e607c5 llm: speed up gguf decoding by a lot (#5246) 10 months ago
  Daniel Hiltgen 6f351bf586 review comments and coverage 11 months ago
  Daniel Hiltgen 6fd04ca922 Improve multi-gpu handling at the limit 11 months ago