Commit History

Author SHA1 Message Date
  Josh Yan a083852eb5 quantize progress 9 months ago
  Josh Yan 6bab0e2368 lint 9 months ago
  Josh Yan d25f85ede4 on disk copy 10 months ago
  Josh Yan ca0ee1d4fe rm config 10 months ago
  Josh Yan c3b411a515 on disk copy 10 months ago
  Daniel Hiltgen ccd7785859 Merge pull request #5243 from dhiltgen/modelfile_use_mmap 10 months ago
  royjhan 996bb1b85e OpenAI: /v1/models and /v1/models/{model} compatibility (#5007) 10 months ago
  Daniel Hiltgen 97c9e11768 Switch use_mmap to a pointer type 10 months ago
  Daniel Hiltgen 7e7749224c Fix use_mmap parsing for modelfiles 10 months ago
  royjhan fedf71635e Extend api/show and ollama show to return more model info (#4881) 10 months ago
  Daniel Hiltgen 171796791f Adjust mmap logic for cuda windows for faster model load 10 months ago
  royjhan 89c79bec8c Add ModifiedAt Field to /api/show (#5033) 10 months ago
  Patrick Devine c69bc19e46 move OLLAMA_HOST to envconfig (#5009) 10 months ago
  royjhan 4bf1da4944 Separate ListResponse and ModelResponse for api/tags vs api/ps (#4842) 11 months ago
  Michael Yang c895a7d13f some gocritic 11 months ago
  Patrick Devine 6845988807 Ollama `ps` command for showing currently loaded models (#4327) 11 months ago
  Jeffrey Morgan 6602e793c0 Use `--quantize` flag and `quantize` api parameter (#4321) 11 months ago
  Bruce MacDonald c02db93243 omit empty done reason 11 months ago
  Bruce MacDonald cfa84b8470 add done_reason to the api (#4235) 11 months ago
  Jeffrey Morgan d5eec16d23 use model defaults for `num_gqa`, `rope_frequency_base ` and `rope_frequency_scale` (#1983) 11 months ago
  Eli Bendersky d77c1c5f9d api: fill up API documentation (#3596) 1 year ago
  Jackie Li af47413dba Add MarshalJSON to Duration (#3284) 1 year ago
  Patrick Devine 9009bedf13 better checking for OLLAMA_HOST variable (#3661) 1 year ago
  Jeffrey Morgan 993cf8bf55 llm: limit generation to 10x context size to avoid run on generations (#3918) 1 year ago
  Cheng 62be2050dd chore: use errors.New to replace fmt.Errorf will much better (#3789) 1 year ago
  Eli Bendersky ad90b9ab3d api: start adding documentation to package api (#2878) 1 year ago
  Michael Yang 01114b4526 fix: rope 1 year ago
  Michael Yang 9502e5661f cgo quantize 1 year ago
  Michael Yang be517e491c no rope parameters 1 year ago
  Jeffrey Morgan 3b4bab3dc5 Fix embeddings load model behavior (#2848) 1 year ago