Patrick Devine
|
4cc3be3035
Move envconfig and consolidate env vars (#4608)
|
11 months ago |
Michael Yang
|
f36f1d6be9
tidy intermediate blobs
|
11 months ago |
Michael Yang
|
3520c0e4d5
cache and reuse intermediate blobs
|
11 months ago |
Patrick Devine
|
ccdf0b2a44
Move the parser back + handle utf16 files (#4533)
|
11 months ago |
Daniel Hiltgen
|
02b31c9dc8
Don't return error on signal exit
|
11 months ago |
Patrick Devine
|
d1692fd3e0
fix the cpu estimatedTotal memory + get the expiry time for loading models (#4461)
|
11 months ago |
Patrick Devine
|
f2cf97d6f1
fix typo in modelfile generation (#4439)
|
11 months ago |
Ryo Machida
|
798b107f19
Fixed the API endpoint /api/tags when the model list is empty. (#4424)
|
11 months ago |
Patrick Devine
|
7ca71a6b0f
don't abort when an invalid model name is used in /save (#4416)
|
11 months ago |
Patrick Devine
|
6845988807
Ollama `ps` command for showing currently loaded models (#4327)
|
11 months ago |
Jeffrey Morgan
|
6602e793c0
Use `--quantize` flag and `quantize` api parameter (#4321)
|
11 months ago |
Michael Yang
|
e03637176d
fix(routes): skip bad manifests
|
11 months ago |
Daniel Hiltgen
|
3ae2f441e0
Fix race in shutdown logic
|
11 months ago |
Daniel Hiltgen
|
8727a9c140
Record more GPU information
|
1 year ago |
Bruce MacDonald
|
cfa84b8470
add done_reason to the api (#4235)
|
11 months ago |
Michael Yang
|
a7ee84fc31
routes: skip invalid filepaths
|
11 months ago |
Jeffrey Morgan
|
d5eec16d23
use model defaults for `num_gqa`, `rope_frequency_base` and `rope_frequency_scale` (#1983)
|
11 months ago |
Bruce MacDonald
|
cef45feaa4
Add preflight OPTIONS handling and update CORS config (#4086)
|
11 months ago |
Bruce MacDonald
|
8cbd3e7510
skip hidden files in list models handler (#4247)
|
1 year ago |
Bruce MacDonald
|
dc9b1111e0
fix invalid destination error message
|
1 year ago |
Michael Yang
|
ffbd3d173f
Merge pull request #3715 from ollama/mxyng/modelname-2
|
1 year ago |
Michael Yang
|
1e0a669f75
Merge pull request #3682 from ollama/mxyng/quantize-all-the-things
|
1 year ago |
Michael Yang
|
548a7df014
update list handler to use model.Name
|
1 year ago |
Jeffrey Morgan
|
39d9d22ca3
close server on receiving signal (#4213)
|
1 year ago |
Michael Yang
|
9685c34509
quantize any fp16/fp32 model
|
1 year ago |
Daniel Hiltgen
|
f56aa20014
Centralize server config handling
|
1 year ago |
Daniel Hiltgen
|
20f6c06569
Make maximum pending request configurable
|
1 year ago |
Michael Yang
|
b7a87a22b6
Merge pull request #4059 from ollama/mxyng/parser-2
|
1 year ago |
Michael Yang
|
e9ae607ece
Merge pull request #3892 from ollama/mxyng/parser
|
1 year ago |
Michael Yang
|
45b6a12e45
server: target invalid
|
1 year ago |