Roy Han
|
eb7cc2d1ce
image embeddings
|
9 miesięcy temu |
Roy Han
|
fb390b8902
embedding type 64
|
9 miesięcy temu |
royjhan
|
b7c622dd32
Merge branch 'main' into royh-batchembed
|
9 miesięcy temu |
Roy Han
|
17de2b4405
Refactoring of legacy and new
|
10 miesięcy temu |
Daniel Hiltgen
|
ccd7785859
Merge pull request #5243 from dhiltgen/modelfile_use_mmap
|
10 miesięcy temu |
royjhan
|
a5f23d766e
Merge branch 'main' into royh-batchembed
|
10 miesięcy temu |
royjhan
|
996bb1b85e
OpenAI: /v1/models and /v1/models/{model} compatibility (#5007)
|
10 miesięcy temu |
Roy Han
|
00a4cb26ca
use float32
|
10 miesięcy temu |
Roy Han
|
aee25acb5b
move normalization to go
|
10 miesięcy temu |
Daniel Hiltgen
|
97c9e11768
Switch use_mmap to a pointer type
|
10 miesięcy temu |
Roy Han
|
80c1a3f812
playing around with truncate stuff
|
10 miesięcy temu |
Roy Han
|
5213c12354
clean up
|
10 miesięcy temu |
Roy Han
|
c406fa7a4c
api/embed draft
|
10 miesięcy temu |
Roy Han
|
ff191d7cba
Initial Draft
|
10 miesięcy temu |
Roy Han
|
0f87628b6d
Revert "Initial Batch Embedding"
|
10 miesięcy temu |
Daniel Hiltgen
|
7e7749224c
Fix use_mmap parsing for modelfiles
|
10 miesięcy temu |
royjhan
|
fedf71635e
Extend api/show and ollama show to return more model info (#4881)
|
10 miesięcy temu |
Roy Han
|
c22d54895a
Initial Batch Embedding
|
10 miesięcy temu |
Daniel Hiltgen
|
171796791f
Adjust mmap logic for cuda windows for faster model load
|
10 miesięcy temu |
royjhan
|
89c79bec8c
Add ModifiedAt Field to /api/show (#5033)
|
10 miesięcy temu |
Patrick Devine
|
c69bc19e46
move OLLAMA_HOST to envconfig (#5009)
|
10 miesięcy temu |
royjhan
|
4bf1da4944
Separate ListResponse and ModelResponse for api/tags vs api/ps (#4842)
|
11 miesięcy temu |
Michael Yang
|
c895a7d13f
some gocritic
|
11 miesięcy temu |
Patrick Devine
|
6845988807
Ollama `ps` command for showing currently loaded models (#4327)
|
11 miesięcy temu |
Jeffrey Morgan
|
6602e793c0
Use `--quantize` flag and `quantize` api parameter (#4321)
|
11 miesięcy temu |
Bruce MacDonald
|
c02db93243
omit empty done reason
|
11 miesięcy temu |
Bruce MacDonald
|
cfa84b8470
add done_reason to the api (#4235)
|
11 miesięcy temu |
Jeffrey Morgan
|
d5eec16d23
use model defaults for `num_gqa`, `rope_frequency_base ` and `rope_frequency_scale` (#1983)
|
11 miesięcy temu |
Eli Bendersky
|
d77c1c5f9d
api: fill up API documentation (#3596)
|
1 rok temu |
Jackie Li
|
af47413dba
Add MarshalJSON to Duration (#3284)
|
1 rok temu |