Patrick Devine
|
790d24eb7b
add show command (#474)
|
1 年之前 |
Michael Yang
|
681f3c4c42
fix num_keep
|
1 年之前 |
Michael Yang
|
eeb40a672c
fix list models for windows
|
1 年之前 |
Michael Yang
|
0f541a0367
s/ListResponseModel/ModelResponse/
|
1 年之前 |
Bruce MacDonald
|
42998d797d
subprocess llama.cpp server (#401)
|
1 年之前 |
Patrick Devine
|
8bbff2df98
add model IDs (#439)
|
1 年之前 |
Michael Yang
|
95187d7e1e
build release mode
|
1 年之前 |
Jeffrey Morgan
|
a9f6c56652
fix `FROM` instruction erroring when referring to a file
|
1 年之前 |
Ryan Baker
|
0a892419ad
Strip protocol from model path (#377)
|
1 年之前 |
Bruce MacDonald
|
326de48930
use loaded llm for embeddings
|
1 年之前 |
Patrick Devine
|
d9cf18e28d
add maximum retries when pushing (#334)
|
1 年之前 |
Michael Yang
|
6517bcc53c
Merge pull request #290 from jmorganca/add-adapter-layers
|
1 年之前 |
Michael Yang
|
6a6828bddf
Merge pull request #167 from jmorganca/decode-ggml
|
1 年之前 |
Jeffrey Morgan
|
040a5b9750
clean up cli flags
|
1 年之前 |
Michael Yang
|
6de5d032e1
implement loading ggml lora adapters through the modelfile
|
1 年之前 |
Michael Yang
|
fccf8d179f
partial decode ggml bin for more info
|
1 年之前 |
Bruce MacDonald
|
4b3507f036
embeddings endpoint
|
1 年之前 |
Bruce MacDonald
|
868e3b31c7
allow for concurrent pulls of the same files
|
1 年之前 |
Bruce MacDonald
|
09d8bf6730
fix build errors
|
1 年之前 |
Bruce MacDonald
|
7a5f3616fd
embed text document in modelfile
|
1 年之前 |
Jeffrey Morgan
|
cff002b824
use content type `application/x-ndjson` for streaming responses
|
1 年之前 |
Jeffrey Morgan
|
a027a7dd65
add `0.0.0.0` as an allowed origin by default
|
1 年之前 |
Bruce MacDonald
|
21ddcaa1f1
pr comments
|
1 年之前 |
Michael Yang
|
f2074ed4c0
Merge pull request #306 from jmorganca/default-keep-system
|
1 年之前 |
Bruce MacDonald
|
a6f6d18f83
embed text document in modelfile
|
1 年之前 |
Michael Yang
|
4dc5b117dd
automatically set num_keep if num_keep < 0
|
1 年之前 |
cmiller01
|
fb593b7bfc
pass flags to `serve` to allow setting allowed-origins + host and port
|
1 年之前 |
Jeffrey Morgan
|
e3fb1fd3f1
server: compare options correctly
|
1 年之前 |
Bruce MacDonald
|
8b1e791820
allow specifying zero values in modelfile
|
1 年之前 |
Jeffrey Morgan
|
03cff3a225
server: reset digest at end of generate
|
1 年之前 |