Michael Yang
|
b9495ea162
load projectors
|
1 éve |
Bruce MacDonald
|
195e3d9dbd
chat api endpoint (#1392)
|
1 éve |
Jeffrey Morgan
|
00d06619a1
Revert "chat api (#991)" while context variable is fixed
|
1 éve |
Bruce MacDonald
|
7a0899d62d
chat api (#991)
|
1 éve |
Jing Zhang
|
82b9b329ff
windows CUDA support (#1262)
|
1 éve |
Jeffrey Morgan
|
a3fcecf943
only set `main_gpu` if value > 0 is provided
|
1 éve |
Purinda Gunasekara
|
be61a81758
main-gpu argument is not getting passed to llamacpp, fixed. (#1192)
|
1 éve |
Jeffrey Morgan
|
36a3bbf65f
Update llm/llama.go
|
1 éve |
Bruce MacDonald
|
43a726149d
fix potentially inaccurate error message
|
1 éve |
Jeffrey Morgan
|
41434a7cdc
build intel mac with correct binary and compile flags
|
1 éve |
Jeffrey Morgan
|
5cba29b9d6
JSON mode: add `"format" as an api parameter (#1051)
|
1 éve |
Bruce MacDonald
|
1ae84bc2a2
skip gpu if less than 2GB VRAM are available (#1059)
|
1 éve |
Jeffrey Morgan
|
c44b619428
remove unused `fmt.Println`
|
1 éve |
Jeffrey Morgan
|
17678b7225
Restore system prompt on requests and default `num_keep` to `0`
|
1 éve |
Jeffrey Morgan
|
2e53704685
default rope params to 0 for new models (#968)
|
1 éve |
Michael Yang
|
642128b75a
append LD_LIBRARY_PATH
|
1 éve |
Bruce MacDonald
|
6d283882b1
catch insufficient permissions nvidia err (#934)
|
1 éve |
Bruce MacDonald
|
2665f3c28e
offload 75% of available vram to improve stability (#921)
|
1 éve |
Jeffrey Morgan
|
7ed5a39bc7
simpler check for model loading compatibility errors
|
1 éve |
Jeffrey Morgan
|
a7dad24d92
add error for `falcon` and `starcoder` vocab compatibility (#844)
|
1 éve |
Michael Yang
|
235e43d7f6
Merge pull request #833 from discovertomorrow/leadingspace
|
1 éve |
Arne Müller
|
730996e530
use TrimPrefix instead of TrimLeft
|
1 éve |
Arne Müller
|
ce6197a8e0
removed redundant strings.CutPrefix from Decode
|
1 éve |
Arne Müller
|
46b9953f32
use strings.TrimLeft to remove spaces
|
1 éve |
Bruce MacDonald
|
565648f3f7
relay CUDA errors to the client (#825)
|
1 éve |
Arne Müller
|
90c49bed57
moved removal of leading space into Predict
|
1 éve |
Arne Müller
|
5dc0cff459
fix whitespace removal
|
1 éve |
Michael Yang
|
b36b0b71f8
use cut prefix
|
1 éve |
Michael Yang
|
094df37563
remove unused struct
|
1 éve |
Bruce MacDonald
|
bd93a94abd
fix MB VRAM log output (#824)
|
1 éve |