Michael Yang
|
b732beba6a
lint
|
9 月之前 |
Michael Yang
|
5c1912769e
Merge pull request #5473 from ollama/mxyng/environ
|
9 月之前 |
Daniel Hiltgen
|
345420998e
Prevent partial loading on mixed GPU brands
|
9 月之前 |
Michael Yang
|
85d9d73a72
comments
|
9 月之前 |
Michael Yang
|
0f1910129f
int
|
10 月之前 |
Michael Yang
|
8570c1c0ef
keepalive
|
10 月之前 |
Michael Yang
|
55cd3ddcca
bool
|
10 月之前 |
Jeffrey Morgan
|
791650ddef
sched: only error when over-allocating system memory (#5626)
|
9 月之前 |
Jeffrey Morgan
|
e4ff73297d
server: fix model reloads when setting `OLLAMA_NUM_PARALLEL` (#5560)
|
9 月之前 |
Jeffrey Morgan
|
0ee87615c7
sched: don't error if paging to disk on Windows and macOS (#5523)
|
9 月之前 |
Daniel Hiltgen
|
af28b94533
Merge pull request #5469 from dhiltgen/prevent_system_oom
|
10 月之前 |
Daniel Hiltgen
|
955f2a4e03
Only set default keep_alive on initial model load
|
10 月之前 |
Daniel Hiltgen
|
3c75113e37
Prevent loading models larger than total memory
|
10 月之前 |
Daniel Hiltgen
|
cff3f44f4a
Fix case for NumCtx
|
10 月之前 |
Daniel Hiltgen
|
3518aaef33
Merge pull request #4218 from dhiltgen/auto_parallel
|
10 月之前 |
Blake Mizerany
|
cb42e607c5
llm: speed up gguf decoding by a lot (#5246)
|
10 月之前 |
Daniel Hiltgen
|
9929751cc8
Disable concurrency for AMD + Windows
|
10 月之前 |
Daniel Hiltgen
|
17b7186cd7
Enable concurrency by default
|
1 年之前 |
Daniel Hiltgen
|
6f351bf586
review comments and coverage
|
11 月之前 |
Daniel Hiltgen
|
ff4f0cbd1d
Prevent multiple concurrent loads on the same gpus
|
11 月之前 |
Daniel Hiltgen
|
fc37c192ae
Refine CPU load behavior with system memory visibility
|
11 月之前 |
Daniel Hiltgen
|
434dfe30c5
Reintroduce nvidia nvml library for windows
|
11 月之前 |
Daniel Hiltgen
|
48702dd149
Harden unload for empty runners
|
11 月之前 |
Daniel Hiltgen
|
5e8ff556cb
Support forced spreading for multi GPU
|
11 月之前 |
Michael Yang
|
e40145a39d
lint
|
11 月之前 |
Michael Yang
|
c895a7d13f
some gocritic
|
11 月之前 |
Michael Yang
|
04f3c12bb7
replace x/exp/slices with slices
|
11 月之前 |
Patrick Devine
|
4cc3be3035
Move envconfig and consolidate env vars (#4608)
|
11 月之前 |
Sang Park
|
4434d7f447
Correct typo in error message (#4535)
|
11 月之前 |
Daniel Hiltgen
|
ec231a7923
Remove VRAM convergence check for windows
|
11 月之前 |