Daniel Hiltgen
|
0a954e5066
Fix stale test logic
|
1 rok temu |
Jeffrey Morgan
|
dfa2f32ca0
unload in critical section (#4187)
|
1 rok temu |
Daniel Hiltgen
|
f56aa20014
Centralize server config handling
|
1 rok temu |
Daniel Hiltgen
|
9a32c514cb
Soften timeouts on sched unit tests
|
1 rok temu |
Daniel Hiltgen
|
d6e3b64582
Fix concurrency for CPU mode
|
1 rok temu |
Jeffrey Morgan
|
00b0699c75
Reload model if `num_gpu` changes (#3920)
|
1 rok temu |
Bryce Reitano
|
36a6daccab
Restructure loading conditional chain
|
1 rok temu |
Bryce Reitano
|
ceb0e26e5e
Provide variable ggml for TestLoad
|
1 rok temu |
Bryce Reitano
|
284e02bed0
Move ggml loading to when we attempt fitting
|
1 rok temu |
Daniel Hiltgen
|
d8851cb7a0
Harden sched TestLoad
|
1 rok temu |
Daniel Hiltgen
|
34b9db5afc
Request and model concurrency
|
1 rok temu |