Author | SHA1 Message | Date |
---|---|---|
|
9b5a3c5991 Merge pull request #3914 from dhiltgen/mac_perf | 1 year ago |
|
00b0699c75 Reload model if `num_gpu` changes (#3920) | 1 year ago |
|
b123be5b71 Adjust context size for parallelism | 1 year ago |
|
36a6daccab Restructure loading conditional chain | 1 year ago |
|
284e02bed0 Move ggml loading to when we attempt fitting | 1 year ago |
|
34b9db5afc Request and model concurrency | 1 year ago |