Commit History

Author SHA1 Message Date
  Daniel Hiltgen 9b5a3c5991 Merge pull request #3914 from dhiltgen/mac_perf 1 year ago
  Jeffrey Morgan 00b0699c75 Reload model if `num_gpu` changes (#3920) 1 year ago
  Daniel Hiltgen b123be5b71 Adjust context size for parallelism 1 year ago
  Bryce Reitano 36a6daccab Restructure loading conditional chain 1 year ago
  Bryce Reitano 284e02bed0 Move ggml loading to when we attempt fitting 1 year ago
  Daniel Hiltgen 34b9db5afc Request and model concurrency 1 year ago