Commit History

Author SHA1 Message Date
  Daniel Hiltgen 30a7d7096c Bump VRAM buffer back up 11 months ago
  Daniel Hiltgen 8727a9c140 Record more GPU information 1 year ago
  Michael Yang 4736391bfb llm: add minimum based on layer size 1 year ago
  Daniel Hiltgen 380378cc80 Use our libraries first 1 year ago
  Daniel Hiltgen af9eb36f9f Merge pull request #4135 from dhiltgen/no_physx 1 year ago
  Daniel Hiltgen 06093fd396 Merge pull request #4067 from dhiltgen/cudart 1 year ago
  Daniel Hiltgen f56aa20014 Centralize server config handling 1 year ago
  Daniel Hiltgen b1ad3a43cb Skip PhysX cudart library 1 year ago
  Daniel Hiltgen 089daaeabc Add CUDA Driver API for GPU discovery 1 year ago
  Daniel Hiltgen 34b9db5afc Request and model concurrency 1 year ago
  Michael Yang 7e33a017c0 partial offloading 1 year ago
  Daniel Hiltgen 1f11b52511 Refined min memory from testing 1 year ago
  Daniel Hiltgen 526d4eb204 Release gpu discovery library after use 1 year ago
  Michael Yang 91b3e4d282 update memory calcualtions 1 year ago
  Jeremy dfc6721b20 add support for libcudart.so for CUDA devices (adds Jetson support) 1 year ago
  Daniel Hiltgen 6c5ccb11f9 Revamp ROCm support 1 year ago
  Daniel Hiltgen be330174dd Allow setting max vram for workarounds 1 year ago
  Daniel Hiltgen 9754c6d9d8 Harden AMD driver lookup logic 1 year ago
  Daniel Hiltgen 6d84f07505 Detect AMD GPU info via sysfs and block old cards 1 year ago
  Daniel Hiltgen 4072b5879b Merge pull request #2246 from dhiltgen/reject_cuda_without_avx 1 year ago
  Daniel Hiltgen 15562e887d Don't disable GPUs on arm without AVX 1 year ago
  Daniel Hiltgen f07f8b7a9e Harden for zero detected GPUs 1 year ago
  Daniel Hiltgen e02ecfb6c8 Merge pull request #2116 from dhiltgen/cc_50_80 1 year ago
  Daniel Hiltgen 667a2ba18a Detect lack of AVX and fallback to CPU mode 1 year ago
  Daniel Hiltgen 9d7b5d6c91 Ignore AMD integrated GPUs 1 year ago
  Daniel Hiltgen 013fd07139 More logging for gpu management 1 year ago
  Daniel Hiltgen 987c16b2f7 Report more information about GPUs in verbose mode 1 year ago
  Daniel Hiltgen a447a083f2 Add compute capability 5.0, 7.5, and 8.0 1 year ago
  Jeffrey Morgan f32ea81b21 increase minimum overhead to 1024MiB (#2114) 1 year ago
  Daniel Hiltgen 681a914990 Add support for CUDA 5.2 cards 1 year ago