提交歷史

作者 SHA1 備註 提交日期
  Jeffrey Morgan be721ca0df add more search paths for cuda libs 1 年之前
  Jeffrey Morgan 34344d801c clean up cmake `build` directory when cross compiling macOS builds 1 年之前
  Robin Glauser e868c8a5c7 Update api.md (#1878) 1 年之前
  Jeffrey Morgan c336693f07 calculate overhead based number of gpu devices (#1875) 1 年之前
  Daniel Hiltgen e89dc1d54b Merge pull request #1874 from dhiltgen/correct_cuda_min 1 年之前
  Daniel Hiltgen 1961a81f03 Set corret CUDA minimum compute capability version 1 年之前
  Jeffrey Morgan 8a8c7e7f8d only build for metal on `arm64` 1 年之前
  Jeffrey Morgan 6df83e6daa update rough cuda overhead estimate to 15% + 384MiB 1 年之前
  Michael Yang 62023177f6 Merge pull request #1614 from jmorganca/mxyng/fix-set-template 1 年之前
  Jeffrey Morgan 6164f378f2 revert cuda overhead to 20% 1 年之前
  Jeffrey Morgan f387e9631b use runner if cuda alloc won't fit 1 年之前
  Jeffrey Morgan 6566387ae3 add `TODO` for cuda overhead 1 年之前
  Jeffrey Morgan 37708931fb update cuda overhead to 20% to fix crashes when switching between models and large context sizes 1 年之前
  Jeffrey Morgan f6cb0a553c update cuda overhead to 15% or 400MiB 1 年之前
  Jeffrey Morgan 2680078c13 fix build on linux 1 年之前
  Jeffrey Morgan f1b7e5f560 update overhead to 15% 1 年之前
  Jeffrey Morgan cb534e6ac2 use 10% vram overhead for cuda 1 年之前
  Jeffrey Morgan 58ce2d8273 better estimate scratch buffer size 1 年之前
  Jeffrey Morgan 18ddf6d57d fix windows build 1 年之前
  Michael Yang 61e6502449 Merge pull request #1818 from jmorganca/mxyng/fix-alt-prompt 1 年之前
  Jeffrey Morgan 08f1e18965 Offload layers to GPU based on new model size estimates (#1850) 1 年之前
  Bruce MacDonald 7e8f7c8358 remove ggml automatic re-pull (#1856) 1 年之前
  Bruce MacDonald 3f3eb19a3b document response in modelfile template variables (#1428) 1 年之前
  Daniel Hiltgen 059ae4585e Merge pull request #1834 from dhiltgen/old_cuda 1 年之前
  Daniel Hiltgen 6347f501ca Merge pull request #1828 from dhiltgen/fix_llava 1 年之前
  Jeffrey Morgan 5feec959ad dont use `-Wall` in static build (#1833) 1 年之前
  Jeffrey Morgan dbdd50b283 add `-DCMAKE_SYSTEM_NAME=Darwin` cmake flag (#1832) 1 年之前
  Daniel Hiltgen d74ce6bd4f Detect very old CUDA GPUs and fall back to CPU 1 年之前
  Guilherme Baptista 57942b4676 Update README.md - Community Integrations - Ollama for Ruby (#1830) 1 年之前
  Daniel Hiltgen e0d05b0f1e Accept windows paths for image processing 1 年之前