Historique des commits

Auteur SHA1 Message Date
  Jeffrey Morgan a3fcecf943 only set `main_gpu` if value > 0 is provided il y a 1 an
  Purinda Gunasekara be61a81758 main-gpu argument is not getting passed to llamacpp, fixed. (#1192) il y a 1 an
  Jeffrey Morgan 36a3bbf65f Update llm/llama.go il y a 1 an
  Bruce MacDonald 43a726149d fix potentially inaccurate error message il y a 1 an
  Jeffrey Morgan 41434a7cdc build intel mac with correct binary and compile flags il y a 1 an
  Jeffrey Morgan 5cba29b9d6 JSON mode: add `"format" as an api parameter (#1051) il y a 1 an
  Bruce MacDonald 1ae84bc2a2 skip gpu if less than 2GB VRAM are available (#1059) il y a 1 an
  Jeffrey Morgan c44b619428 remove unused `fmt.Println` il y a 1 an
  Jeffrey Morgan 17678b7225 Restore system prompt on requests and default `num_keep` to `0` il y a 1 an
  Jeffrey Morgan 2e53704685 default rope params to 0 for new models (#968) il y a 1 an
  Michael Yang 642128b75a append LD_LIBRARY_PATH il y a 1 an
  Bruce MacDonald 6d283882b1 catch insufficient permissions nvidia err (#934) il y a 1 an
  Bruce MacDonald 2665f3c28e offload 75% of available vram to improve stability (#921) il y a 1 an
  Jeffrey Morgan 7ed5a39bc7 simpler check for model loading compatibility errors il y a 1 an
  Jeffrey Morgan a7dad24d92 add error for `falcon` and `starcoder` vocab compatibility (#844) il y a 1 an
  Michael Yang 235e43d7f6 Merge pull request #833 from discovertomorrow/leadingspace il y a 1 an
  Arne Müller 730996e530 use TrimPrefix instead of TrimLeft il y a 1 an
  Arne Müller ce6197a8e0 removed redundant strings.CutPrefix from Decode il y a 1 an
  Arne Müller 46b9953f32 use strings.TrimLeft to remove spaces il y a 1 an
  Bruce MacDonald 565648f3f7 relay CUDA errors to the client (#825) il y a 1 an
  Arne Müller 90c49bed57 moved removal of leading space into Predict il y a 1 an
  Arne Müller 5dc0cff459 fix whitespace removal il y a 1 an
  Michael Yang b36b0b71f8 use cut prefix il y a 1 an
  Michael Yang 094df37563 remove unused struct il y a 1 an
  Bruce MacDonald bd93a94abd fix MB VRAM log output (#824) il y a 1 an
  Michael Yang f55bdb6f10 Merge pull request #799 from deichbewohner/jsonmarshaling il y a 1 an
  Michael Yang 2870a9bfc8 Merge pull request #812 from jmorganca/mxyng/fix-format-string il y a 1 an
  Arne Müller 8fa3f366ad Removed newline trimming and used buffer directly in POST request. il y a 1 an
  Michael Yang fddb303f23 fix: format string wrong type il y a 1 an
  Michael Yang cb4a80b693 fix: regression unsupported metal types il y a 1 an