Historique des commits

Auteur SHA1 Message Date
  Michael Yang 949d7b1c48 add gguf file types (#2532) il y a 1 an
  Michael Yang eaed6f8c45 add max context length check il y a 1 an
  Michael Yang 2bb2bdd5d4 fix lint il y a 1 an
  Jeffrey Morgan 08f1e18965 Offload layers to GPU based on new model size estimates (#1850) il y a 1 an
  Bruce MacDonald 811b1f03c8 deprecate ggml il y a 1 an
  Jeffrey Morgan d9a250e9b5 seek to end of file when decoding older model formats il y a 1 an
  Jeffrey Morgan 944519ed16 seek to eof for older model binaries il y a 1 an
  Michael Yang 72e7a49aa9 seek instead of copyn il y a 1 an
  Michael Yang 2cb0fa7d40 split from into one or more models il y a 1 an
  Michael Yang b2816bca67 unnecessary ReadSeeker for DecodeGGML il y a 1 an
  Michael Yang 125d0a013a ggufv3 il y a 1 an
  Michael Yang c02c0cd483 starcoder il y a 1 an
  Bruce MacDonald 86279f4ae3 unbound max num gpu layers (#591) il y a 1 an
  Bruce MacDonald 4cba75efc5 remove tmp directories created by previous servers (#559) il y a 1 an
  Bruce MacDonald 66003e1d05 subprocess improvements (#524) il y a 1 an
  Bruce MacDonald 2540c9181c support for packaging in multiple cuda runners (#509) il y a 1 an
  Michael Yang 7dee25a07f fix falcon decode il y a 1 an
  Bruce MacDonald 09dd2aeff9 GGUF support (#441) il y a 1 an
  Michael Yang b1cececb8e add 34b model type il y a 1 an
  Michael Yang a894cc792d model and file type as strings il y a 1 an
  Michael Yang 6ed991c8e2 ggml: fix off by one error il y a 1 an
  Michael Yang fccf8d179f partial decode ggml bin for more info il y a 1 an