Commit History

| Author | SHA1 | Message | Date |
|---|---|---|---|
| jmorganca | aaca2ce093 | wip | 1 year ago |
| jmorganca | 323a3f1f3a | cuda linux | 1 year ago |
| jmorganca | 31e0de825e | disable log file | 1 year ago |
| jmorganca | 878eb9a19f | add llava | 1 year ago |
| jmorganca | 3d656588a7 | `avx2` should only add `avx2` | 1 year ago |
| jmorganca | 4dd63c1fef | move `runner` package down | 1 year ago |
| jmorganca | 82214396b5 | replace static build in `llm` | 1 year ago |
| jmorganca | 491ff41675 | Initial `llama` Go module | 1 year ago |
| jmorganca | 075f2e88d9 | add sync of llama.cpp | 1 year ago |
| Michael Yang | fccf8d179f | partial decode ggml bin for more info | 1 year ago |
| Bruce MacDonald | 984c9c628c | fix embeddings invalid values | 1 year ago |
| Bruce MacDonald | 09d8bf6730 | fix build errors | 1 year ago |
| Bruce MacDonald | 7a5f3616fd | embed text document in modelfile | 1 year ago |
| Michael Yang | f2074ed4c0 | Merge pull request #306 from jmorganca/default-keep-system | 1 year ago |
| Bruce MacDonald | a6f6d18f83 | embed text document in modelfile | 1 year ago |
| Jeffrey Morgan | 5eb712f962 | trim whitespace before checking stop conditions | 1 year ago |
| Michael Yang | 4dc5b117dd | automatically set num_keep if num_keep < 0 | 1 year ago |
| Michael Yang | b9f4d67554 | configurable rope frequency parameters | 1 year ago |
| Michael Yang | c5bcf32823 | update llama.cpp | 1 year ago |
| Michael Yang | 74a5f7e698 | no gpu for 70B model | 1 year ago |
| Michael Yang | 319f078dd9 | remove -Werror | 1 year ago |
| Jeffrey Morgan | 7da249fcc1 | only build metal for `darwin,arm` target | 1 year ago |
| Bruce MacDonald | 184ad8f057 | allow specifying stop conditions in modelfile | 1 year ago |
| Michael Yang | 3549676678 | embed ggml-metal.metal | 1 year ago |
| Michael Yang | fadf75f99d | add stop conditions | 1 year ago |
| Michael Yang | ad3a7d0e2c | add NumGQA | 1 year ago |
| Michael Yang | cca61181cb | sample metrics | 1 year ago |
| Michael Yang | c490416189 | lock on llm.lock(); decrease batch size | 1 year ago |
| Michael Yang | f62a882760 | add session expiration | 1 year ago |
| Michael Yang | 3003fc03fc | update predict code | 1 year ago |