Commit History

Author        SHA1        Message                                                   Date
Jesse Gross   854a9195f3  attention: Remove unnecessary contiguous operations       2 months ago
Michael Yang  53d2990d9b  model: add bos token if configured                        2 months ago
Jesse Gross   f53f4198c3  ml: Abstract attention out of model definitions           2 months ago
Jesse Gross   5c5535c064  models: Prune unused outputs earlier in the forward pass  2 months ago
Jesse Gross   ed443a0393  Runner for Ollama engine                                  4 months ago
Jesse Gross   6945617af5  models: Move model into their own directory               2 months ago