Jeffrey Morgan
|
e093db92c4
sample: temporarily use grammars for constrained generation in new engine (#9586)
|
1 month ago |
Jeffrey Morgan
|
1deafd8254
llama: update vendored code to commit 46e3556 (#8308)
|
3 months ago |
Jeffrey Morgan
|
527cc97899
llama: update vendored code to commit 40c6d79f (#7875)
|
4 months ago |
Parth Sareen
|
630e7dc6ff
api: structured outputs - chat endpoint (#7900)
|
4 months ago |
Gabe Goodhart
|
f2890a4494
IBM granite/granitemoe architecture support (#6760)
|
6 months ago |
Jeffrey Morgan
|
96efd9052f
Re-introduce the `llama` package (#5034)
|
6 months ago |