Jeffrey Morgan
|
e093db92c4
sample: temporarily use grammars for constrained generation in new engine (#9586)
|
hai 1 mes |
Jeffrey Morgan
|
1deafd8254
llama: update vendored code to commit 46e3556 (#8308)
|
hai 3 meses |
Pascal Patry
|
c216850523
llama: parse JSON schema using nlohmann::ordered_json to maintain ordering (#8071)
|
hai 4 meses |
Jeffrey Morgan
|
527cc97899
llama: update vendored code to commit 40c6d79f (#7875)
|
hai 4 meses |
Parth Sareen
|
630e7dc6ff
api: structured outputs - chat endpoint (#7900)
|
hai 4 meses |
Jesse Gross
|
312d9de1d1
llama: Improve error handling
|
hai 6 meses |
Gabe Goodhart
|
f2890a4494
IBM granite/granitemoe architecture support (#6760)
|
hai 6 meses |
Jeffrey Morgan
|
96efd9052f
Re-introduce the `llama` package (#5034)
|
hai 6 meses |