Jesse Gross
|
0c220935bd
input: Rename Options to Batch
|
1 month ago |
Jeffrey Morgan
|
da0e345200
ml: use input context for extracting outputs (#9875)
|
1 month ago |
Jesse Gross
|
282bfaaa95
ollamarunner: Use a separate context per multimodal input
|
1 month ago |
Michael Yang
|
5e2e0b46b1
fix: error if image requested without vision model
|
1 month ago |
Jesse Gross
|
a1cda80bcb
model: Update encoder cache to use multimodal input processing handler
|
1 month ago |
Michael Yang
|
7bae7fa5ce
ml/backend/ggml: create tensor on specific backend
|
2 months ago |
Jesse Gross
|
a7e63b82be
ollamarunner: Improve multimodal input handling
|
1 month ago |
Daniel Hiltgen
|
1fdb351c37
New engine: vision models and auto-fallback (#9113)
|
1 month ago |
Jesse Gross
|
854a9195f3
attention: Remove unnecessary contiguous operations
|
2 months ago |
Michael Yang
|
53d2990d9b
model: add bos token if configured
|
2 months ago |
Jesse Gross
|
5c5535c064
models: Prune unused outputs earlier in the forward pass
|
2 months ago |
Jesse Gross
|
ed443a0393
Runner for Ollama engine
|
4 months ago |
Jesse Gross
|
6945617af5
models: Move model into their own directory
|
2 months ago |