Jesse Gross
|
2d6eac9084
kvcache: Optimize sliding window attention
|
1 month ago |
Jesse Gross
|
3ed7ad3ab3
kvcache: Pass granular cache size into implementations
|
1 month ago |
Jesse Gross
|
0c220935bd
input: Rename Options to Batch
|
1 month ago |
jmorganca
|
c6b6938b3a
kvcache: fix tests by adding AvgPool2D stub
|
1 month ago |
Jesse Gross
|
0e886595bf
Fix tests and drift from main
|
1 month ago |
Jesse Gross
|
4346c2409d
fix drift from main
|
1 month ago |
Patrick Devine
|
5f74d1fd47
gemma2 impl
|
2 months ago |
Jesse Gross
|
a1cda80bcb
model: Update encoder cache to use multimodal input processing handler
|
1 month ago |
Michael Yang
|
58b9ec1f6b
kvcache: update tests
|
2 months ago |
Jesse Gross
|
ee141cc821
ml: Empty tensor constructor for tensors
|
2 months ago |
Michael Yang
|
8b194b7520
kvcache: update tests
|
2 months ago |
Michael Yang
|
3e8b8a1933
ml: update Context.Forward interface
|
2 months ago |
Daniel Hiltgen
|
df2680b4b9
Wire up system info log for new engine (#9123)
|
2 months ago |
Jesse Gross
|
ed443a0393
Runner for Ollama engine
|
4 months ago |