Patrick Devine
|
4bed739259
add verbose mode to the show command (#9640)
|
1 month ago |
Patrick Devine
|
80c7ce381b
fix: change default context size for gemma3 (#9744)
|
1 month ago |
Michael Yang
|
ccfd41c4f0
Merge pull request #9742 from ollama/mxyng/engine-error-embeddings
|
1 month ago |
Michael Yang
|
ec46f3286c
engine: error on embeddings; not currently implemented
|
1 month ago |
Michael Yang
|
45a13b1dec
Merge pull request #9688 from Shane-XB-Qian/debug_mistype_lld
|
1 month ago |
Parth Sareen
|
5c0b663969
sample: separate softmax and temperature transforms (#9732)
|
1 month ago |
shane.xb.qian
|
30d7a59ba8
ollama-debug.c: change 'ld' to 'PRIi64'
|
1 month ago |
ParthSareen
|
4aeb67ef4c
sample: do all sorting in topK
|
1 month ago |
ParthSareen
|
3ba91634c1
sample: simplify top_k=0 sorting
|
1 month ago |
ParthSareen
|
1b7433b71e
sample: use container/heap for top_k
|
1 month ago |
Bruce MacDonald
|
a70820daa0
models/gemma3: remove final logit softcap (#9692)
|
1 month ago |
Shane-XB-Qian
|
6b45b1d6b4
cli: adding support ctrl-n/p like general cli (#9136)
|
1 month ago |
shane.xb.qian
|
85ab552028
ollama-debug.c: correct mistype
|
1 month ago |
frob
|
b3af953a55
cli: don't exit for invalid model during /load. (#9576)
|
1 month ago |
Michael
|
ad4e0bf3be
Adding Gemma 3 to readme (#9671)
|
1 month ago |
Michael Yang
|
aee28501b5
Merge pull request #9661 from ollama/gemma
|
1 month ago |
jmorganca
|
83f0ec8269
all: address linter errors
|
1 month ago |
jmorganca
|
c6b6938b3a
kvcache: fix tests by adding AvgPool2D stub
|
1 month ago |
jmorganca
|
fb4664fcec
model: add more spm tokenizer tests
|
1 month ago |
jmorganca
|
20e3593863
model: validate left and right pairs before merging them
|
1 month ago |
Michael Yang
|
63a394068c
use 2d pooling
|
1 month ago |
Daniel Hiltgen
|
ab39e08eb9
llm: auto detect models that require Ollama Engine (#1)
|
1 month ago |
jmorganca
|
11bfa62796
add trailing \n\n after <end_of_image> to match reference implementation
|
1 month ago |
jmorganca
|
f63e62e546
reduce kernel size, add TODO for loading from config
|
1 month ago |
jmorganca
|
65b0f329d1
Revert "Allow models to force a new batch"
|
1 month ago |
Jesse Gross
|
06007c0a18
Allow models to force a new batch
|
1 month ago |
Jesse Gross
|
a8e83a7654
Disable causal attention based on batch index
|
1 month ago |
Jesse Gross
|
475005504e
Restrict Gemma to a single image per request
|
1 month ago |
Jesse Gross
|
2c40c4d35e
Fix follow up images and images split across batches
|
1 month ago |
Michael Yang
|
e95278932b
use non-causal mask only for image positions
|
1 month ago |