OpenSource/ollama

Autor	SHA1 Mensaxe	Data
Michael Yang	74bd09652d ml/backend/ggml: load tensors in 32KiB chunks	hai 1 mes
Bruce MacDonald	df94175a0f ggml: return error on failure to read tensor data (#9872)	hai 1 mes
Michael Yang	021dcf089d Merge pull request #9824 from ollama/mxyng/sched	hai 1 mes
Jeffrey Morgan	364629b8d6 ml/backend/ggml: allocate memory with malloc when loading model (#9822)	hai 1 mes
Michael Yang	4561fff36e conditionally enable parallel pipelines	hai 1 mes
Michael Yang	63a394068c use 2d pooling	hai 1 mes
Michael Yang	c5cbe4fc2a fallback to cpu	hai 1 mes
Michael Yang	9e4642e9b3 ollama debug tensor	hai 1 mes
Michael Yang	6b0486c216 duplicate token_embd to output	hai 1 mes
Michael Yang	8934324b72 use fast attention	hai 1 mes
Michael Yang	0df1800436 set non-causal attention	hai 1 mes
Michael Yang	4b037a97dc add gemma vision encoder	hai 1 mes
Patrick Devine	5f74d1fd47 gemma2 impl	hai 2 meses
Jesse Gross	4100ed7bdd ml: Add support for quantized KV cache	hai 2 meses
Jesse Gross	25f9b152f9 ggml-backend: Ensure allocation meet backend requirements	hai 1 mes
Jesse Gross	98272fbd58 additional review comments	hai 1 mes
Michael Yang	b27e8f3f10 ml/backend/ggml: use backend buffer type	hai 1 mes
Michael Yang	45df786f09 comments	hai 1 mes
Michael Yang	daaf42e4a4 ml/backend/ggml: clean up	hai 2 meses
Michael Yang	2dc60d4620 ml/backend/ggml: offload vision to cpu	hai 2 meses
Michael Yang	b5312f30e8 ml/backend/ggml: handle tensor split	hai 2 meses
Michael Yang	26c2e0bd35 ml/backend/ggml: handle user specified cpu offloading	hai 2 meses
Michael Yang	bf920883d5 ml/backend/ggml: set cpu n_threads	hai 2 meses
Michael Yang	7bae7fa5ce ml/backend/ggml: create tensor on specific backend	hai 2 meses
Michael Yang	764e199d67 kvcache: create cache ctx per layer	hai 2 meses
Michael Yang	bfce55db3d model: load non-repeated tensors into multiple backends	hai 2 meses
Michael Yang	bab6f34dc0 ml/backend/ggml: update model loading for hybrid/multi backends	hai 2 meses
Michael Yang	05a01fdecb ml/backend/ggml: consolidate system info logging	hai 2 meses
Jesse Gross	21aa666a1e ml: Enable support for flash attention	hai 2 meses
Jesse Gross	ee141cc821 ml: Empty tensor constructor for tensors	hai 2 meses

Posterior Anterior

Commit History Buscar

Commit History