Michael Yang
|
171eb040fc
simplify safetensors reading
|
11 months ago |
Michael Yang
|
bbbd9f20f3
cleanup
|
11 months ago |
Michael Yang
|
547132e820
bpe pretokenizer
|
11 months ago |
Patrick Devine
|
c8cf0d94ed
llama3 conversion
|
1 year ago |
Patrick Devine
|
14476d48cc
fixes for gguf (#3863)
|
1 year ago |
Michael Yang
|
e74163af4c
fix padding to only return padding
|
1 year ago |
Michael Yang
|
6d53b67c2c
Merge pull request #3663 from ollama/mxyng/fix-padding
|
1 year ago |
Michael Yang
|
969238b19e
fix padding in decode
|
1 year ago |
Patrick Devine
|
9f8691c6c8
Add llama2 / torch models for `ollama create` (#3607)
|
1 year ago |
Michael Yang
|
8b2c10061c
refactor tensor query
|
1 year ago |
Michael Yang
|
d338d70492
refactor model parsing
|
1 year ago |
Patrick Devine
|
5a5efee46b
Add gemma safetensors conversion (#3250)
|
1 year ago |
Patrick Devine
|
1b272d5bcd
change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347)
|
1 year ago |
Michael Yang
|
22f326464e
Merge pull request #3083 from ollama/mxyng/refactor-readseeker
|
1 year ago |
Blake Mizerany
|
6ce37e4d96
llm,readline: use errors.Is instead of simple == check (#3161)
|
1 year ago |
Michael Yang
|
0085297928
refactor readseeker
|
1 year ago |
Michael Yang
|
76bdebbadf
decode ggla
|
1 year ago |
Patrick Devine
|
2c017ca441
Convert Safetensors to an Ollama model (#2824)
|
1 year ago |
Michael Yang
|
949d7b1c48
add gguf file types (#2532)
|
1 year ago |
Michael Yang
|
cd22855ef8
refactor tensor read
|
1 year ago |
Michael Yang
|
eaed6f8c45
add max context length check
|
1 year ago |
Jeffrey Morgan
|
08f1e18965
Offload layers to GPU based on new model size estimates (#1850)
|
1 year ago |
Michael Yang
|
56ffc3023a
remove per-model types
|
1 year ago |
Michael Yang
|
5a5dca13b2
comments
|
1 year ago |
Michael Yang
|
72e7a49aa9
seek instead of copyn
|
1 year ago |
Michael Yang
|
2cb0fa7d40
split from into one or more models
|
1 year ago |
Michael Yang
|
199941cd15
fix: gguf int type
|
1 year ago |
Michael Yang
|
c5e1bbabda
instead of static number of parameters for each model family, get the real number from the tensors (#1022)
|
1 year ago |
Michael Yang
|
125d0a013a
ggufv3
|
1 year ago |
Michael Yang
|
c02c0cd483
starcoder
|
1 year ago |