Josh Yan
|
cc17b02b23
update
|
8 月之前 |
Josh Yan
|
73d69bc90b
remove types
|
8 月之前 |
Josh Yan
|
07c0f66f5e
rm print
|
9 月之前 |
Josh Yan
|
4a7bfca902
change progress msg
|
9 月之前 |
Josh Yan
|
04f2154505
fixed cgo
|
9 月之前 |
Josh Yan
|
de9b21b472
quantize progress
|
9 月之前 |
Michael Yang
|
b732beba6a
lint
|
9 月之前 |
Josh
|
10e768826c
fix: quant err message (#5616)
|
9 月之前 |
Daniel Hiltgen
|
b51e3b63ac
Statically link c++ and thread lib
|
9 月之前 |
jmorganca
|
a08f20d910
release: remove unwanted mingw dll.a files
|
10 月之前 |
jmorganca
|
6cea036027
Revert "llm: only statically link libstdc++"
|
10 月之前 |
jmorganca
|
5796bfc401
llm: only statically link libstdc++
|
10 月之前 |
jmorganca
|
f1a379aa56
llm: statically link pthread and stdc++ dependencies in windows build
|
10 月之前 |
Jeffrey Morgan
|
5304b765b2
llm: put back old include dir (#5507)
|
10 月之前 |
Jeffrey Morgan
|
78fb33dd07
fix typo in cgo directives in `llm.go` (#5501)
|
10 月之前 |
Jeffrey Morgan
|
8f8e736b13
update llama.cpp submodule to `d7fd29f` (#5475)
|
10 月之前 |
Michael Yang
|
829ff87bd1
revert tokenize ffi (#4761)
|
11 月之前 |
Jeffrey Morgan
|
763bb65dbb
use `int32_t` for call to tokenize (#4738)
|
11 月之前 |
Michael Yang
|
bf54c845e9
vocab only
|
11 月之前 |
Michael Yang
|
26a00a0410
use ffi for tokenizing/detokenizing
|
11 月之前 |
Michael Yang
|
01811c176a
comments
|
1 年之前 |
Michael Yang
|
9685c34509
quantize any fp16/fp32 model
|
1 年之前 |
Hernan Martinez
|
86e67fc4a9
Add import declaration for windows,arm64 to llm.go
|
1 年之前 |
Michael Yang
|
9502e5661f
cgo quantize
|
1 年之前 |
Daniel Hiltgen
|
58d95cc9bd
Switch back to subprocessing for llama.cpp
|
1 年之前 |
Michael Yang
|
91b3e4d282
update memory calcualtions
|
1 年之前 |
Michael Yang
|
d338d70492
refactor model parsing
|
1 年之前 |
Patrick Devine
|
1b272d5bcd
change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347)
|
1 年之前 |
Jeffrey Morgan
|
f9cd55c70b
disable gpu for certain model architectures and fix divide-by-zero on memory estimation
|
1 年之前 |
Daniel Hiltgen
|
6c5ccb11f9
Revamp ROCm support
|
1 年之前 |