Bruce MacDonald f2ba1311aa improve vram safety with 5% vram memory buffer (#724) 1 year ago
llama.cpp ab0668293c llm: fix build on `amd64` 1 year ago
falcon.go c02c0cd483 starcoder 1 year ago
ggml.go c02c0cd483 starcoder 1 year ago
gguf.go c02c0cd483 starcoder 1 year ago
llama.go f2ba1311aa improve vram safety with 5% vram memory buffer (#724) 1 year ago
llm.go d06bc0cb6e enable q8, q5, 5_1, and f32 for linux gpu (#699) 1 year ago
starcoder.go c02c0cd483 starcoder 1 year ago
utils.go fccf8d179f partial decode ggml bin for more info 1 year ago