Anuraag (Rag) Agrawal e28f2d4900 openai: return usage as final chunk for streams (#6784) 4 months ago
..
images 1713eddcd0 Fix import image width (#6528) 8 months ago
README.md 8cc0ee2efe Doc container usage and workaround for nvidia errors 11 months ago
api.md 527cc97899 llama: update vendored code to commit 40c6d79f (#7875) 4 months ago
development.md 82a02e18d9 build: fix typo in override variable (#8031) 4 months ago
docker.md a0ea067b63 build: fix arm container image (#7674) 5 months ago
faq.md 1bdab9fdb1 llm: introduce k/v context quantization (vRAM improvements) (#6279) 4 months ago
gpu.md 4879a234c4 build: Make target improvements (#7499) 4 months ago
import.md 2f0a8c8778 docs: fix minor typo in import.md (#7764) 5 months ago
linux.md 4879a234c4 build: Make target improvements (#7499) 4 months ago
modelfile.md 2b82c5a8a1 docs: correct default num_predict value in modelfile.md (#7693) 4 months ago
openai.md e28f2d4900 openai: return usage as final chunk for streams (#6784) 4 months ago
template.md 55ea963c9e update default model to llama3.2 (#6959) 7 months ago
troubleshooting.md abfdc4710f all: fix typos in documentation, code, and comments (#7021) 4 months ago
tutorials.md 85951d25ef Created tutorial for running Ollama on NVIDIA Jetson devices (#1098) 1 year ago
windows.md 4879a234c4 build: Make target improvements (#7499) 4 months ago