Nincs leírás

221 Ágak

Patrick Devine e6d0062c13 move model struct		1 éve
api	6e2be5a8a0 add create, pull, and push	1 éve
app	560f36e6c8 app: set `first-time-run` to `true` instead of `false`	1 éve
cmd	2e1394e405 add progressbar for model pulls	1 éve
docs	6fdea03049 docs: remove `python.md`	1 éve
examples	993cb9fad6 examples: add basic python example	1 éve
llama	40c9dc0a31 fix multibyte responses	1 éve
parser	0573eae4b4 changes to the parser, FROM line, and fix commands	1 éve
scripts	4dd296e155 build app in publish script	1 éve
server	e6d0062c13 move model struct	1 éve
web	787d965331 web: disable signup button while submitting	1 éve
.dockerignore	6292f4b64c update `Dockerfile`	1 éve
.gitignore	7c71c10d4f fix compilation issue in Dockerfile, remove from `README.md` until ready	1 éve
.prettierrc.json	8685a5ad18 move .prettierrc.json to root	1 éve
Dockerfile	7c71c10d4f fix compilation issue in Dockerfile, remove from `README.md` until ready	1 éve
LICENSE	df5fdd6647 `proto` -> `ollama`	1 éve
README.md	304f419429 update `README.md` API reference	1 éve
ggml-metal.metal	e64ef69e34 look for ggml-metal in the same directory as the binary	1 éve
go.mod	a806b03f62 no errgroup	1 éve
go.sum	a806b03f62 no errgroup	1 éve
main.go	1775647f76 continue conversation	1 éve
models.json	5028de2901 update vicuna model	1 éve

Ollama

Run large language models with llama.cpp.

Note: certain models that can be run with Ollama are intended for research and/or non-commercial use only.

Features

Download and run popular large language models
Switch between multiple models on the fly
Hardware acceleration where available (Metal, CUDA)
Fast inference server written in Go, powered by llama.cpp
REST API to use with your application (python, typescript SDKs coming soon)

Install

Download for macOS
Download for Windows (coming soon)

You can also build the binary from source.

Quickstart

Run a fast and simple model.

ollama run orca

Example models

💬 Chat

Have a conversation.

ollama run vicuna "Why is the sky blue?"

🗺️ Instructions

Get a helping hand.

ollama run orca "Write an email to my boss."

🔎 Ask questions about documents

Send the contents of a document and ask questions about it.

ollama run nous-hermes "$(cat input.txt)", please summarize this story

📖 Storytelling

Venture into the unknown.

ollama run nous-hermes "Once upon a time"

Advanced usage

Run a local model

ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

Building

go build .

To run it start the server:

./ollama server &

Finally, run a model!

./ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

API Reference

`POST /api/pull`

Download a model

curl -X POST http://localhost:11343/api/pull -d '{"model": "orca"}'

`POST /api/generate`

Complete a prompt

curl -X POST http://localhost:11434/api/generate -d '{"model": "orca", "prompt": "hello!"}'

README.md