説明なし

Patrick Devine 9818af9763 fix extended tag names		1 年間前
api	9f6e97865c allow pushing/pulling to insecure registries (#157)	1 年間前
app	dfceca48a7 update icons to have different images for bright and dark mode	1 年間前
cmd	9f6e97865c allow pushing/pulling to insecure registries (#157)	1 年間前
docs	52f04e39f2 Note that CGO must be enabled in dev docs	1 年間前
examples	8454f298ac fix example `Modelfile`s	1 年間前
format	5bea29f610 add new list command (#97)	1 年間前
library	6a19724d5f remove colon from library modelfiles	1 年間前
llama	8526e1f5f1 add llama.cpp mpi, opencl files	1 年間前
parser	d59b164fa2 add prompt back to parser	1 年間前
progressbar	e4d7f3e287 vendor in progress bar and change to bytes instead of bibytes (#130)	1 年間前
scripts	4dd296e155 build app in publish script	1 年間前
server	9818af9763 fix extended tag names	1 年間前
web	3c8f4c03d7 web: tweak homepage text	1 年間前
.dockerignore	6292f4b64c update `Dockerfile`	1 年間前
.gitignore	7c71c10d4f fix compilation issue in Dockerfile, remove from `README.md` until ready	1 年間前
.prettierrc.json	8685a5ad18 move .prettierrc.json to root	1 年間前
Dockerfile	7c71c10d4f fix compilation issue in Dockerfile, remove from `README.md` until ready	1 年間前
LICENSE	df5fdd6647 `proto` -> `ollama`	1 年間前
README.md	91cd54016c add basic REST api documentation	1 年間前
ggml-metal.metal	e64ef69e34 look for ggml-metal in the same directory as the binary	1 年間前
go.mod	e4d7f3e287 vendor in progress bar and change to bytes instead of bibytes (#130)	1 年間前
go.sum	e4d7f3e287 vendor in progress bar and change to bytes instead of bibytes (#130)	1 年間前
main.go	1775647f76 continue conversation	1 年間前

Ollama

Note: Ollama is in early preview. Please report any issues you find.

Run, create, and share large language models (LLMs).

Download

Download for macOS on Apple Silicon (Intel coming soon)
Download for Windows and Linux (coming soon)
Build from source

Quickstart

To run and chat with Llama 2, the new model by Meta:

ollama run llama2

Model library

ollama includes a library of open-source models:

Model	Parameters	Size	Download
Llama2	7B	3.8GB	`ollama pull llama2`
Llama2 13B	13B	7.3GB	`ollama pull llama2:13b`
Orca Mini	3B	1.9GB	`ollama pull orca`
Vicuna	7B	3.8GB	`ollama pull vicuna`
Nous-Hermes	13B	7.3GB	`ollama pull nous-hermes`
Wizard Vicuna Uncensored	13B	7.3GB	`ollama pull wizard-vicuna`

Note: You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models.

Examples

Run a model

ollama run llama2
>>> hi
Hello! How can I help you today?

Create a custom model

Pull a base model:

ollama pull llama2

Create a Modelfile:

FROM llama2

# set the temperature to 1 [higher is more creative, lower is more coherent]
PARAMETER temperature 1

# set the system prompt
SYSTEM """
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.
"""

Next, create and run the model:

ollama create mario -f ./Modelfile
ollama run mario
>>> hi
Hello! It's your friend Mario.

For more examples, see the examples directory.

Pull a model from the registry

ollama pull orca

Listing local models

ollama list

Model packages

Overview

Ollama bundles model weights, configuration, and data into a single package, defined by a Modelfile.

Building

go build .

To run it start the server:

./ollama serve &

Finally, run a model!

./ollama run llama2

REST API

`POST /api/generate`

Generate text from a model.

curl -X POST http://localhost:11434/api/generate -d '{"model": "llama2", "prompt":"Why is the sky blue?"}'

README.md