Ollama

Note: Ollama is in early preview. Please report any issues you find.

Run, create, and share large language models (LLMs).

Download

Download for macOS on Apple Silicon (Intel coming soon)
Download for Windows and Linux (coming soon)
Build from source

Quickstart

To run and chat with Llama 2, the new model by Meta:

ollama run llama2

Model library

Ollama includes a library of open-source models:

| Model | Parameters | Size | Download |
| --- | --- | --- | --- |
| Llama2 | 7B | 3.8GB | `ollama pull llama2` |
| Llama2 13B | 13B | 7.3GB | `ollama pull llama2:13b` |
| Orca Mini | 3B | 1.9GB | `ollama pull orca` |
| Vicuna | 7B | 3.8GB | `ollama pull vicuna` |
| Nous-Hermes | 13B | 7.3GB | `ollama pull nous-hermes` |
| Wizard Vicuna Uncensored | 13B | 7.3GB | `ollama pull wizard-vicuna` |

Note: You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models.
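
As a rough illustration of the note above, the RAM guidance can be restated as a lookup table. This is a minimal sketch, not part of Ollama: the names `minRAMGB` and `largestRunnable` are hypothetical, and the figures are only the "at least" values quoted in the note.

```go
package main

import "fmt"

// minRAMGB restates the note above: parameter size -> suggested minimum RAM in GB.
// Hypothetical helper data, not something Ollama itself exposes.
var minRAMGB = map[string]int{"3B": 8, "7B": 16, "13B": 32}

// largestRunnable returns the largest listed parameter size whose suggested
// minimum fits in ramGB, or "" if even the smallest does not fit.
func largestRunnable(ramGB int) string {
	best, bestRAM := "", 0
	for size, need := range minRAMGB {
		if need <= ramGB && need > bestRAM {
			best, bestRAM = size, need
		}
	}
	return best
}

func main() {
	for _, ram := range []int{4, 8, 16, 24, 32} {
		fmt.Printf("%2d GB -> %q\n", ram, largestRunnable(ram))
	}
}
```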

Examples

Run a model

ollama run llama2
>>> hi
Hello! How can I help you today?

Create a custom model

Pull a base model:

ollama pull llama2

Create a Modelfile:

FROM llama2

# set the temperature to 1 [higher is more creative, lower is more coherent]
PARAMETER temperature 1

# set the system prompt
SYSTEM """
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.
"""

Next, create and run the model:

ollama create mario -f ./Modelfile
ollama run mario
>>> hi
Hello! It's your friend Mario.

For more examples, see the examples directory.
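
If you want to generate a Modelfile like the one above from a script, a small Go program can render it. This is a sketch under assumptions: `renderModelfile` is a hypothetical helper, and it emits only the three directives shown in this README (`FROM`, `PARAMETER temperature`, `SYSTEM`).

```go
package main

import "fmt"

// renderModelfile returns Modelfile text built from the three directives
// used in the example above. Hypothetical helper, not part of Ollama.
func renderModelfile(base string, temperature float64, system string) string {
	return fmt.Sprintf(
		"FROM %s\n\nPARAMETER temperature %g\n\nSYSTEM \"\"\"\n%s\n\"\"\"\n",
		base, temperature, system,
	)
}

func main() {
	// Print the Modelfile to stdout; redirect it to a file to use it.
	fmt.Print(renderModelfile(
		"llama2",
		1,
		"You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.",
	))
}
```

Redirect the output to `./Modelfile`, then run `ollama create mario -f ./Modelfile` as shown earlier.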

Pull a model from the registry

ollama pull orca

List local models

ollama list

Model packages

Overview

Ollama bundles model weights, configuration, and data into a single package, defined by a Modelfile.

Building

go build .

To run it, start the server:

./ollama serve &

Finally, run a model!

./ollama run llama2

REST API

`POST /api/generate`

Generate text from a model.

curl -X POST http://localhost:11434/api/generate -d '{"model": "llama2", "prompt":"Why is the sky blue?"}'
