Explorar o código

Merge pull request #9824 from ollama/mxyng/sched

conditionally enable parallel pipelines
Michael Yang hai 1 mes
pai
achega
021dcf089d
Modificáronse 1 ficheiros con 1 adicións e 1 borrados
  1. 1 1
      ml/backend/ggml/ggml.go

+ 1 - 1
ml/backend/ggml/ggml.go

@@ -373,7 +373,7 @@ func New(r *os.File, params ml.BackendParams) (ml.Backend, error) {
 			(*C.ggml_backend_buffer_type_t)(unsafe.Pointer(&schedBufts[0])),
 			C.int(len(schedBackends)),
 			C.size_t(maxGraphNodes),
-			true,
+			C._Bool(len(gpus) > 1 && slices.Contains(gpus, output.d)),
 		),
 		input:  deviceBufferTypes[input.d],
 		output: deviceBufferTypes[output.d],