Bump VRAM buffer back up

Under stress scenarios we're seeing OOMs, so this should help stabilize
allocations under heavy concurrency.
Daniel Hiltgen 11 months ago
parent
commit
30a7d7096c
2 changed files with 3 additions and 3 deletions
  1. gpu/gpu.go (+2 -2)
  2. gpu/gpu_darwin.go (+1 -1)

+ 2 - 2
gpu/gpu.go

@@ -31,8 +31,8 @@ type handles struct {
 }
 
 const (
-	cudaMinimumMemory = 256 * format.MebiByte
-	rocmMinimumMemory = 256 * format.MebiByte
+	cudaMinimumMemory = 457 * format.MebiByte
+	rocmMinimumMemory = 457 * format.MebiByte
 )
 
 var gpuMutex sync.Mutex

+ 1 - 1
gpu/gpu_darwin.go

@@ -15,7 +15,7 @@ import (
 )
 
 const (
-	metalMinimumMemory = 384 * format.MebiByte
+	metalMinimumMemory = 512 * format.MebiByte
 )
 
 func GetGPUInfo() GpuInfoList {
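
To illustrate what the larger buffers mean in practice, here is a minimal sketch of how a per-backend minimum-memory reserve could be subtracted from the free VRAM a device reports. It is not code from this commit: `usableMemory` and the 8 GiB figure are hypothetical, and `MebiByte` is redefined locally on the assumption that `format.MebiByte` equals 1024 * 1024 bytes.

```go
package main

import "fmt"

// MebiByte stands in for format.MebiByte so the sketch is self-contained.
// Assumption: it equals 1024 * 1024 bytes.
const MebiByte uint64 = 1024 * 1024

// Values taken from the "+" lines of the diff above.
const (
	cudaMinimumMemory  = 457 * MebiByte
	rocmMinimumMemory  = 457 * MebiByte
	metalMinimumMemory = 512 * MebiByte
)

// usableMemory is a hypothetical helper: it holds back `reserve` bytes
// from the reported free memory and returns what is left for allocations.
// It returns 0 when the device has no more free memory than the reserve.
func usableMemory(free, reserve uint64) uint64 {
	if free <= reserve {
		return 0
	}
	return free - reserve
}

func main() {
	free := 8 * 1024 * MebiByte // hypothetical device reporting 8 GiB free

	fmt.Println("CUDA usable MiB: ", usableMemory(free, cudaMinimumMemory)/MebiByte)
	fmt.Println("ROCm usable MiB: ", usableMemory(free, rocmMinimumMemory)/MebiByte)
	fmt.Println("Metal usable MiB:", usableMemory(free, metalMinimumMemory)/MebiByte)
}
```

Under these assumptions, raising the CUDA and ROCm reserve from 256 to 457 MiB holds back an extra 201 MiB per device, and raising the Metal reserve from 384 to 512 MiB holds back an extra 128 MiB, which leaves more headroom when concurrent allocations compete for VRAM.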