| Author | SHA1 | Message | Date |
|---|---|---|---|
| | 08f1e18965 | Offload layers to GPU based on new model size estimates (#1850) | 1 year ago |
| | 0b3118e0af | fix: relay request opts to loaded llm prediction (#1761) | 1 year ago |
| | d966b730ac | Switch windows build to fully dynamic | 1 year ago |