Johannes Gäßler 17366df842
Multi GPU support, CUDA refactor, CUDA scratch buffer (#1703)
* CUDA multi GPU + scratch

ggml_cuda_compute_forward

Tensor parallelism

ggml_cuda_add

ggml_cuda_rms_norm

ggml_cuda_silu

CUDA scratch buffer

--main-gpu CLI option
2023-06-06 21:33:23 +02:00
..
2023-06-04 23:34:30 +03:00
2023-03-29 20:21:09 +03:00
2023-03-25 21:51:41 +02:00
2023-06-04 23:34:30 +03:00