Johannes Gäßler a743d76a01
CUDA: generalize FP16 fattn vec kernel (#7061)
* CUDA: generalize FP16 fattn vec kernel

* disable unsupported head sizes for AMD in test

* try AMD fix

* fix batch size 2-8

* partially revert changes
2024-05-09 14:32:02 +02:00
..
2024-03-29 17:45:46 +02:00
2024-04-30 12:16:08 +03:00
2024-04-30 12:16:08 +03:00