Skip to content

use softmax_last_dim (metal and cuda kernel) in llama attention layer… #1586

use softmax_last_dim (metal and cuda kernel) in llama attention layer…

use softmax_last_dim (metal and cuda kernel) in llama attention layer… #1586

Annotations

1 warning

deploy

succeeded Oct 23, 2024 in 5s