Commit

idea
lucidrains authored Apr 16, 2024
1 parent 5fd1e48 commit 8faa05e
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
@@ -104,6 +104,7 @@ $ python assert.py --use-cuda --causal --striped-ring-attn
- [ ] add ring attention to Tri's flash attention implementation. find some cuda ring reduce impl
- [ ] figure out how to pytest distributed pytorch
- [ ] use sdp context manager to validate when it is possible to use `ring_flash_attn_cuda`, otherwise assert out
- [ ] improvise a variant where each machine keeps compressed summary tokens, and one only ring pass those summary token for some given distance
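
One possible reading of the last item, sketched in plain Python: if summary tokens are only ring-passed for a bounded number of hops, each rank ends up seeing summaries from a limited set of neighbours. The helper `summary_sources` and its parameters are hypothetical (they are not part of this repository), and the sketch assumes a non-striped layout in which rank `i` holds the `i`-th contiguous sequence chunk and every rank sends to `(rank + 1) % world_size`.

```python
def summary_sources(rank, world_size, max_hops, causal=False):
    """Ranks whose summary tokens reach `rank` within `max_hops` ring passes.

    Hypothetical helper for illustration only. Assumes each rank sends to
    (rank + 1) % world_size, so after `hop` passes a rank holds the block
    that originated at (rank - hop) % world_size.
    """
    sources = []
    # A full trip around the ring takes world_size - 1 hops at most.
    for hop in range(1, min(max_hops, world_size - 1) + 1):
        src = (rank - hop) % world_size
        if causal and src > rank:
            # Wrapped around the ring: this is a later sequence chunk,
            # which a causal model would mask out anyway.
            continue
        sources.append(src)
    return sources


if __name__ == "__main__":
    for r in range(4):
        print(r, summary_sources(r, world_size=4, max_hops=2, causal=True))
```

Under these assumptions, capping `max_hops` bounds the communication per rank at `max_hops` summary blocks instead of `world_size - 1` full key/value blocks; how the summaries themselves are compressed is left open by the to-do item.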

## Citations
