Efficient Attention Calculation (2024)
This project explores efficient transformer attention:
- Reproduced previously proposed attention approximation and subsampling mechanisms.
- Implemented a PQ‑tree–based token subsampler to reduce complexity from O(L2) to O(L log L).
