Efficient Attention Calculation (2024)

This project explores efficient transformer attention:

  • Reproduced previously proposed attention approximation and subsampling mechanisms.
  • Implemented a PQ‑tree–based token subsampler to reduce complexity from O(L2) to O(L log L).