Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The observation about the "block-diagonal patterns" in RAG isn't new and has been exploited / explored before:

- https://arxiv.org/abs/2410.07590 (literally titled "Block-Attention for Efficient RAG")

- https://arxiv.org/abs/2409.15355v3

- https://arxiv.org/abs/2212.10947

The REFRAG paper does not cite any of these.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: