hardmaru on Twitter: "Reformer: The Efficient Transformer They present techniques to reduce the time and memory complexity of Transformer, allowing batches of very long sequences (64K) to fit on one GPU. Should
![💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science 💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science](https://miro.medium.com/v2/resize:fit:1400/1*ooz1uaw4EgcCyz5Q13yF_Q.png)
💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science
![💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science 💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science](https://miro.medium.com/v2/resize:fit:1400/1*ifCm7OLNDi5liHo87ECEzA.png)
💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science
![Reformer: The Efficient Transformer", Anonymous et al 2019 {G} [handling sequences up to L=64k on 1 GPU] : r/MachineLearning Reformer: The Efficient Transformer", Anonymous et al 2019 {G} [handling sequences up to L=64k on 1 GPU] : r/MachineLearning](https://external-preview.redd.it/X0iUTaLs2Nk1xsiLuHSXDEF24fJPyIBmmpqk4epPlYg.jpg?auto=webp&s=036dcf53a951d6bafbf5c2dd6b37ccf914dabf13)
Reformer: The Efficient Transformer", Anonymous et al 2019 {G} [handling sequences up to L=64k on 1 GPU] : r/MachineLearning
![💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science 💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science](https://miro.medium.com/v2/resize:fit:2000/1*tOPx3TSpEF2faZB9_85ArQ.png)
💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science
![💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science 💡Illustrating the Reformer. 🚊 ️ The efficient Transformer | by Alireza Dirafzoon | Towards Data Science](https://miro.medium.com/v2/resize:fit:1200/1*LdrVO56qTiQHmyIJhBcT9A.png)