TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models
This project is an active research effort, and the implementation is currently under development. We plan to open-source the full code once our research paper is published. Some components may be ...