Attention Flows Analyzing and Comparing Attention Mechanisms in Language Models Attention Flows:Analyzing and Comparing Attention Mechanisms in Language Models
Efficient Transformers A Survey.pdf Efficient Transformers: A Survey,这是2020年关于Transformer的综述,感兴趣的可以下载
Current limitations of language models what you need is retrieval.pdf Current limitations of language models:what you need is retrieval