Cosine Attention (Cottention)
					Collection
				
Models for the paper Cottention: Linear Transformers With Cosine Attention https://arxiv.org/abs/2409.18747
					• 
				6 items
				• 
				Updated
					
				
This repository contains the BERT-Cos model described in Cottention: Linear Transformers With Cosine Attention.