Speed up by storing cross-attention scores only at the last timestep
80a6063 (verified) · jixin0101 committed 5 days ago