Attention matrix
#12
by
stolosa
- opened
Hi! Does anyone know the direction of the data obtained from the attention matrices? For example when I get the data of the coordinate (5,1), what does it mean? The attention given from row 5 to column 1, or the other way around?
Maybe you would know @Rocketknight1 ?