The linear recurrence-based alternative to attention that scales linearly with sequence length
4 views