Seminars

Sun Mon Tue Wed Thu Fri Sat
1
2
3
Sasha Rush (Cornell University) “Pretraining Without Attention” 12:00 pm
Sasha Rush (Cornell University) “Pretraining Without Attention” @ Hackerman Hall B17
Feb 3 @ 12:00 pm – 1:15 pm
Abstract Transformers are essential to pretraining. As we approach 5 years of BERT, the connection between attention as architecture and transfer learning remains key to this central thread in NLP. Other architectures such as CNNs and RNNs[...]
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28

Center for Language and Speech Processing