Sasha Rush (Cornell University) “Pretraining Without Attention”
@ Hackerman Hall B17
Feb 3 @ 12:00 pm – 1:15 pm
Abstract: Transformers are essential to pretraining. As we approach 5 years of BERT, the connection between attention as architecture and transfer learning remains key to this central thread in NLP. Other architectures such as CNNs and RNNs[...]