Yulia Tsvetkov (University of Washington) “LLMs under the Microscope: Illuminating the Blind Spots and Improving the Reliability of Language Models”
Abstract Large language models (LMs) are pretrained on diverse data sources—news, discussion forums, books, online encyclopedias. A significant portion of this data includes facts and opinions which, on one hand, celebrate democracy and diversity of[…]