Events

Niyati Bafna (JHU) – “Evaluating and Inducing Dialectal Robustness in Large Language Models”

March 12, 2025

When: March 14, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: While state-of-the-art models for machine translation and natural language understanding perform well on high-resource members of (some of) the world’s language families, they degrade on closely related languages, dialects, and variants of these languages, which[…]

Xinyu Crystina Zhang (University of Waterloo) – “Information Seeking Beyond English”

March 11, 2025

When: March 10, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: Pretrained language models have brought revolutionary progress to information-seeking in the English world. While the advance is exciting, how to transfer such progress into non-English, especially lower-resource languages, presents new challenges that require[…]

Jaemin Cho (UNC Chapel Hill) – “Faithful Reasoning and Fine-grained Evaluation for Multimodal Generation”

February 27, 2025

When: March 3, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: The paradigm of training large-scale foundation models has driven significant advancements in multimodal AI. However, pursuing further performance gains solely through model scaling is becoming impractical due to rising computational costs and resource limitations.[…]

Nikhil Sharma (JHU) – “Information Foraging with AI: Human Behavior, Systemic Auditing and Redesigning Knowledge Access”

February 24, 2025

When: February 28, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: Information is one of the most powerful forces in today’s digital era, shaping decisions at every level of society. While the ways we seek and access information have evolved over time, core challenges, such as[…]

Tianyu Gao (Princeton University) – “Enabling Language Models to Process Information at Scale”

February 21, 2025

When: February 24, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: Language models (LMs) are highly effective at understanding and generating text, holding immense potential as intuitive, personalized interfaces for accessing information. Expanding their ability to gather and synthesize large volumes of information will further[…]

Andrew Blair-Stanek (JHU) – “Shelter Check Dataset and Experiments; LLM Instability on Legal Questions”

February 17, 2025

When: February 14, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: This presentation will cover two papers, both on LLMs’ abilities to handle complex legal tasks. First is the Shelter Check dataset, a curated set of past tax-minimization schemes. We test o1 and claude-3.5’s abilities[…]

Igor Molybog (University of Hawaii at Manoa) – “Development and deployment of large-scale AI systems and the tasks they still struggle with”

February 17, 2025

When: February 21, 2025 @ 12:00 pm – 1:15 pm

Where: Hackerman Hall B17, 3400 N CHARLES ST

Abstract: In this talk, Igor will share the latest findings from his research group on the development, evaluation, and deployment of large language models (LLMs) and their multi-modal extensions. We will outline the end-to-end pipeline,[…]

Center for Language and Speech Processing