Jaemin Cho (UNC Chapel Hill) – “Faithful Reasoning and Fine-grained Evaluation for Multimodal Generation”
Abstract The paradigm of training large-scale foundation models has driven significant advancements in multimodal AI. However, pursuing further performance gains solely through model scaling is becoming impractical due to rising computational costs and resource limitations.[…]