Richard Socher (MetaMind) “Multimodal Question Answering for Language and Vision”
Abstract Deep Learning enabled tremendous breakthroughs in visual understanding and speech recognition. Ostensibly, this is not the case in natural language processing (NLP) and higher level reasoning. However, it only appears that way because there[…]