1) Creating language models that are attuned to spontaneous conversations.
“There is more to conversational speech than Dysfluencies.”
2) Beyond mere word-based transcription:
identifying and cleaning up disfluencies.
transforming the output into “simplified English”.