Yen-ju Lu (JHU) “CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing”
Abstract We introduce Condition-Aware Self-Supervised Learning Representation (CA-SSLR), a generalist conditioning model broadly applicable to various speech-processing tasks. Compared to standard fine-tuning methods that optimize for downstream models, CA-SSLR integrates language and speaker embeddings from[…]