Language-Agnostic Age and Gender Classification of Voice using Self-supervised Pre-Training
Extracting speaker-dependent paralinguistic information out of a person's voice, provides an opportunity for adaptive behaviour related to speaker information in speech processing applications. For instance, in audio-based conversational applications, adapting responses to the attributes of the correspondent is an integral part in making the conversations effective. Two speaker attributes that hum
