TitaNet: Neural Model for Speaker Representation with AAM Loss and Cosine Similarity
The author utilizes the TitaNet model trained with additive angular margin (AAM) loss to optimize cosine distance between speaker embeddings, using cosine similarity as the back-end metric.