Current browse context:
eess.AS
Change to browse by:
References & Citations
Electrical Engineering and Systems Science > Audio and Speech Processing
Title: The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge
(Submitted on 5 Sep 2021 (v1), last revised 7 Sep 2021 (this version, v2))
Abstract: This report describes the submission of the DKU-DukeECE-Lenovo team to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2021 track 4. Our system including a voice activity detection (VAD) model, a speaker embedding model, two clustering-based speaker diarization systems with different similarity measurements, two different overlapped speech detection (OSD) models, and a target-speaker voice activity detection (TS-VAD) model. Our final submission, consisting of 5 independent systems, achieves a DER of 5.07% on the challenge test set.
Submission history
From: Weiqing Wang [view email][v1] Sun, 5 Sep 2021 05:45:52 GMT (476kb,D)
[v2] Tue, 7 Sep 2021 03:59:50 GMT (476kb,D)
Link back to: arXiv, form interface, contact.