The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge

Wang, Weiqing; Cai, Danwei; Lin, Qingjian; Yang, Lin; Wang, Junjie; Wang, Jin; Li, Ming

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2109

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge

Authors: Weiqing Wang, Danwei Cai, Qingjian Lin, Lin Yang, Junjie Wang, Jin Wang, Ming Li

(Submitted on 5 Sep 2021 (v1), last revised 7 Sep 2021 (this version, v2))

Abstract: This report describes the submission of the DKU-DukeECE-Lenovo team to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2021 track 4. Our system including a voice activity detection (VAD) model, a speaker embedding model, two clustering-based speaker diarization systems with different similarity measurements, two different overlapped speech detection (OSD) models, and a target-speaker voice activity detection (TS-VAD) model. Our final submission, consisting of 5 independent systems, achieves a DER of 5.07% on the challenge test set.

Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2109.02002 [eess.AS]
	(or arXiv:2109.02002v2 [eess.AS] for this version)

Submission history

From: Weiqing Wang [view email]
[v1] Sun, 5 Sep 2021 05:45:52 GMT (476kb,D)
[v2] Tue, 7 Sep 2021 03:59:50 GMT (476kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2109.02002

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge

Submission history