We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Social and Information Networks

Title: Graph2Speak: Improving Speaker Identification using Network Knowledge in Criminal Conversational Data

Abstract: Criminal investigations mostly rely on the collection of speech conversational data in order to identify speakers and build or enrich an existing criminal network. Social network analysis tools are then applied to identify the most central characters and the different communities within the network. We introduce two candidate datasets for criminal conversational data, Crime Scene Investigation (CSI), a television show, and the ROXANNE simulated data. We also introduce the metric of conversation accuracy in the context of criminal investigations. By re-ranking candidate speakers based on the frequency of previous interactions, we improve the speaker identification baseline by 1.2% absolute (1.3% relative), and the conversation accuracy by 2.6% absolute (3.4% relative) on CSI data, and by 1.1% absolute (1.2% relative), and 2% absolute (2.5% relative) respectively on the ROXANNE simulated data.
Subjects: Social and Information Networks (cs.SI); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2006.02093 [cs.SI]
  (or arXiv:2006.02093v4 [cs.SI] for this version)

Submission history

From: Mael Fabien [view email]
[v1] Wed, 3 Jun 2020 08:08:42 GMT (647kb,D)
[v2] Thu, 4 Jun 2020 06:37:37 GMT (647kb,D)
[v3] Tue, 9 Jun 2020 09:37:34 GMT (648kb,D)
[v4] Mon, 21 Sep 2020 12:19:29 GMT (1499kb,D)

Link back to: arXiv, form interface, contact.