We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: A Novel Topology for End-to-end Temporal Classification and Segmentation with Recurrent Neural Network

Authors: Taiyang Zhao
Abstract: Connectionist temporal classification (CTC) has matured as an alignment free to sequence transduction and shows competitive for end-to-end speech recognition. In the CTC topology, the blank symbol occupies more than half of the state trellis, which results the spike phenomenon of the non-blank symbols. For classification task, the spikes work quite well, but as to the segmentation task it does not provide boundaries information. In this paper, a novel topology is introduced to combine the temporal classification and segmentation ability in one framework.
Comments: 4 pages,3 figures
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as: arXiv:1912.04784 [cs.CL]
  (or arXiv:1912.04784v1 [cs.CL] for this version)

Submission history

From: Taiyang Zhao [view email]
[v1] Tue, 10 Dec 2019 15:53:59 GMT (403kb)

Link back to: arXiv, form interface, contact.