Ham2Pose: Animating Sign Language Notation into Pose Sequences

Shalev-Arkushin, Rotem; Moryossef, Amit; Fried, Ohad

(Submitted on 24 Nov 2022 (v1), last revised 1 Apr 2023 (this version, v2))

Abstract: Translating spoken languages into Sign languages is necessary for open communication between the hearing and hearing-impaired communities. To achieve this goal, we propose the first method for animating a text written in HamNoSys, a lexical Sign language notation, into signed pose sequences. As HamNoSys is universal by design, our proposed method offers a generic solution invariant to the target Sign language. Our method gradually generates pose predictions using transformer encoders that create meaningful representations of the text and poses while considering their spatial and temporal information. We use weak supervision for the training process and show that our method succeeds in learning from partial and inaccurate data. Additionally, we offer a new distance measurement that considers missing keypoints, to measure the distance between pose sequences using DTW-MJE. We validate its correctness using AUTSL, a large-scale Sign language dataset, show that it measures the distance between pose sequences more accurately than existing measurements, and use it to assess the quality of our generated pose sequences. Code for the data pre-processing, the model, and the distance measurement is publicly released for future research.
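
The distance measurement mentioned in the abstract can be illustrated with a minimal sketch of dynamic time warping (DTW) over a mean joint error (MJE) frame cost. This is not the authors' released code. The function names `frame_distance` and `dtw_mje`, the representation of a missing keypoint as `None`, and the rule of skipping joints that are missing in either frame are all assumptions made for illustration; the paper defines its own handling of missing keypoints.

```python
import math


def frame_distance(frame_a, frame_b):
    """Mean Euclidean joint error between two frames.

    Assumption (not from the paper): a missing keypoint is None, and a
    joint missing in either frame is skipped when averaging.
    """
    dists = [
        math.dist(p, q)
        for p, q in zip(frame_a, frame_b)
        if p is not None and q is not None
    ]
    # If no joint is present in both frames, treat the frames as equal.
    return sum(dists) / len(dists) if dists else 0.0


def dtw_mje(seq_a, seq_b):
    """Classic DTW alignment cost using frame_distance as the local cost."""
    n, m = len(seq_a), len(seq_b)
    # cost[i][j] = best alignment cost of seq_a[:i] against seq_b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # repeat a frame of seq_b
                cost[i][j - 1],      # repeat a frame of seq_a
                cost[i - 1][j - 1],  # advance both sequences
            )
    return cost[n][m]


# Two sequences of 2-joint frames; the second repeats its first frame
# and is missing one keypoint.
reference = [[(0.0, 0.0), (1.0, 1.0)], [(2.0, 2.0), (3.0, 3.0)]]
hypothesis = [
    [(0.0, 0.0), (1.0, 1.0)],
    [(0.0, 0.0), None],
    [(2.0, 2.0), (3.0, 3.0)],
]
print(dtw_mje(reference, hypothesis))
```

Because DTW allows frames to be repeated during alignment, two sequences that differ only in speed receive a low cost, which is why it suits signs performed at different tempos.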

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2211.13613 [cs.CV]
	(or arXiv:2211.13613v2 [cs.CV] for this version)

Submission history

From: Rotem Shalev-Arkushin
[v1] Thu, 24 Nov 2022 13:59:32 GMT (3706kb,D)
[v2] Sat, 1 Apr 2023 17:13:25 GMT (5162kb,D)
