Title: Agglomerative Clustering of Handwritten Numerals to Determine Similarity of Different Languages

Abstract: Handwritten numerals differ in character from one language to another. The similarities and dissimilarities between languages can be measured by analyzing features extracted from their numerals. Handwritten numeral datasets are available and accessible for many well-known languages from different regions. In this paper, several handwritten numeral datasets from different languages are collected and used to measure the similarity among those written languages by determining and comparing the similarity of their handwritten numerals. This helps identify which languages share the same or a closely related parent language. First, a similarity measure between two numeral images is constructed with a Siamese network. Second, the similarity between numeral datasets is determined using the Siamese network and a new similarity-averaging technique based on random sampling with replacement. Finally, agglomerative clustering is performed on the pairwise similarities of the datasets. This clustering reveals several interesting properties of the datasets; the property this paper focuses on is their regional resemblance. Analyzing the clusters makes it easy to identify which languages originated in similar regions.
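
The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `pair_similarity` is a hypothetical stand-in for the trained Siamese network (here a simple distance-based score on toy feature vectors), the sample count and the average-linkage rule are assumptions, and the four toy "datasets" A to D are invented purely for illustration.

```python
import random
from itertools import combinations

def pair_similarity(a, b):
    """Hypothetical stand-in for the Siamese network: score in (0, 1]."""
    dist = sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
    return 1.0 / (1.0 + dist)

def dataset_similarity(ds_a, ds_b, n_samples=200, rng=None):
    """Average the pair score over pairs drawn at random with replacement."""
    if rng is None:
        rng = random.Random(0)
    total = 0.0
    for _ in range(n_samples):
        total += pair_similarity(rng.choice(ds_a), rng.choice(ds_b))
    return total / n_samples

def agglomerate(names, sim, n_clusters):
    """Average-linkage agglomerative clustering (linkage choice is assumed)."""
    clusters = [[n] for n in names]

    def linkage(c1, c2):
        pairs = [sim[frozenset((a, b))] for a in c1 for b in c2]
        return sum(pairs) / len(pairs)

    while len(clusters) > n_clusters:
        # Merge the two clusters with the highest mean similarity.
        i, j = max(combinations(range(len(clusters)), 2),
                   key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters

# Toy data: 2-D points standing in for numeral images (illustrative only).
rng = random.Random(42)

def toy_dataset(center, n=50):
    return [[center + rng.gauss(0, 0.1), center + rng.gauss(0, 0.1)]
            for _ in range(n)]

datasets = {"A": toy_dataset(0.0), "B": toy_dataset(0.2),
            "C": toy_dataset(5.0), "D": toy_dataset(5.2)}

# Pairwise dataset similarities, keyed by the unordered pair of names.
sim = {frozenset((p, q)): dataset_similarity(datasets[p], datasets[q], rng=rng)
       for p, q in combinations(datasets, 2)}

clusters = agglomerate(list(datasets), sim, n_clusters=2)
print(sorted(sorted(c) for c in clusters))
```

In this toy setup the datasets generated around nearby centers end up in the same cluster, which mirrors the regional grouping the abstract describes. With real numeral images, `pair_similarity` would be replaced by the output of a trained network.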
Comments: Submitted to the 22nd International Conference on Computer and Information Technology (ICCIT), 18-20 December, 2019, 6 pages, 5 figures and 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
DOI: 10.1109/ICCIT48885.2019.9038550
Cite as: arXiv:2012.07599 [cs.CV]
  (or arXiv:2012.07599v1 [cs.CV] for this version)

Submission history

From: Rahat Zaman
[v1] Sun, 22 Nov 2020 04:36:25 GMT (1056kb,D)