References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Dataset Structural Index: Leveraging a machine's perspective towards visual data
(Submitted on 5 Oct 2021 (v1), last revised 23 Jan 2023 (this version, v3))
Abstract: With advances in vision and perception architectures, we have realized that working with data is equally crucial, if not more, than the algorithms. Till today, we have trained machines based on our knowledge and perspective of the world. The entire concept of Dataset Structural Index(DSI) revolves around understanding a machine`s perspective of the dataset. With DSI, I show two meta values with which we can get more information over a visual dataset and use it to optimize data, create better architectures, and have an ability to guess which model would work best. These two values are the Variety contribution ratio and Similarity matrix. In the paper, I show many applications of DSI, one of which is how the same level of accuracy can be achieved with the same model architectures trained over less amount of data.
Submission history
From: Dishant Parikh [view email][v1] Tue, 5 Oct 2021 06:40:16 GMT (1964kb,D)
[v2] Thu, 20 Jan 2022 12:18:03 GMT (1969kb,D)
[v3] Mon, 23 Jan 2023 05:33:48 GMT (1970kb,D)
Link back to: arXiv, form interface, contact.