Household poverty classification in data-scarce environments: a machine learning approach

Kshirsagar, Varun; Wieczorek, Jerzy; Ramanathan, Sharada; Wells, Rachel

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1711

Statistics > Machine Learning

Title: Household poverty classification in data-scarce environments: a machine learning approach

Authors: Varun Kshirsagar, Jerzy Wieczorek, Sharada Ramanathan, Rachel Wells

(Submitted on 18 Nov 2017)

Abstract: We describe a method to identify poor households in data-scarce countries by leveraging information contained in nationally representative household surveys. It employs standard statistical learning techniques---cross-validation and parameter regularization---which together reduce the extent to which the model is over-fitted to match the idiosyncracies of observed survey data. The automated framework satisfies three important constraints of this development setting: i) The prediction model uses at most ten questions, which limits the costs of data collection; ii) No computation beyond simple arithmetic is needed to calculate the probability that a given household is poor, immediately after data on the ten indicators is collected; and iii) One specification of the model (i.e. one scorecard) is used to predict poverty throughout a country that may be characterized by significant sub-national differences. Using survey data from Zambia, the model's out-of-sample predictions distinguish poor households from non-poor households using information contained in ten questions.

Comments:	Presented at NIPS 2017 Workshop on Machine Learning for the Developing World, 7 pages with 4 figures
Subjects:	Machine Learning (stat.ML); Applications (stat.AP)
Cite as:	arXiv:1711.06813 [stat.ML]
	(or arXiv:1711.06813v1 [stat.ML] for this version)

Submission history

From: Jerzy Wieczorek [view email]
[v1] Sat, 18 Nov 2017 04:57:05 GMT (34kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1711.06813

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Household poverty classification in data-scarce environments: a machine learning approach

Submission history