An External Stability Audit Framework to Test the Validity of Personality Prediction in AI Hiring

Rhea, Alene K.; Markey, Kelsey; D'Arinzo, Lauren; Schellmann, Hilke; Sloane, Mona; Squires, Paul; Khan, Falaah Arif; Stoyanovich, Julia

(Submitted on 23 Jan 2022 (v1), last revised 12 Apr 2022 (this version, v2))

Abstract: Automated hiring systems are among the fastest-developing of all high-stakes AI systems. Among these are algorithmic personality tests that use insights from psychometric testing, and promise to surface personality traits indicative of future success based on job seekers' resumes or social media profiles. We interrogate the validity of such systems using stability of the outputs they produce, noting that reliability is a necessary, but not a sufficient, condition for validity. Our approach is to (a) develop a methodology for an external audit of stability of predictions made by algorithmic personality tests, and (b) instantiate this methodology in an audit of two systems, Humantic AI and Crystal. Crucially, rather than challenging or affirming the assumptions made in psychometric testing -- that personality is a meaningful and measurable construct, and that personality traits are indicative of future success on the job -- we frame our methodology around testing the underlying assumptions made by the vendors of the algorithmic personality tests themselves.
Our main contribution is the development of a socio-technical framework for auditing the stability of algorithmic systems. This contribution is supplemented with an open-source software library that implements the technical components of the audit, and can be used to conduct similar stability audits of algorithmic systems. We instantiate our framework with the audit of two real-world personality prediction systems, namely Humantic AI and Crystal. The application of our audit framework demonstrates that both these systems show substantial instability with respect to key facets of measurement, and hence cannot be considered valid testing instruments.
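
As a rough illustration of what a stability check over a system's outputs can look like, the sketch below compares two scoring runs for the same candidates using Spearman rank correlation. This is an assumption made for illustration: the abstract does not say which stability measures the audit uses, the function names average_ranks and spearman are hypothetical and not taken from the authors' library, and the scores are invented. A value near 1 means the two runs rank candidates almost identically; lower values indicate instability.

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of equal values that starts at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long sequences with at least 2 items")
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rank correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


# Invented scores for five candidates on one trait, from two runs that
# differ only in a factor that should not affect the result.
run_a = [0.62, 0.35, 0.80, 0.51, 0.44]
run_b = [0.58, 0.49, 0.77, 0.30, 0.46]

rho = spearman(run_a, run_b)
print(f"rank-order stability (Spearman rho): {rho:.2f}")  # prints 0.60
```

The rank-based measure is chosen here because hiring tools are typically used to order candidates, so changes in ordering matter more than small shifts in raw scores. High stability alone would still not establish validity, as the abstract notes.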

Subjects:	Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2201.09151 [cs.CY]
	(or arXiv:2201.09151v2 [cs.CY] for this version)

Submission history

From: Julia Stoyanovich
[v1] Sun, 23 Jan 2022 00:44:56 GMT (3937kb,D)
[v2] Tue, 12 Apr 2022 02:29:08 GMT (3215kb,D)
