Does Moral Code Have a Moral Code? Probing Delphi's Moral Philosophy

Fraser, Kathleen C.; Kiritchenko, Svetlana; Balkir, Esma

(Submitted on 25 May 2022)

Abstract: In an effort to guarantee that machine learning model outputs conform with human moral values, recent work has begun exploring the possibility of explicitly training models to learn the difference between right and wrong. This is typically done in a bottom-up fashion, by exposing the model to different scenarios, annotated with human moral judgements. One question, however, is whether the trained models actually learn any consistent, higher-level ethical principles from these datasets -- and if so, what? Here, we probe the Allen AI Delphi model with a set of standardized morality questionnaires, and find that, despite some inconsistencies, Delphi tends to mirror the moral principles associated with the demographic groups involved in the annotation process. We question whether this is desirable and discuss how we might move forward with this knowledge.

Comments:	To appear at TrustNLP Workshop @ NAACL 2022
Subjects:	Computers and Society (cs.CY); Computation and Language (cs.CL)
Cite as:	arXiv:2205.12771 [cs.CY]
	(or arXiv:2205.12771v1 [cs.CY] for this version)

Submission history

From: Kathleen Fraser
[v1] Wed, 25 May 2022 13:37:56 GMT (295kb,D)
