PassGAN: A Deep Learning Approach for Password Guessing

Hitaj, Briland; Gasti, Paolo; Ateniese, Giuseppe; Perez-Cruz, Fernando

Full-text links:

Download:

Current browse context:

cs.CR

< prev | next >

new | recent | 1709

Computer Science > Cryptography and Security

Title: PassGAN: A Deep Learning Approach for Password Guessing

Authors: Briland Hitaj, Paolo Gasti, Giuseppe Ateniese, Fernando Perez-Cruz

(Submitted on 1 Sep 2017 (v1), last revised 14 Feb 2019 (this version, v3))

Abstract: State-of-the-art password guessing tools, such as HashCat and John the Ripper, enable users to check billions of passwords per second against password hashes. In addition to performing straightforward dictionary attacks, these tools can expand password dictionaries using password generation rules, such as concatenation of words (e.g., "password123456") and leet speak (e.g., "password" becomes "p4s5w0rd"). Although these rules work well in practice, expanding them to model further passwords is a laborious task that requires specialized expertise. To address this issue, in this paper we introduce PassGAN, a novel approach that replaces human-generated password rules with theory-grounded machine learning algorithms. Instead of relying on manual password analysis, PassGAN uses a Generative Adversarial Network (GAN) to autonomously learn the distribution of real passwords from actual password leaks, and to generate high-quality password guesses. Our experiments show that this approach is very promising. When we evaluated PassGAN on two large password datasets, we were able to surpass rule-based and state-of-the-art machine learning password guessing tools. However, in contrast with the other tools, PassGAN achieved this result without any a-priori knowledge on passwords or common password structures. Additionally, when we combined the output of PassGAN with the output of HashCat, we were able to match 51%-73% more passwords than with HashCat alone. This is remarkable, because it shows that PassGAN can autonomously extract a considerable number of password properties that current state-of-the art rules do not encode.

Comments:	This is an extended version of the paper which appeared in NeurIPS 2018 Workshop on Security in Machine Learning (SecML'18), see this https URL
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1709.00440 [cs.CR]
	(or arXiv:1709.00440v3 [cs.CR] for this version)

Submission history

From: Briland Hitaj [view email]
[v1] Fri, 1 Sep 2017 18:42:00 GMT (142kb,D)
[v2] Fri, 9 Mar 2018 21:03:46 GMT (476kb,D)
[v3] Thu, 14 Feb 2019 19:51:21 GMT (463kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1709.00440

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Cryptography and Security

Title: PassGAN: A Deep Learning Approach for Password Guessing

Submission history