Yahoo Password Frequency Corpus
External Data Source
Each of the 51 .txt files represents one subset of all users' passwords observed during the experiment period. "yahoo-all.txt" includes all users; every other file represents a strict subset of that group. Each file is a series of lines of the format: FREQUENCY #OBSERVATIONS ... with FREQUENCY in descending order. For example, a file consisting of the three lines "3 1", "2 1", and "1 3" would represent the frequency list (3, 2, 1, 1, 1), that is, one password...
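The line format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the corpus: it assumes each line holds two whitespace-separated integers (FREQUENCY, then #OBSERVATIONS), and the function names parse_frequency_lines and expand_frequency_list are hypothetical names chosen for illustration. It uses the small example from the description rather than a real corpus file.

```python
from typing import Iterable, List, Tuple


def parse_frequency_lines(lines: Iterable[str]) -> List[Tuple[int, int]]:
    """Parse lines of the form 'FREQUENCY #OBSERVATIONS' into integer pairs.

    Assumption: fields are whitespace-separated; blank lines are skipped.
    """
    pairs = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # ignore empty lines
        pairs.append((int(fields[0]), int(fields[1])))
    return pairs


def expand_frequency_list(pairs: List[Tuple[int, int]]) -> List[int]:
    """Expand (frequency, observations) pairs into the full frequency list."""
    expanded = []
    for frequency, observations in pairs:
        # 'observations' distinct passwords were each seen 'frequency' times
        expanded.extend([frequency] * observations)
    return expanded


# The three-line example from the description, not real corpus data.
sample = "3 1\n2 1\n1 3\n"
pairs = parse_frequency_lines(sample.splitlines())
frequency_list = expand_frequency_list(pairs)

print(frequency_list)        # [3, 2, 1, 1, 1]
print(len(frequency_list))   # number of distinct passwords: 5
print(sum(frequency_list))   # total password observations: 8
```

To read an actual file, the same parser could be given an open file object, for example parse_frequency_lines(open("yahoo-all.txt")), assuming the file follows the format exactly as described.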