I present to you a 2.15Gb movie subtitles dictionary.
I scraped & cleaned and unduped them by myself.
Scraping and cleaning was done with costum python scripts. Unduping was done with app.merge from Blandyuk.
Running the dictionary on the remaning md5 list in the listmanger with rules yielded 2033 new founds. (see attached file)
Some passwords are over 20 characters.
Omit range rules seemed to perform the best, stacked with purging rules that remove spaces and punctuation also performed well.
Extract range rules also found a good amount.
I attached my 2 basic Omit range & extract range rules. I will also add them to my github.
Feel free to do your own experimenting and report back your findings ;)
Download link to subtitle dictionary: https://mega.nz/#!mnY1UQJQ!jx8HeeFiR8rnpD1WUgxk6mhE4u46YcFiU1bCoPjtoWM
BTW: There's a bunch of $HEX lines in the directory that got converted by app.merge, but I'm unsure if they're just weird encoded or scrap. If someone can look into that that would be great :)
I scraped & cleaned and unduped them by myself.
Scraping and cleaning was done with costum python scripts. Unduping was done with app.merge from Blandyuk.
Running the dictionary on the remaning md5 list in the listmanger with rules yielded 2033 new founds. (see attached file)
Some passwords are over 20 characters.
Omit range rules seemed to perform the best, stacked with purging rules that remove spaces and punctuation also performed well.
Extract range rules also found a good amount.
I attached my 2 basic Omit range & extract range rules. I will also add them to my github.
Feel free to do your own experimenting and report back your findings ;)
Download link to subtitle dictionary: https://mega.nz/#!mnY1UQJQ!jx8HeeFiR8rnpD1WUgxk6mhE4u46YcFiU1bCoPjtoWM
BTW: There's a bunch of $HEX lines in the directory that got converted by app.merge, but I'm unsure if they're just weird encoded or scrap. If someone can look into that that would be great :)