The command for editing the dictionary.

174region174

Active member
Joined
Jun 30, 2021
Messages
600
Reaction score
1,183
Credits
3,789
My dictionaries have the form

What command should I run to get the file of this form?
I noticed. The dictionary in Figure 2 contains more words. It takes up less hard disk space.
 

Attachments

  • dic1.png
    dic1.png
    13.4 KB · Views: 79
  • dic2.png
    dic2.png
    112 KB · Views: 80

D3F4ULT

Active member
Contributor
Joined
Dec 30, 2019
Messages
331
Reaction score
49
Credits
486
you can use emeditor for that go to replace tab select "Regular Expressions" and type "\n" in "Find" and type "." in "Replace With" then press the "Replace All" button
 

174region174

Active member
Joined
Jun 30, 2021
Messages
600
Reaction score
1,183
Credits
3,789
you can use emeditor for that go to replace tab select "Regular Expressions" and type "\n" in "Find" and type "." in "Replace With" then press the "Replace All" button
It didn't work.
 

Attachments

  • dic3.png
    dic3.png
    17.2 KB · Views: 13

174region174

Active member
Joined
Jun 30, 2021
Messages
600
Reaction score
1,183
Credits
3,789
Yeah. I've already been prompted on another forum, but I haven't done it yet. On Linux, use the command
tr -d '\r' < file.txt > file_without_CR.txt
It might work.
If the words are located in a column in a text document... Its size has been increased almost 3 times.
 

itamatoshi

New member
Joined
Nov 20, 2021
Messages
2
Reaction score
0
Credits
12
My dictionaries have the form

What command should I run to get the file of this form?
I noticed. The dictionary in Figure 2 contains more words. It takes up less hard disk space.

with dot :
aaa.bbb2.ccc13.vvvv.lll!21.wsdsfl!:P!.wedsk!~.aaaa1

Code:
cat withdot | sed 's/\./\n/g'

aaa
bbb2
ccc13
vvvv
lll!21
wsdsfl!:P!
wedsk!~
aaaa1

with newline

bbb2
ccc13
vvvv
lll!21
wsdsfl!:P!
wedsk!~
aaaa1

Code:
cat todot | tr '\n' '.'

aaa.bbb2.ccc13.vvvv.lll!21.wsdsfl!:P!.wedsk!~.aaaa1.
 

itamatoshi

New member
Joined
Nov 20, 2021
Messages
2
Reaction score
0
Credits
12
and it can't contain more words and take less space

printf '\x0A' > dumbtest
printf '\x2E' > dumbtest2
printf '\x00' > dumbtest3

They will be all equal to 1 byte, even the null byte
 

174region174

Active member
Joined
Jun 30, 2021
Messages
600
Reaction score
1,183
Credits
3,789
The carriage return and line break command is set between words. I really don't understand what I'm doing. There is no dot. This is displayed by the program itself. Of course, if I understood correctly.
 

Go619

Member
Joined
Nov 21, 2021
Messages
9
Reaction score
4
Credits
90
@174region174
can you write full command you are used i have some of list to and i want to edit it to shrink size in hard disk space
 

Dawbs

Super Moderator
Staff member
Super Moderator
Trusted
Joined
Dec 30, 2019
Messages
4,173
Reaction score
3,458
Credits
18,332
You can compress the wordlist to make it smaller on the disk. Doesn't have a massive impact on speed. Not all compression formats are supported though.
 

174region174

Active member
Joined
Jun 30, 2021
Messages
600
Reaction score
1,183
Credits
3,789
How can I recode a text document from UTF 8 specification to UTF 16LE BOM specification?
UTF8

UTF16LE BOM
 

Attachments

  • utf16le_bom.png
    utf16le_bom.png
    48.5 KB · Views: 13
  • utf8.png
    utf8.png
    52.8 KB · Views: 13

174region174

Active member
Joined
Jun 30, 2021
Messages
600
Reaction score
1,183
Credits
3,789
Notepad++ cannot open a large text document. I would like to do this on a Linux system using a command in the terminal.
 

Unic0rn

Active member
Joined
Sep 30, 2022
Messages
151
Reaction score
244
Credits
933
Notepad++ cannot open a large text document. I would like to do this on a Linux system using a command in the terminal.
I believe iconv might be what you're looking for. The following should do the trick:

iconv -f utf-8 -t utf-16le in_file.txt > out_file.txt


If you need to guess (as it's impossible to "determine" decisively) the encoding of a file, the following (on linux systems) may help:

file my_file.txt --mime-encoding


Or, if the situation calls for a more sophisticated tool, cchardet is pretty great and will even give you a "confidence level". Here is a link to an article with some info about it (and character encodings in general) as well as some code you can use to get it to work.


Of course, if all else fails there's always running "split -b X my_file.txt" where X is some reasonable number of bytes (ie 2000000000 to 10000000000-ish), opening the resulting files in notepad++, doing whatever is needed there, and then cat-ing them back together afterwards.
 
Top