The command for editing the dictionary.

174region174 · Nov 17, 2021

My dictionaries have the form

What command should I run to get the file of this form?
I noticed that the dictionary in Figure 2 contains more words, yet it takes up less hard disk space.

D3F4ULT · Nov 17, 2021

You can use EmEditor for that: go to the Replace tab, select "Regular Expressions", type "\n" in "Find" and "." in "Replace With", then press the "Replace All" button.

174region174 · Nov 17, 2021

D3F4ULT said:
You can use EmEditor for that: go to the Replace tab, select "Regular Expressions", type "\n" in "Find" and "." in "Replace With", then press the "Replace All" button.

It didn't work.

D3F4ULT · Nov 17, 2021

174region174 said:
It didn't work.

Did you select "Regular Expressions" before pressing "Replace All"?

174region174 · Nov 17, 2021

Yes. I've already been given a suggestion on another forum, but I haven't tried it yet. On Linux, use the command:
tr -d '\r' < file.txt > file_without_CR.txt
It might work.
If the words are arranged in a column in a text document, its size increases almost 3 times.
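Before stripping anything, it may help to check whether the file contains carriage returns at all. The following is a minimal sketch, assuming a small hypothetical sample in place of the real dictionary; `file.txt` and `file_without_CR.txt` are the names from the command above.

```shell
# Hypothetical sample: two words with Windows-style CR LF line endings.
printf 'aaa\r\nbbb2\r\n' > file.txt

# Count the carriage returns; 0 would mean the file already uses plain LF endings.
cr_count=$(tr -cd '\r' < file.txt | wc -c)
echo "carriage returns before: $cr_count"

# Strip them with the command quoted above, then count again.
tr -d '\r' < file.txt > file_without_CR.txt
cr_left=$(tr -cd '\r' < file_without_CR.txt | wc -c)
echo "carriage returns after: $cr_left"
```

If the first count is 0, the `tr -d '\r'` step changes nothing and the file size stays the same.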

itamatoshi · Nov 20, 2021

174region174 said:
My dictionaries have the form

What command should I run to get the file of this form?
I noticed that the dictionary in Figure 2 contains more words, yet it takes up less hard disk space.

With dots:

aaa.bbb2.ccc13.vvvv.lll!21.wsdsfl!:P!.wedsk!~.aaaa1

Code:

cat withdot | sed 's/\./\n/g'

aaa
bbb2
ccc13
vvvv
lll!21
wsdsfl!:P!
wedsk!~
aaaa1
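The `\n` in the replacement part of `sed` is understood by GNU sed but not by every other implementation. As a hedged alternative, `tr` can do the same single-character swap; the sketch below uses a short hypothetical sample file named `withdot`. Note that either approach also splits any word that itself contains a dot.

```shell
# Hypothetical sample: dot-separated words on a single line.
printf 'aaa.bbb2.ccc13' > withdot

# Replace every dot with a line feed.
tr '.' '\n' < withdot > one_per_line
cat one_per_line
```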

With newlines:

aaa
bbb2
ccc13
vvvv
lll!21
wsdsfl!:P!
wedsk!~
aaaa1

Code:

cat todot | tr '\n' '.'

aaa.bbb2.ccc13.vvvv.lll!21.wsdsfl!:P!.wedsk!~.aaaa1.
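The `tr` version turns the final newline into a trailing dot, as the output above shows. If that trailing dot is unwanted, one option is `paste`, which only puts the separator between lines. This is a small sketch with a hypothetical three-word `todot` file; `withdot_joined` is an illustrative output name.

```shell
# Hypothetical sample file, one word per line.
printf 'aaa\nbbb2\nccc13\n' > todot

# -s joins all lines of the file into one line; -d sets "." as the separator.
paste -s -d '.' todot > withdot_joined
cat withdot_joined
```

The result ends with a single newline instead of a dot.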

itamatoshi · Nov 20, 2021

And a file can't contain more words and take less space just because a different separator is used:

printf '\x0A' > dumbtest
printf '\x2E' > dumbtest2
printf '\x00' > dumbtest3

All three files will be exactly 1 byte in size, even the one holding the null byte.
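The same point can be checked on whole words rather than single bytes. The sketch below is illustrative only: it counts the bytes of two hypothetical words joined by a line feed, by a dot, and by CR LF. Octal escapes are used because not every `printf` understands `\x`.

```shell
# One byte each: line feed, dot, and NUL.
printf '\012' | wc -c
printf '\056' | wc -c
printf '\000' | wc -c

# The same two words with three different separators.
lf_size=$(printf 'aaa\nbbb2\n' | wc -c)
dot_size=$(printf 'aaa.bbb2.' | wc -c)
crlf_size=$(printf 'aaa\r\nbbb2\r\n' | wc -c)
echo "LF: $lf_size  dot: $dot_size  CR LF: $crlf_size"
```

Line feed and dot give the same size; only the two-byte CR LF separator makes the file larger, by one extra byte per word.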

174region174 · Nov 20, 2021

A carriage return and line feed (CR LF) pair sits between the words. I really don't understand what I'm doing. There is no dot; that is just how the program displays it, if I understood correctly.

Go619 · Nov 29, 2021

@174region174
Can you post the full command you used? I have some lists too, and I want to edit them to reduce the hard disk space they take up.

174region174 · Nov 29, 2021

Go619 said:
@174region174
Can you post the full command you used? I have some lists too, and I want to edit them to reduce the hard disk space they take up.

I don't have any information. I also want to know for sure.

Go619 · Nov 29, 2021

174region174 said:
I don't have any information. I also want to know for sure.

I will search for it, and if I find it I will post it here.
Best of luck.

Dawbs · Nov 29, 2021

You can compress the wordlist to make it smaller on the disk. Doesn't have a massive impact on speed. Not all compression formats are supported though.
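As a rough illustration of the idea, the sketch below compresses a hypothetical generated wordlist with gzip and compares sizes. Choosing gzip is an assumption; whether a given tool can read `.gz` wordlists directly should be checked against that tool's documentation.

```shell
# Hypothetical wordlist with 1000 similar words, so the effect is visible.
i=0
while [ "$i" -lt 1000 ]; do
  echo "password$i"
  i=$((i + 1))
done > wordlist.txt

# -c writes the compressed data to stdout, so the original file is kept.
gzip -c wordlist.txt > wordlist.txt.gz
echo "plain bytes:      $(wc -c < wordlist.txt)"
echo "compressed bytes: $(wc -c < wordlist.txt.gz)"

# Stream the words back out without unpacking to disk, and count them.
gzip -dc wordlist.txt.gz | wc -l
```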

174region174 · Jun 3, 2022

How can I convert a text document from UTF-8 encoding to UTF-16LE with a BOM?
UTF8

UTF16LE BOM

flyinghaggis · Jun 9, 2022

Notepad++, under the Encoding menu...
I think that's what you're looking for

174region174 · Jun 9, 2022

Notepad++ cannot open a large text document. I would like to do this on a Linux system using a command in the terminal.

A1131 · Oct 16, 2022

Try this command:

cat file | sed 's/[ \t\r[:punct:][:digit:][:cntrl:]]/\n/g;' > newfile

Unic0rn · Nov 20, 2022

174region174 said:
Notepad++ cannot open a large text document. I would like to do this on a Linux system using a command in the terminal.

I believe iconv might be what you're looking for. The following should do the trick:

iconv -f utf-8 -t utf-16le in_file.txt > out_file.txt
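One caveat: with an explicit `utf-16le` target, iconv does not normally write a byte order mark. If the BOM from the question is required, a possible workaround is to write the two BOM bytes (FF FE) first and append the converted text. This is a sketch using a small hypothetical `in_file.txt`, not a tested recipe for every iconv build.

```shell
# Hypothetical sample input in UTF-8.
printf 'aaa\nbbb2\n' > in_file.txt

# Write the UTF-16LE byte order mark (FF FE) first, using portable octal escapes.
printf '\377\376' > out_file.txt

# Append the converted text after the BOM.
iconv -f utf-8 -t utf-16le in_file.txt >> out_file.txt

# Dump the first bytes in hex to confirm the file starts with ff fe.
od -An -tx1 out_file.txt | head -n 1
```

Each ASCII character becomes two bytes in UTF-16LE, so the output is roughly twice the size of a mostly-ASCII input.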

If you need to guess the encoding of a file (it's impossible to determine it conclusively), the following may help on Linux systems:

file my_file.txt --mime-encoding

Or, if the situation calls for a more sophisticated tool, cchardet is pretty great and will even give you a "confidence level". Here is a link to an article with some info about it (and character encodings in general) as well as some code you can use to get it to work.

Of course, if all else fails, there's always running "split -b X my_file.txt", where X is some reasonable number of bytes (i.e. 2000000000 to 10000000000-ish), opening the resulting files in Notepad++, doing whatever is needed there, and then cat-ing them back together afterwards.
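A small sketch of that split-and-rejoin round trip follows. It swaps `-b` for `-l` (split by line count), since a byte-based cut can land in the middle of a word or a multi-byte character; the `part_` prefix, the tiny sample file, and the 2-line chunk size are all hypothetical.

```shell
# Hypothetical sample standing in for a large wordlist.
printf '%s\n' one two three four five > my_file.txt

# Split into pieces of at most 2 lines each: part_aa, part_ab, part_ac.
split -l 2 my_file.txt part_

# (Edit the pieces here if needed.)

# The glob expands in sorted order, so cat restores the original sequence.
cat part_* > rejoined.txt
cmp my_file.txt rejoined.txt && echo "identical"
```

For a real dictionary the line count would be much larger, for example several million lines per piece.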
