#can't import dictionaries into yomitan


lunar badge
#

I'm trying to set up yomitan on my desktop and I can't get half of the dictionaries I downloaded to import. When I try, I get a message saying "error: unknown error." All the dictionaries I'm trying to use are straight from the MarvNC yomichan dictionaries GitHub page, and I didn't have any issues using them on my laptop.

hearty shuttle
#

any chance you're out of storage or something

#

otherwise try reinstalling yomitan

lunar badge
#

No, I have plenty of storage but I can try reinstalling

lunar badge
#

Actually when I was messing with it last night I tried using yomitan on Chrome and brave browser

#

And had the same dictionaries produce the same error while importing so I don't think reinstalling will work

hearty shuttle
#

which dictionaries in specific?

#

did you download from the google drive folder at the top or from the direct links

#

for some of the dictionaries if you download them directly you need to unzip it first

#

for the google drive they should be good out of the gate

lunar badge
#

But I'll have to check which specifically when I get home

hearty shuttle
#

uhhh make an issue on the yomitan github that's weird

#

specify os, browser, files, etc

lunar badge
#

[Monolingual] 三省堂国語辞典 第八版 (Recommended), [Monolingual, Encyclopedia] Pixiv, [Freq] JPDB (Recommended), Freq.JPDB_2022-05-10T03_27_02.930Z, and Freq_CC100

#

were the ones I couldn't import

molten gyro
lunar badge
#

huh so it's not just me

#

i don't really ever use github. how do i make an issue?

#

oh i need an account

molten gyro
#

the only thing that comes to mind is that both [Freq] CC100 and [Freq] JPDB only have 1 "big" >10mb term_meta_bank_1.json
whereas other frequency dictionaries like the [Freq] BCCWJ-LUW are separated into 10+ small term_meta_bank_N.json files
feel free to ignore this remark if you believe its irrelevant to the current problem
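One way to test the "one big file" idea above is to split the single bank into several smaller ones, the way BCCWJ is laid out. A minimal sketch in Python, assuming the bank is a plain JSON array of entries and that the importer accepts files named term_meta_bank_1.json, term_meta_bank_2.json, and so on; `split_meta_bank`, `write_chunks`, and the chunk size of 10000 are made up for illustration, not part of yomitan:

```python
import json
from pathlib import Path


def split_meta_bank(entries, chunk_size):
    """Split a list of entries into consecutive chunks of at most chunk_size."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return [entries[i:i + chunk_size] for i in range(0, len(entries), chunk_size)]


def write_chunks(dict_dir, chunk_size=10000):
    """Rewrite dict_dir/term_meta_bank_1.json as term_meta_bank_1..N.json.

    The whole source file is read into memory before anything is written,
    so overwriting term_meta_bank_1.json with the first chunk is safe.
    Returns the number of files written.
    """
    dict_dir = Path(dict_dir)
    source = dict_dir / "term_meta_bank_1.json"
    entries = json.loads(source.read_text(encoding="utf-8"))
    chunks = split_meta_bank(entries, chunk_size)
    for n, chunk in enumerate(chunks, start=1):
        target = dict_dir / f"term_meta_bank_{n}.json"
        # ensure_ascii=False keeps the Japanese terms readable in the output.
        target.write_text(json.dumps(chunk, ensure_ascii=False), encoding="utf-8")
    return len(chunks)
```

Run `write_chunks("cc100")` on the extracted folder, then zip the folder contents again and retry the import. If the split version imports and the original doesn't, that points at file size rather than content.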

lunar badge
#

Sorry desktop is currently exploding

#

So this might be completely irrelevant

#

Alright it's alive again

molten gyro
#

only keep the words till 10k frequency

lunar badge
#

I did get bccwj to import

molten gyro
#

yes bccwj has no problem

lunar badge
molten gyro
#

extract cc100.zip

#

and replace the content of the term meta bank with what i sent and save (i just kept the first few entries)

#

then in your cc100 folder, select the 2 json files, right click on one of them, send to compressed folder, then try to import to yomitan
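The extract / trim / re-zip steps above can also be scripted instead of done by hand in notepad. A rough sketch, assuming the extracted folder holds exactly index.json and term_meta_bank_1.json (the "2 json files"), and that the entries in the bank are ordered from most to least frequent so that keeping the first N entries means keeping the top N words; `build_truncated_zip` is a hypothetical helper name:

```python
import json
import zipfile
from pathlib import Path


def build_truncated_zip(dict_dir, out_zip, keep):
    """Zip index.json plus only the first `keep` entries of the meta bank.

    The extracted files on disk are left untouched; the shortened bank is
    written straight into the archive. Returns the number of entries kept.
    """
    dict_dir = Path(dict_dir)
    bank = dict_dir / "term_meta_bank_1.json"
    entries = json.loads(bank.read_text(encoding="utf-8"))
    truncated = entries[:keep]
    with zipfile.ZipFile(out_zip, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        # Both files go at the root of the archive, not inside a subfolder.
        zf.write(dict_dir / "index.json", arcname="index.json")
        zf.writestr("term_meta_bank_1.json", json.dumps(truncated, ensure_ascii=False))
    return len(truncated)
```

For example, `build_truncated_zip("cc100", "cc100_10k.zip", 10000)` would produce a test archive with roughly the first 10k words, which is only meant for checking whether the import goes through.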

lunar badge
#

I mean like how do I open it

#

Notepad didn't go well

molten gyro
#

there shouldnt be any problem with notepad?

#

what does it say

lunar badge
#

It was just a brick wall of text

molten gyro
#

yes its normal

#

ctrl + a, delete all

#

then paste what i sent

lunar badge
#

Oh and just paste in what you sent

#

Alright

#

This is just to test the theory right?

#

I shouldn't actually use this dictionary after doing this

molten gyro
#

ye

#

its just the first 10k words atm

#

import it to yomitan

#

we can always delete the dictionary its not a problem

lunar badge
#

alright that worked

molten gyro
#

nice

#

the cc100 appears correctly right?

#

when scanning

lunar badge
#

in the little preview yeah

molten gyro
#

ok lets try with more words

lunar badge
#

it's showing yomu as 162

molten gyro
#

up to 40k frequency

lunar badge
#

seems to be working

molten gyro
#

type 禿 and try to scan it

lunar badge
#

btw the only freq dictionary i've used before was innocent corpus

#

do these work the other way around

#

as in lower number is more common for these?

molten gyro
#

162 for yomu means 162nd most common word in the language

#

small number = super common

lunar badge
#

ok so that's the opposite of innocent corpus

#

umm i can't seem to scan text unless i'm doing it wrong

#

but i can scan subtitles from asbplayer

molten gyro
#

do it in the yomichan dictionary

#

or type 禿 in google and hit enter

lunar badge
#

oh wait nvm

#

yeah i did it in google

molten gyro
#

just want to see if this word is indeed in the dictionary

#

what does cc100 say

lunar badge
#

not showing up

molten gyro
#

not that?

lunar badge
#

same with bccwj

#

at least not for the hage reading which i assume is what you want

#

oh yeah that reading shows up

#

but idk if i've heard that before

#

hage i've heard a bunch

molten gyro
#

idk that word either, its just the last word in the file i sent you xd

lunar badge
#

lol

molten gyro
#

to see if all words are covered

#

same thing

#

if nothing works you'll still have cc100 for the 40k most common words so thats still nice

lunar badge
#

shows up in bccwj but not cc100

molten gyro
#

send a screenshot plz

#

and you mean with the previous file i sent?

lunar badge
#

i mean for nicaragua

molten gyro
#

why would you scan nicaragua??

#

scan 禿

#

and send a screen

lunar badge
#

oh i thought that last message was you telling me to scan nicaragua lol

#

so you want me to repeat what i already did with the json but with this text file now

#

and then try scanning hage again

#

give me a sec

molten gyro
molten gyro
#

i didnt understand if cc100 worked or not

lunar badge
#

haven't tried yet

#

misunderstood what you wanted earlier

molten gyro
#

just scan 禿

#

and send a screenshot

lunar badge
#

huh definition 2 is bizarrely specific

molten gyro
#

did you import the cc100 dictionary

lunar badge
#

yeah after editing the json file

molten gyro
#

there was no error?

lunar badge
#

i tried using the second text file you sent and it errored out

#

so i went back to the first text file so i could scan hage

molten gyro
#

the first out of the 3 right

lunar badge
#

yeah

molten gyro
#

so the first one works but the second gives you "unknown error" while importing it?

lunar badge
#

oh wait i didn't even see the second one

#

the third one errored out

#

and it was not an unknown error

#

it was something like some term wasn't recognized

#

let me try it again

molten gyro
#

yes and send a screenshot of the error

lunar badge
molten gyro
#

oh thats my fault

molten gyro
lunar badge
#

forgot a bracket or something
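A hand-edited bank with a missing bracket can be caught before importing by running it through a JSON parser. A small sketch using Python's standard `json` module; `check_json` is just an illustrative name, and this only checks that the text is valid JSON, not that it matches whatever schema yomitan expects:

```python
import json


def check_json(text):
    """Return None if text parses as JSON, else a short note on where it failed."""
    try:
        json.loads(text)
    except json.JSONDecodeError as err:
        # lineno and colno point at the spot where the parser gave up,
        # which is usually at or just after the actual mistake.
        return f"line {err.lineno}, column {err.colno}: {err.msg}"
    return None
```

Reading the file with `open(path, encoding="utf-8").read()` and passing the result to `check_json` gives either `None` or a rough location to look at in the editor.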

molten gyro
#

i forgot i could have just sent you the zip folder

lunar badge
#

lol oh well

#

it imported

molten gyro
#

does cc100 work on yomu?

lunar badge
#

and kaburo shows up with a frequency of 39327

#

on cc100

molten gyro
#

good

#

well lets continue with json files if it works

lunar badge
#

just replace the current json with this one?

molten gyro
#

yes

#

its the full one actually

#

but do delete and replace it like we did earlier

#

i was thinking it could be a compression problem

lunar badge
#

got unknown error on that one

molten gyro
#

so maybe the size is the problem

#

half of the full dictionary

#

up to 80k

#

or maybe the problem is some weird character somewhere in the dictionary it bugs on

#

@lunar badge

lunar badge
#

it's importing

molten gyro
#

scan 岩手山

#

should give you 80422

lunar badge
#

didn't quite work

molten gyro
#

wait it didnt work for me either

#

we prob need a real entry in another dictionary

#

lemme try another word

#

@lunar badge try 立県

lunar badge
#

80821

#

yeah

molten gyro
#

send a screen please

#

we never know

lunar badge
molten gyro
#

gg

#

3/4th of the dictionary

#

up to 120k

lunar badge
#

👀

#

btw for freq dictionaries you leave the priority as 0 right?

#

or does it not matter

molten gyro
#

i have them set at 0

lunar badge
#

it imported

molten gyro
#

ok

lunar badge
#

thanks for all the help

molten gyro
#

scan 膝かけ

#

dw

#

im almost as clueless as you lol

lunar badge
#

i can't believe i didn't recognize that one

molten gyro
#

didnt do the core 1 million? smh

lunar badge
lunar badge
#

coincidentally

molten gyro
#

well then the problem must be within the last 40k words of the dictionary

#

and thats only for the cc100

#

not the other dictionaries

lunar badge
#

yeah

#

so you think it's a file size thing?

molten gyro
#

this or theres a weird character it doesnt like

#

maybe some rare kanji or idk

#

but those are just random guesses i know nothing of yomitan's code

lunar badge
#

also how is it that nobody told me i could yomichan words within yomichan

molten gyro
#

you didnt know?

#

its a setting

#

number of child popups

lunar badge
#

nah happened to see it just now while going through the settings

molten gyro
#

alas

lunar badge
#

but i asked about it a while back

#

the struggles of not reading

#

and depending on the kindness of strangers

molten gyro
#

@hearty shuttle
KangaJoo and I have been testing various things with the [Freq] CC100
If we only take 3/4 of the dictionary, i.e. the words up to 121824 frequency, he can import the dictionary and use it just fine, but he can't import the full one with words up to 160k frequency.
Maybe it's a size issue, or there's a weird character somewhere in the last quarter of the dictionary that causes this "unknown error", or maybe something else I don't know.
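The two guesses here (size vs. a weird character in the last quarter) could be narrowed down a bit with a script before filing the issue. A heuristic sketch, assuming each entry is a JSON array whose first element is the term string; what counts as "weird" is purely an assumption for illustration (control, format, private-use, unassigned, or surrogate code points, plus anything outside the Basic Multilingual Plane), not something known to trip up yomitan:

```python
import json
import unicodedata

# Unicode general categories treated as suspicious here (an assumption).
SUSPECT_CATEGORIES = {"Cc", "Cf", "Co", "Cn", "Cs"}


def suspect_chars(term):
    """Return the characters in term that match the heuristic above."""
    return [
        ch for ch in term
        if ord(ch) > 0xFFFF or unicodedata.category(ch) in SUSPECT_CATEGORIES
    ]


def inspect_tail(entries, start):
    """Report the UTF-8 size of entries[start:] and any flagged terms in it.

    Returns (size_in_bytes, [(index, term, [chars]), ...]).
    """
    tail = entries[start:]
    size_bytes = len(json.dumps(tail, ensure_ascii=False).encode("utf-8"))
    flagged = []
    for offset, entry in enumerate(tail):
        if entry and isinstance(entry[0], str):
            chars = suspect_chars(entry[0])
            if chars:
                flagged.append((start + offset, entry[0], chars))
    return size_bytes, flagged
```

Loading the full term_meta_bank_1.json and calling `inspect_tail(entries, 121824)` would cover the part that was cut off in the working 3/4 version. An empty flagged list doesn't rule out a content problem, but a non-empty one gives concrete entries to try removing.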

hearty shuttle
#

do you have really low ram or something?

#

the import code is apparently unoptimized

#

but it shouldn't be an issue if you have like

#

4gb at least

#

maybe even 2

#

for frequency dicts

#

but yeah i have no idea

#

best make an issue on github and the smart people can diagnose

molten gyro
#

yeah and do report that deleting the last quarter of cc100 dictionary makes it importable for some reason

lunar badge
#

This is a gaming desktop

#

Granted I built it ages ago but ram usage hasn't really gone up that much for games

hearty shuttle
#

wack

#

also can you open the console and screenshot whatever errors it throws while importing

lunar badge
#

Sorry my wife wanted to hang out for a bit

#

@hearty shuttle does this help?

hearty shuttle
#

ah just "unknown error" huh rip

#

thanks for sending the video

#

I'll make an issue later with this info

lunar badge
#

Thank you very much

#

If you need more screenshots or something let me know

hearty shuttle
#

oh wait

#

this is brave browser or something?

#

can you try chrome

#

ah you already did

lunar badge
#

yeah i actually started the process on chrome cause i heard it generally works better with yomitan and asbplayer

#

i have it set up on both right now

desert bay
#

Why don't you try chromium

#

Marmaduke ungoogled chromium

#

Works fine

#

Basically nothing can break this browser

lunar badge
#

Because I already had chrome

#

And didn't know about chromium's existence