FSRS Megathread | Anki | Page 11

unique salmon Apr 15, 2025, 2:36 PM

#

Too bad only, like, 4-5 people would enjoy it, cause barely anybody pays attention to this benchmark

polar maple Apr 15, 2025, 4:10 PM

#

maybe we can go lower than 0.5% RMSE (bins) by combining RWKV-P + RMSE-BINS-EXPLOIT

unique salmon Apr 15, 2025, 4:11 PM

#

polar maple maybe we can go lower than 0.5% RMSE (bins) by combining RWKV-P + RMSE-BINS-EXPL...

I don't count RMSE bins exploit because it's cheating

robust hill Apr 15, 2025, 4:11 PM

#

what if

#

lets see Expertium's algorithm metrics

polar maple Apr 15, 2025, 4:14 PM

#

unique salmon I don't count RMSE bins exploit because it's cheating

how is it cheating?

#

the problem is that we cannot exactly know if a good algorithm isn't cheating the metric a bit

unique salmon Apr 15, 2025, 4:17 PM

#

polar maple the problem is that we cannot exactly know if a good algorithm isn't cheating th...

For that one in particular - it uses the formulas for binning internally, which is obviously cheating

#

If an algorithm doesn't use our binning formulas internally, it's fair

polar maple Apr 15, 2025, 4:19 PM

#

unique salmon For that one in particular - it uses the formulas for binning internally, which ...

i don't see how being aware of a metric constitutes cheating in the same way that models like FSRS are aware of log loss and RMSE (bins) as well

#

FSRS is even directly optimized on RMSE (bins), direct cheating apparently

#

and when you guys find improvements in FSRS it is often in terms of % reduction in RMSE (bins), you are optimizing over the metric directly over time

unique salmon Apr 15, 2025, 4:22 PM

#

polar maple i don't see how being aware of a metric constitutes cheating in the same way tha...

models like FSRS are aware of log loss and RMSE (bins) as well
?
FSRS doesn't use the loss functions internally for it's own calculations

unique salmon Apr 15, 2025, 4:22 PM

#

polar maple and when you guys find improvements in FSRS it is often in terms of % reduction ...

Sure, but again, FSRS doesn't internally count bins using the binning formulas

#

And it doesn't internally keep track of log-loss

#

So it's fair

polar maple Apr 15, 2025, 4:23 PM

#

whatever algorithms do internally is their own business, we should make the metrics as robust as possible to anything that they try

robust hill Apr 15, 2025, 4:24 PM

#

💀

#

the algorithms have more privacy than me

polar maple Apr 15, 2025, 4:26 PM

#

i now believe that RMSE (bins) should just be removed everywhere, it doesn't have a nice interpretation so the only reason we still use it right now is no longer the case
https://github.com/ankitects/anki-manual/issues/368

#

@quasi shadow please give fsrs wiki modify permissions (I think i need permission to modify the page?), i'll write about RMSE (bins)

polar maple Apr 15, 2025, 4:32 PM

#

unique salmon And it doesn't *internally* keep track of log-loss

also disagreed on this exactly line, FSRS does compute log-loss when doing gradient descent

#

and at least on anki it does compute RMSE (bins) to make sure that the new parameters do better on RMSE (bins) than before

#

so it also does compute bins

#

its just not in the way that you consider cheating, but the fact is that it does compute the bins

unique salmon Apr 15, 2025, 4:36 PM

#

polar maple also disagreed on this exactly line, FSRS does compute log-loss when doing gradi...

The log-loss is calculated outside of FSRS, same goes for RMSE
RMSE-BINS-EXPLOIT calculates the bins inside the algorithm itself, hence cheating

ashen light Apr 15, 2025, 4:37 PM

#

this isn't loss and I am disappointed

unique salmon Apr 15, 2025, 4:37 PM

#

How about we skip the forum and just ask Dae directly to remove "Evaluate"? 🤔

polar maple Apr 15, 2025, 4:40 PM

#

unique salmon The log-loss is calculated outside of FSRS, same goes for RMSE RMSE-BINS-EXPLOIT...

did you make a mistake in the diagram? FSRS is cheating? not that I disagree by your standards!

unique salmon Apr 15, 2025, 4:40 PM

#

polar maple did you make a mistake in the diagram? FSRS is cheating? not that I disagree by ...

It's just two different alternatives

#

Feel free to replace "FSRS" with {algorithm_name}

#

The point is that if it doesn't calculate the loss internally, it can't be cheating

polar maple Apr 15, 2025, 4:41 PM

#

FSRS computes log loss when doing internal gradient descent and also RMSE (bins) when updating parameters, so it is definitely cheating

#

in the same way that you claim that RMSE (bins) is cheating

unique salmon Apr 15, 2025, 4:42 PM

#

polar maple FSRS computes log loss when doing internal gradient descent and also RMSE (bins)...

literally where

polar maple Apr 15, 2025, 4:42 PM

#

unique salmon literally where

literally gradient descent optimization

#

and also in anki, FSRS does not give new parameters unless it does better than the previous ones on RMSE (bins)

unique salmon Apr 15, 2025, 4:43 PM

#

polar maple literally gradient descent optimization

But it's not a part of how FSRS calculates DSR. Gradient descent is outside of the algorithm

polar maple Apr 15, 2025, 4:43 PM

#

unique salmon But it's not a part of how FSRS calculates DSR. Gradient descent is outside of t...

it is crazy to claim that gradient descent is outside of the algorithm of FSRS

unique salmon Apr 15, 2025, 4:44 PM

#

polar maple it is crazy to claim that gradient descent is outside of the algorithm of FSRS

Why?

polar maple Apr 15, 2025, 4:44 PM

#

it's the entire optimization process that FSRS uses

#

without optimization FSRS doesn't learn

unique salmon Apr 15, 2025, 4:50 PM

#

You cannot be serious man...
When FSRS calculates difficulty, it does not use log-loss/RMSE
When FSRS calculates stability, it does not use log-loss/RMSE
When FSRS calculates retrievability, it does not use log-loss/RMSE

polar maple Apr 15, 2025, 4:53 PM

#

unique salmon You cannot be serious man... When FSRS calculates difficulty, it does not use lo...

okay then, please remove 'optimize' from anki. FSRS now only has one set of parameters, if you ever want to optimize, you are no longer using FSRS

unique salmon Apr 15, 2025, 4:56 PM

#

polar maple okay then, please remove 'optimize' from anki. FSRS now only has one set of para...

Maaaaaaaaaaaaaan 😭😭😭😭😭
I'm just saying that RMSE-BINS-EXPLOIT is keeping track of the loss inside the algorithm itself, while FSRS doesn't. Hence why one is cheating and the other is not

polar maple Apr 15, 2025, 5:02 PM

#

unique salmon Maaaaaaaaaaaaaan 😭😭😭😭😭 I'm just saying that RMSE-BINS-EXPLOIT is keeping tr...

i mean that's my point, FSRS also needs to compute loss internally so this specific thing isn't cheating imo

robust hill Apr 15, 2025, 5:02 PM

#

noo do not remove optimize

#

i love optimize

polar maple Apr 15, 2025, 5:03 PM

#

to me RMSE-BINS-EXPLOIT isn't cheating and it shows that RMSE (bins) is unreliable

unique salmon Apr 15, 2025, 5:03 PM

#

robust hill i love optimize

Do you love Evaluate though?

polar maple Apr 15, 2025, 5:05 PM

#

@unique salmon how about we go back to plain old RMSE, no bins? nice and human interpretable

robust hill Apr 15, 2025, 5:06 PM

#

unique salmon Do you love Evaluate though?

YES

#

i love them all

unique salmon Apr 15, 2025, 5:06 PM

#

polar maple <@530106856593424407> how about we go back to plain old RMSE, no bins? nice and ...

It will be too similar to log-loss, both in terms of absolute values and in terms of being correlated with retention

polar maple Apr 15, 2025, 5:08 PM

#

fair enough, do we know if AUC is also correlated with retention? could show AUC instead of RMSE (bins)

unique salmon Apr 15, 2025, 5:09 PM

#

https://github.com/ankitects/anki/issues/3926
Just remove Evaluate

GitHub

Remove "Evaluate" · Issue #3926 · ankitects/anki

Dae, despite what the screenshot above shows, I think we should disregard that poll and remove "Evaluate" anyway. David agrees, btw. "Evaluate" gives the user a bunch of numbers...

#

I made an issue

robust hill Apr 15, 2025, 5:10 PM

#

nooo

#

i love it

unique salmon Apr 15, 2025, 5:10 PM

#

(screw forums because people on forums disagree with me 🤣)

robust hill Apr 15, 2025, 5:10 PM

#

evaluate makes me feel like a scientist

unique salmon Apr 15, 2025, 5:12 PM

#

polar maple fair enough, do we know if AUC is also correlated with retention? could show AUC...

Give me a moment

robust hill Apr 15, 2025, 5:48 PM

#

expertium if you delete evaluate button im going to delete you

#

no i wont mods dont punish me pls

unique salmon Apr 15, 2025, 5:53 PM

#

polar maple fair enough, do we know if AUC is also correlated with retention? could show AUC...

Ok, this was a huge pain, but here

#

#

#

Love me some AUC less than 0.5, lol

#

Love it when FSRS does worse than random 🤣

#

Zoomed in a bit

polar maple Apr 15, 2025, 6:07 PM

#

seems usable

#

if AUC is less than 0.5, turn off FSRS

#

@unique salmon btw ill take a look at nn D but i'll write my own code, won't make any promises on this

unique salmon Apr 15, 2025, 6:10 PM

#

polar maple if AUC is less than 0.5, turn off FSRS

Nah, reverse FSRS predictions
Whenever you see R=90%, treat it as 10%
Whenever you see R=10%, treat it as 90%

#

🤣

unique salmon Apr 15, 2025, 6:13 PM

#

polar maple <@530106856593424407> btw ill take a look at nn D but i'll write my own code, wo...

https://github.com/ankitects/anki/issues/3926
Wanna chime in here?

GitHub

Remove "Evaluate" (FSRS section) · Issue #3926 · ankitects/anki

Dae, despite what the screenshot above shows, I think we should disregard that poll and remove "Evaluate" anyway. David agrees, btw. "Evaluate" gives the user a bunch of numbers...

bold terrace Apr 15, 2025, 6:27 PM

#

I agree with @polar maple , if a metric is exploitable, you can defend an algorithm to not internally try to exploit it but by definition of trying to minimize it it might, a Neural Network could led to such cheating without being specially instructed to do it like this

#

But at the same time with log loss you would over privilege shorter interval precision no ? Since the vast majority of reviews are 1-20d

#

You could argue that then having better precision for that vast majority should be the goal though

bold terrace Apr 15, 2025, 6:31 PM

#

unique salmon (screw forums because people on forums disagree with me 🤣)

You’re a complete psycho 😅

unique salmon Apr 15, 2025, 6:32 PM

#

bold terrace I agree with <@142448513622605824> , if a metric is exploitable, you can defend ...

Alex made LSTM and another net, and in both cases RMSE and log-loss improved. We haven't seen a case where log-loss gets better or stays the same, but RMSE gets worse

#

That would be evidence in favor of NN cheating

unique salmon Apr 15, 2025, 6:32 PM

#

bold terrace You’re a complete psycho 😅

Please read what I wrote on Github

bold terrace Apr 15, 2025, 6:32 PM

#

unique salmon Please read what I wrote on Github

Which I did before writing this

#

Not actionnable ? People could based on it understand that something is wrong in their prediction so they can adjust that

#

They can also see if splitting différents deck in different preset help or not

unique salmon Apr 15, 2025, 6:33 PM

#

bold terrace Not actionnable ? People could based on it understand that something is wrong in...

Most users have no idea what the numbers mean

bold terrace Apr 15, 2025, 6:34 PM

#

So explain how to interpret those numbers

#

You’re going full dictatorship

#

Or just trolling but this is just ridiculous

unique salmon Apr 15, 2025, 6:35 PM

#

robust hill Apr 15, 2025, 6:36 PM

#

#

look at my fire rmse bins

unique salmon Apr 15, 2025, 6:37 PM

#

bold terrace I agree with <@142448513622605824> , if a metric is exploitable, you can defend ...

Btw, RMSE isn't used during optimization at all, it's only used after the optimization is finished. So a neural net couldn't "internalize" it anyway

robust hill Apr 15, 2025, 6:37 PM

#

well i only have 1 day of reviews so thats probably why i have .38% 💀

unique salmon Apr 15, 2025, 6:38 PM

#

bold terrace So explain how to interpret those numbers

Why do you hate regular users man...

#

And before you say "That is not what I said at all" - there is absolutely no way in hell we can make "Evaluate" intuitive

#

Like, none

#

The best we can do is "lower = better", which is what it already says

#

We could add the log-loss formula, but that would scare the average person even more

bold terrace Apr 15, 2025, 6:42 PM

#

unique salmon The best we can do is "lower = better", which is what it already says

Because you never cared to write something like “RMSE of 3% can be seen as having in average a normal range of 3% precision around your DR”

unique salmon Apr 15, 2025, 6:42 PM

#

Here's your intuitive log-loss, lol

bold terrace Apr 15, 2025, 6:42 PM

#

unique salmon Why do you hate regular users man...

Funny enough you’re the one saying they should not have to decide

unique salmon Apr 15, 2025, 6:43 PM

#

bold terrace Funny enough you’re the one saying they should not have to decide

I guarantee you if I polled r/Anki, most people would say that they don't know what to do with the numbers that "Evaluate" shows

bold terrace Apr 15, 2025, 6:43 PM

#

You’re playing the dumb troll but you know you bend truth to justify being smarter

#

You give people a “normal range of log loss” based on user in the 10k dataset and voila case solved

unique salmon Apr 15, 2025, 6:44 PM

#

...except that the benchmark uses a different procedure

bold terrace Apr 15, 2025, 6:44 PM

#

But you know that very well you’re just playing pretend

bold terrace Apr 15, 2025, 6:44 PM

#

unique salmon ...except that the benchmark uses a different procedure

Give them range based on the one from evaluate then

#

You’re just playing the “I know better than people so people should not be able to judge by themselves”

unique salmon Apr 15, 2025, 6:46 PM

#

bold terrace You’re just playing the “I know better than people so people should not be able ...

People neither want nor should have to decide this stuff. Users use Anki to review cards, not to tweak a bunch of abstract numbers

bold terrace Apr 15, 2025, 6:46 PM

#

unique salmon People neither want nor should have to decide this stuff. Users use Anki to revi...

What people want is what you get in your poll you decided to ignore

unique salmon Apr 15, 2025, 6:46 PM

#

The whole point of FSRS is (supposed to be) that it outsorces tweaking to the computer

bold terrace Apr 15, 2025, 6:47 PM

#

IMO you just get so attached to FSRS like it’s your baby that you just want to control it’s course

unique salmon Apr 15, 2025, 6:47 PM

#

bold terrace What people want is what you get in your poll you decided to ignore

Alright, fine, I'll make a poll on r/Anki and ask things like

Do you find "Evaluate" useful?
Do you know what the metrics mean?
Do you know what values would be considered "good" and what would be considered "too high"?

bold terrace Apr 15, 2025, 6:47 PM

#

You unilaterally create a request to do something most people asked by you, not to

#

It tells more about you than average users or than about me

bold terrace Apr 15, 2025, 6:49 PM

#

unique salmon Alright, fine, I'll make a poll on r/Anki and ask things like 1) Do you find "Ev...

Because you think your Reddit sects will abide to you

ashen light Apr 15, 2025, 6:49 PM

#

the evaluate button is literally useless

#

the numbers mean literally nothing to anyone except 5 people

bold terrace Apr 15, 2025, 6:49 PM

#

Or because you think redditors are smarter than the average user from Anki board ?

unique salmon Apr 15, 2025, 6:50 PM

#

bold terrace Or because you think redditors are smarter than the average user from Anki board...

No, because I think Redditors are dumber than the average power user from forums

bold terrace Apr 15, 2025, 6:50 PM

#

ashen light the numbers mean literally nothing to anyone except 5 people

So explain the number, vulgarize it, give examples of what healthy value can look

ashen light Apr 15, 2025, 6:51 PM

#

even if it is explaied, what actionable things can I do with that number?

#

like even if I knew exactly what the fuck "Log loss: 0.2826, RMSE(bins): 3.35%. " means, what do I do with that information

bold terrace Apr 15, 2025, 6:52 PM

#

ashen light even if it is explaied, what actionable things can I do with that number?

Reflect on things that could explain a not healthy value : hard that were again, deck that mix very different material, not enough card rated “good” that you didn’t know already acquired leading to too much optimistic stability …

unique salmon Apr 15, 2025, 6:52 PM

#

Btw jake, I'd appreciate it if you commented here or gave me a thumbs up, just so that Dae sees that it's more than me and David
https://github.com/ankitects/anki/issues/3926

GitHub

Remove "Evaluate" (FSRS section) · Issue #3926 · ankitects/anki

Dae, despite what the screenshot above shows, I think we should disregard that poll and remove "Evaluate" anyway. David agrees, btw. "Evaluate" gives the user a bunch of numbers...

ashen light Apr 15, 2025, 6:54 PM

#

bold terrace Reflect on things that could explain a not healthy value : hard that were again,...

what percentage of anki users, if this number was suddenly useful, would actually reflect on this

bold terrace Apr 15, 2025, 6:55 PM

#

ashen light what percentage of anki users, if this number was suddenly useful, would actuall...

If you’re telling me it’s better to have just people having screwed up parameters, then what will you do to help them? How do you ask them those parameters to help and educate them ?

ashen light Apr 15, 2025, 6:56 PM

#

" I think over more than a year of helping on r/Anki, "Evaluate" came in handy, like, once." (https://github.com/ankitects/anki/issues/3926) it sounds like this isn't even helpful

bold terrace Apr 15, 2025, 6:56 PM

#

FSRS would have to be removed too to be sure people are not screwed by it then, this is non sense

ashen light Apr 15, 2025, 6:56 PM

#

what imaginary problem will suddenly be resolved by the contents of the evaluate popup

unique salmon Apr 15, 2025, 6:57 PM

#

bold terrace If you’re telling me it’s better to have just people having screwed up parameter...

I'm not suggesting removing the parameter field

#

As I wrote, the parameter field can be useful

ashen light Apr 15, 2025, 6:58 PM

#

out of curiosity, has anyone outside of maybe that ismael guy actually posted truly shit numbers?

bold terrace Apr 15, 2025, 6:58 PM

#

Also I’m sorry but most of the time the “help” I got before diving in understanding FSRS was more or the time “you understand nothing” …

ashen light Apr 15, 2025, 6:58 PM

#

I'm pro hiding the parameter list too, maybe a "click here to copy info when asking for help" that puts it all in the clipboard for debugging help

bold terrace Apr 15, 2025, 6:59 PM

#

Comparing my DR to my Actual Retention ? Being said I can’t …

unique salmon Apr 15, 2025, 6:59 PM

#

ashen light out of curiosity, has anyone outside of maybe that ismael guy actually posted tr...

Yeah, I once saw a guy who used "Remedy Hard Misuse" and didn't optimize afterwards, he had RMSE=20% or 30% or something like that

unique salmon Apr 15, 2025, 6:59 PM

#

bold terrace Comparing my DR to my Actual Retention ? Being said I can’t …

?
Who said that?

ashen light Apr 15, 2025, 7:00 PM

#

anyway from what I understand the actual argument is not to actually remove evaluate, but to put it somewhere that downplays its importance/relevance

#

(at least that seems to be david's idea)

#

I think the preset ui is just really bad at containing weird niche options

#

so everything just has the same weight to it no matter the importance

unique salmon Apr 15, 2025, 7:01 PM

#

ashen light anyway from what I understand the actual argument is not to actually remove eval...

That would be nice, but I'm not sure how to do that in practice
We can't put it in "Advanced", that would be to akward + hard to make it clear that it's related to FSRS

ashen light Apr 15, 2025, 7:01 PM

#

IF IT WERE ME: multiple setting categories: core, extra, fringe

#

this is peak 'fringe' category

bold terrace Apr 15, 2025, 7:01 PM

#

unique salmon ? Who said that?

https://forums.ankiweb.net/t/with-fsrs-5-allowing-new-reviews-before-next-day-to-increase-average-retention/52019/12

Anki Forums

With FSRS 5, Allowing new Reviews before next day to increase avera...

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

#

unique salmon Apr 15, 2025, 7:02 PM

#

ashen light IF IT WERE ME: multiple setting categories: core, extra, fringe

I have proposed having two layouts, Beginner and Pro. Dae was not very fond of that

bold terrace Apr 15, 2025, 7:03 PM

#

ashen light Apr 15, 2025, 7:03 PM

#

this isn't about beginner/pro, this is about multiple tabs that may have the same categories but with different tiers of dumbshit

bold terrace Apr 15, 2025, 7:03 PM

#

“You’re so wrong your beyond the point of being helped”

ashen light Apr 15, 2025, 7:03 PM

#

two uis are a mess, but this is just one ui

#

¯_(ツ)_/¯

ashen light Apr 15, 2025, 7:03 PM

#

bold terrace “You’re so wrong your beyond the point of being helped”

maybe you shoulda posted your log loss

unique salmon Apr 15, 2025, 7:03 PM

#

https://forms.gle/EyJpGmpR6M8JAFGy6
I will post this on r/Anki tomorrow

Google Docs

FSRS and "Evaluate"

ashen light Apr 15, 2025, 7:03 PM

#

that might have solved everything

bold terrace Apr 15, 2025, 7:04 PM

#

ashen light maybe you shoulda posted your log loss

I was simply explaining how the Average Predicted R could be lower than DR in case where a lot of new cards per day was introduced

#

But I was so wrong it was difficult to explain it to me

ashen light Apr 15, 2025, 7:04 PM

#

just be less wrong yo

#

I bet rmse bins could solve this

unique salmon Apr 15, 2025, 7:05 PM

#

...and then I am called a troll 😅

bold terrace Apr 15, 2025, 7:05 PM

#

Then the same people are taken as example as helping others, while their just getting their ego dose of feeling smarter than everyone

ashen light Apr 15, 2025, 7:06 PM

#

all this talk yet no real world use of rmse bins found

bold terrace Apr 15, 2025, 7:06 PM

#

ashen light all this talk yet no real world use of rmse bins found

I gave 3 practical actions

#

Unfortunately those are ignored I guess

ashen light Apr 15, 2025, 7:06 PM

#

they will be ignored by literally every anki user

#

I was roleplaying the average person using this app

unique salmon Apr 15, 2025, 7:06 PM

#

ashen light all this talk yet no real world use of rmse bins found

Hey, it's useful in the benchmark. We need a number that can get close to 0 🤣

bold terrace Apr 15, 2025, 7:07 PM

#

ashen light I was roleplaying the average person using this app

Pretending to know every people is also not that great when we have not even a clue of the percentage of user using FSRS in the first place

ashen light Apr 15, 2025, 7:07 PM

#

"I have to read the manual to understand what this number means and what to do with it? FINALLY I LOVE THIS FEATURE" - sound, probably

bold terrace Apr 15, 2025, 7:08 PM

#

ashen light "I have to read the manual to understand what this number means and what to do w...

More like “why do I have a retention 10% Lower than expected if the gospel was to enable FSRS and my live would be better than with SM2”

ashen light Apr 15, 2025, 7:09 PM

#

@bold terrace did you move to sm2?

bold terrace Apr 15, 2025, 7:09 PM

#

Followed by “you’re really a geek to have enabled FSRS and complain about that difference “

unique salmon Apr 15, 2025, 7:09 PM

#

bold terrace Pretending to know every people is also not that great when we have not even a c...

Rough approximation (this is from a year ago)

bold terrace Apr 15, 2025, 7:09 PM

#

unique salmon Rough approximation (this is from a year ago)

Anki is a niche

#

unique salmon Apr 15, 2025, 7:09 PM

#

Well, I can't ask random Anki users, so this is the best I've got

ashen light Apr 15, 2025, 7:09 PM

#

@bold terrace maybe accept you're in the 0.001% of people in the dataset that is better served by sm2 and use it instead 🍃

#

can't dae like....scan all the data on ankiweb for decks that have the fsrs feature enabled?

unique salmon Apr 15, 2025, 7:10 PM

#

ashen light can't dae like....scan all the data on ankiweb for decks that have the fsrs feat...

That's a good question, actually

bold terrace Apr 15, 2025, 7:11 PM

#

#

500k view

ashen light Apr 15, 2025, 7:11 PM

#

@bold terrace can you do an experiment for science and use sm2 for a few months?

bold terrace Apr 15, 2025, 7:11 PM

#

The guy still tweak SM2

ashen light Apr 15, 2025, 7:11 PM

#

see how much better your numbers are?

#

it'll be a nice data point I think

bold terrace Apr 15, 2025, 7:12 PM

#

ashen light <@304669962608443402> maybe accept you're in the 0.001% of people in the dataset...

FSRS works great once you have some grasp on how it works, thus why I’m against removing evaluate and instead help people interpreting those values

unique salmon Apr 15, 2025, 7:12 PM

#

https://forms.gle/EyJpGmpR6M8JAFGy6
I added a new question

Google Docs

FSRS and "Evaluate"

ashen light Apr 15, 2025, 7:13 PM

#

I have zero grasp on how it works and I just chose to not care to think about it and I'm doin fine

#

¯_(ツ)_/¯

bold terrace Apr 15, 2025, 7:14 PM

#

Because you’re using Anki and developing in Anki for years 🤷

ashen light Apr 15, 2025, 7:15 PM

#

that is quite an overstatement of my experience

#

fsrs didn't even exist the previous time I was using anki

bold terrace Apr 15, 2025, 7:17 PM

#

#

Funny enough Anki is in a boom

#

SM2 is 6 times bigger than FSRS for whatever that means

ashen light Apr 15, 2025, 7:17 PM

#

shitpost: reminder than sm is currently at version 11 and sm2 is basically 20 years old

#

🍃

unique salmon Apr 15, 2025, 7:18 PM

#

ashen light shitpost: reminder than sm is currently at version 11 and sm2 is basically 20 ye...

17 🤓

#

Actually, no, 18

#

Actually, wait...

#

gimme a sec

#

I'm unsure if there exists SM-19 or no

#

SM-18 is definitely a thing

ashen light Apr 15, 2025, 7:19 PM

#

man I am OUTDATED

unique salmon Apr 15, 2025, 7:19 PM

#

bold terrace SM2 is 6 times bigger than FSRS for whatever that means

Well, FSRS has been around for less than two years, so it's not super surprising

ashen light Apr 15, 2025, 7:20 PM

#

also how many "sm2" searches are looking for super mario 2

#

a nonzero amount thats for sure

bold terrace Apr 15, 2025, 7:20 PM

#

I’m going a bit in the Ad Hominem territory but sometimes I feel Anki is not always used by the people to actually learn the stuff they want to learn but more because it became almost its own thing 😅

bold terrace Apr 15, 2025, 7:20 PM

#

ashen light also how many "sm2" searches are looking for super mario 2

I got Spiderman 2 in the results

unique salmon Apr 15, 2025, 7:21 PM

#

There is SuperMemo 19 (software), but unclear if it's using a new algo

#

I'll take that as "SM-18 is the latest algo"

unique salmon Apr 15, 2025, 7:22 PM

#

unique salmon https://forms.gle/EyJpGmpR6M8JAFGy6 I added a new question

Btw, I deleted (I assume) Jake's response because of the addition of another question 😅

ashen light Apr 15, 2025, 7:23 PM

#

nooooooooo now I have to fill it out again

bold terrace Apr 15, 2025, 7:23 PM

#

unique salmon Apr 15, 2025, 7:23 PM

#

bold terrace

It sure as heck can, lol

ashen light Apr 15, 2025, 7:23 PM

#

fsrs needs priorities

#

so it can be compared to sm18

unique salmon Apr 15, 2025, 7:24 PM

#

The issue is that nobody is going to give us data

#

Jarrett barely scrambled 16 or so collections of SM users

bold terrace Apr 15, 2025, 7:24 PM

#

Yeah

ashen light Apr 15, 2025, 7:24 PM

#

why has no one reverse engineered any of the sms past like 2

bold terrace Apr 15, 2025, 7:24 PM

#

On that note at least FSRS community is open about data

unique salmon Apr 15, 2025, 7:24 PM

#

ashen light why has no one reverse engineered any of the sms past like 2

Apparently a long time ago Anki devs tried, but gave up

ashen light Apr 15, 2025, 7:25 PM

#

back in the dark ages of sm-5

unique salmon Apr 15, 2025, 7:25 PM

#

Also, FSRS sorta-kinda counts as "reverse engineered SM"

unique salmon Apr 15, 2025, 7:26 PM

#

bold terrace

Priorities don't affect the calculation of the probability of recall though, unless I misunderstand how SuperMemo works

ashen light Apr 15, 2025, 7:26 PM

#

@bold terrace would priorities solve your problem

bold terrace Apr 15, 2025, 7:27 PM

#

#

Holy cow

#

There’s worst overthinkers out there

#

Using SM to forget things

unique salmon Apr 15, 2025, 7:27 PM

#

https://supermemo.guru/wiki/SuperMemo_Guru
Woz wrote a ton of stuff

SuperMemo Guru

#

From articles about algorithms and math to...uhhh...some vague stuff about the brain of a certain political leader and about Elongated Muskrat

#

He's a bit of an odd fella

ashen light Apr 15, 2025, 7:29 PM

#

"can supermemo be used to forget things"

bold terrace Apr 15, 2025, 7:29 PM

#

Investing and Vtubers too ?

unique salmon Apr 15, 2025, 7:29 PM

#

bold terrace Investing and Vtubers too ?

Lol

bold terrace Apr 15, 2025, 7:29 PM

#

ashen light "can supermemo be used to forget things"

Yeah that one caught my eyes too

ashen light Apr 15, 2025, 7:30 PM

#

I read the list before I realized you ALSO pointed that one out

#

I mean techincally I'm using anki for that purpose

#

the idea is to just fill my brain with so much garbage it pushes other things out

#

I realized I was spending way too much on alcohol

#

and anki seemed like a cost effective replacement long-term

bold terrace Apr 15, 2025, 7:31 PM

#

ashen light I realized I was spending way too much on alcohol

By the bins ?

ashen light Apr 15, 2025, 7:31 PM

#

I'm a craft beer weenie

unique salmon Apr 15, 2025, 7:31 PM

#

bold terrace By the bins ?

RMSE (bins)

polar maple Apr 15, 2025, 7:34 PM

#

personally i get a little dopamine rush when i see that parameters have changed and also the metrics look better

ashen light Apr 15, 2025, 7:35 PM

#

I think anki needs more visual flair, like you hit the optimzie button and theres a graphic that shows the numbers moving

#

animated bar graphs going up or down

unique salmon Apr 15, 2025, 7:36 PM

#

ashen light I think anki needs more visual flair, like you hit the optimzie button and there...

A bar that represents RMSE getting shorter

unique salmon Apr 15, 2025, 7:36 PM

#

ashen light animated bar graphs going up or down

yep yep

#

Adds zero utility, but at least it's fun 🤣

#

Actually, nvm, it would be a nightmare if you have 20 presets and use "Optimize all presets"

#

Would be like getting ads

ashen light Apr 15, 2025, 7:37 PM

#

nah its one big set of graphs all animating at once

#

this is how we get the young users and vc funding

#

brb gonna make my own anki

#

called wanki

#

with this shit

#

it'll be great

unique salmon Apr 15, 2025, 7:38 PM

#

DerIshmaelite watching 260 graphs going down

bold terrace Apr 15, 2025, 7:38 PM

#

Wanna attract more people ? Preinstall decks 😂

#

Things like yomitan or Anki where you have to install external decks or dictionaries is a no go for most people

polar maple Apr 15, 2025, 7:39 PM

#

unique salmon Alex made LSTM and another net, and in both cases RMSE and log-loss improved. We...

if an nn like RWKV were optimized on RMSE (bins) as the optimization metric then it would likely be able to reverse engineer RMSE-BINS-EXPLOIT

unique salmon Apr 15, 2025, 7:39 PM

#

Oh, yeah, configuring dictionaries for yomitan is a huge filter

ashen light Apr 15, 2025, 7:40 PM

#

bold terrace Wanna attract more people ? Preinstall decks 😂

sponsored preinstalled decks? vc will love this

bold terrace Apr 15, 2025, 7:40 PM

#

Anki feels quite hacky

#

I mean configuring the css of your card …

ashen light Apr 15, 2025, 7:40 PM

#

coca cola presents: basic spanish vocab

bold terrace Apr 15, 2025, 7:41 PM

#

Defining the data type of your card fields …

#

Even the difference between a card and a note

#

I know more or less FSRS but I’m still afraid of using “cloze”

unique salmon Apr 15, 2025, 7:41 PM

#

polar maple if an nn like RWKV were optimized on RMSE (bins) as the optimization metric then...

Good thing we don't do that 🤣
Ok, ngl, you are kinda convincing me to remove it from the benchmark

unique salmon Apr 15, 2025, 7:42 PM

#

bold terrace I know more or less FSRS but I’m still afraid of using “cloze”

https://tenor.com/view/huh-billie-eilish-really-sure-gif-13669711

Tenor

bold terrace Apr 15, 2025, 7:42 PM

#

Let’s documente cards notes and cloze with an UML diagrams

unique salmon Apr 15, 2025, 7:42 PM

#

unique salmon Good thing we don't do that 🤣 Ok, ngl, you are kinda convincing me to remove i...

Well, technically you can't anyway - it's not differentiable

#

RMSE (bins) I mean

ashen light Apr 15, 2025, 7:43 PM

#

on a more serious note: the fact that you gotta use a website then copypaste like a number to get decks/addons is mega jank

#

like how is that not built in still

polar maple Apr 15, 2025, 7:43 PM

#

unique salmon Well, technically you can't anyway - it's not differentiable

it is differentiable, the bins are fixed items

unique salmon Apr 15, 2025, 7:43 PM

#

ashen light on a more serious note: the fact that you gotta use a website then copypaste lik...

I actually think it's neat. No unpacking files and copy-pasting them into folders, just a single number and Anki does the rest

ashen light Apr 15, 2025, 7:44 PM

#

bold terrace I mean configuring the css of your card …

this is I think the most reasonable thing anki does in this category of things

ashen light Apr 15, 2025, 7:44 PM

#

unique salmon I actually think it's neat. No unpacking files and copy-pasting them into folder...

I meant the browser should be in-client

bold terrace Apr 15, 2025, 7:44 PM

#

ashen light this is I think the most reasonable thing anki does in this category of things

Ok but no average user do css

#

Some graphical editor would help many people

ashen light Apr 15, 2025, 7:44 PM

#

bold terrace Ok but no average user do css

how do you propose it be done

bold terrace Apr 15, 2025, 7:44 PM

#

WYSIWYG

unique salmon Apr 15, 2025, 7:44 PM

#

polar maple it is differentiable, the bins are fixed items

maybe i am stoopid

ashen light Apr 15, 2025, 7:45 PM

#

@bold terrace glad to hear you are working on this feature I expect to see a PR soon

polar maple Apr 15, 2025, 7:45 PM

#

RMSE (bins) can be made to be uncheatable by actually 5-way splitting it properly, but it would no longer make sense to use it on algorithms that adapt on the fly like RWKV

unique salmon Apr 15, 2025, 7:45 PM

#

ashen light how do you propose it be done

Add an AI that writes CSS based on user input

bold terrace Apr 15, 2025, 7:45 PM

#

ashen light <@304669962608443402> glad to hear you are working on this feature I expect to s...

Tomorrow by 8AM

ashen light Apr 15, 2025, 7:46 PM

#

but think of the tryhards who don't want a wysiwyg experience

#

I need layers of js in my templates and dae refused to let me add a separate scriping field

bold terrace Apr 15, 2025, 7:47 PM

#

ashen light but think of the tryhards who don't want a wysiwyg experience

I mean having both WYSIWYG and markup is not that rare

ashen light Apr 15, 2025, 7:48 PM

#

ui complexity oh nooooooo

bold terrace Apr 15, 2025, 7:48 PM

#

Reddit, old bbcode forums, …

#

Two tabs, easy win

ashen light Apr 15, 2025, 7:48 PM

#

ui complexityyyyyyyyyy

unique salmon Apr 15, 2025, 7:48 PM

#

I'm trying to visualize how to hide "Evaluate" in away that isn't jank and doesn't require lots of scrolling

bold terrace Apr 15, 2025, 7:48 PM

#

Come on

ashen light Apr 15, 2025, 7:48 PM

#

but now we got tabs and css radio buttons

ashen light Apr 15, 2025, 7:49 PM

#

unique salmon I'm trying to visualize how to hide "Evaluate" in away that isn't jank and doesn...

just make the font smaller

unique salmon Apr 15, 2025, 7:49 PM

#

kek

ashen light Apr 15, 2025, 7:49 PM

#

maybe opacity: 60%

#

possibly more seriously: is there a good drop-in wsywig

bold terrace Apr 15, 2025, 7:50 PM

#

I mean

bold terrace Apr 15, 2025, 7:50 PM

#

ashen light ui complexityyyyyyyyyy

#

Come on

#

Be slightly honest for one minute

#

Preview/Markup, UI Complexity compared to this ?

ashen light Apr 15, 2025, 7:51 PM

#

where even is that

bold terrace Apr 15, 2025, 7:51 PM

#

come on

#

You can do a bit better can’t you

bold terrace Apr 15, 2025, 7:51 PM

#

ashen light where even is that

Anki IOS

ashen light Apr 15, 2025, 7:51 PM

#

I mean thats peak ui performance right theer

unique salmon Apr 15, 2025, 7:51 PM

#

ashen light maybe `opacity: 60%`

bold terrace Apr 15, 2025, 7:51 PM

#

The only one that you need to pay

ashen light Apr 15, 2025, 7:51 PM

#

oh I don't use ios

#

cause I don't pay for things

ashen light Apr 15, 2025, 7:52 PM

#

unique salmon

ship it

bold terrace Apr 15, 2025, 7:52 PM

#

unique salmon

Pure art

#

unique salmon Apr 15, 2025, 7:54 PM

#

You know what would make it even better? Smaller button + less opacity

#

You know what would make it even better? Make it even smaller and decrease opacity further!

#

Perfection

ashen light Apr 15, 2025, 7:55 PM

#

[insert that vince mcmahon meme here using these images]

unique salmon Apr 15, 2025, 7:56 PM

#

ashen light cause I don't pay for things

bold terrace Apr 15, 2025, 7:57 PM

#

Don’t know if I’ll be able to use Anki with CMRR outliving Evaluate

#

At least get rid of that with it

unique salmon Apr 15, 2025, 7:57 PM

#

bold terrace Don’t know if I’ll be able to use Anki with CMRR outliving Evaluate

he doesn't know

#

Jarrett wants to remove CMRR

bold terrace Apr 15, 2025, 7:57 PM

#

The fact it will be be improved or removed ?

unique salmon Apr 15, 2025, 7:57 PM

#

And I am like "LUUUUC! SAVE US!"

unique salmon Apr 15, 2025, 7:58 PM

#

bold terrace The fact it will be be improved or removed ?

Jarrett: cmrr bad
Me: cmrr need realism. luc make cmrr unsuck

bold terrace Apr 15, 2025, 7:58 PM

#

unique salmon Jarrett: cmrr bad Me: cmrr need realism. luc make cmrr unsuck

You see how it’s frustrating to have people solving issues by removing unilaterally things right ?

unique salmon Apr 15, 2025, 7:59 PM

#

Except that I can see the benefits of CMRR, but have to think of rare edge cases to maybe come up with some questionable benefits of Evaluate

ashen light Apr 15, 2025, 7:59 PM

#

the people involved think its easier to remove than fix

#

unless you're gonna do the work making it better, its probably gone

bold terrace Apr 15, 2025, 7:59 PM

#

unique salmon Except that I can see the benefits of CMRR, but have to think of rare edge cases...

So in the end everything is about how YOU see it 🥲

unique salmon Apr 15, 2025, 8:00 PM

#

bold terrace So in the end everything is about how YOU see it 🥲

We'll see how my Reddit poll turns out

bold terrace Apr 15, 2025, 8:00 PM

#

Ad hominem but maybe changing that could help you bonding with real people 😅

unique salmon Apr 15, 2025, 8:00 PM

#

Aka we'll see what the average user thinks

ashen light Apr 15, 2025, 8:00 PM

#

I think we burn down all metrics, remove everything except "enable fsrs" and an "optimize" button

unique salmon Apr 15, 2025, 8:00 PM

#

Or at least the average r/Anki guy

unique salmon Apr 15, 2025, 8:00 PM

#

ashen light I think we burn down all metrics, remove everything except "enable fsrs" and an ...

I think we burn down all metrics, remove everything except "enable fsrs" ~~and an "optimize" button~~

ashen light Apr 15, 2025, 8:00 PM

#

maybe "optmize" and "optimize with reschedule" and get rid of that stupid fucking toggle

unique salmon Apr 15, 2025, 8:00 PM

#

Actually, you forgot DR

#

Optimization can be made automatic

#

Well...uhhh...

#

😅

ashen light Apr 15, 2025, 8:01 PM

#

how do you handle auto optimize and rescheduling on optimzie

#

auto-optimize to me implies no rescheduling

unique salmon Apr 15, 2025, 8:02 PM

#

ahem
Anyway, this is peak
(David's pic btw)

ashen light Apr 15, 2025, 8:02 PM

#

100% agree

#

those nerd numbers can go be an addon

unique salmon Apr 15, 2025, 8:02 PM

#

unique salmon *ahem* Anyway, this is peak (David's pic btw)

its-beautiful-ive-looked-at-this-for-five-hours-now-aziz-v0-18moozyb5s2c1.png

bold terrace Apr 15, 2025, 8:03 PM

#

TBH I wouldn’t mind those options to be in FSRS helper and leave the normal use case being DR only

ashen light Apr 15, 2025, 8:03 PM

#

https://i.imgflip.com/9qx5v1.jpg anyway I quickly used some garbage website to make this garbage meme, its not even high enough res to see anything and I didn't even have a 4th image

bold terrace Apr 15, 2025, 8:03 PM

#

I’m against removing evaluate but moving it I don’t mind

ashen light Apr 15, 2025, 8:03 PM

#

I was hitting "I'm spending too much time on this" territory and so you get this half-finished thing

bold terrace Apr 15, 2025, 8:03 PM

#

But moving it would mean moving it with the optimize

ashen light Apr 15, 2025, 8:04 PM

#

actually david's image should be in the last slot

unique salmon Apr 15, 2025, 8:04 PM

#

bold terrace I’m against removing evaluate but moving it I don’t mind

Btw, I expanded my Github comment a bit

Moving "Evaluate" somewhere else is nice in theory, but I can't think of a good implementation. If we put it in "Advanced", it will be awkward (scrolling back and forth between sections) and unclear that this button is related to FSRS in the first place, unless it says "Evaluate FSRS parameters" or something. Maybe we could collapse it, but again, I can't visualize a good implementation.

ashen light Apr 15, 2025, 8:05 PM

#

https://i.imgflip.com/9qx6qt.jpg I must have nothing to do today wow

#

if only it had enough pixels to actually see anything

bold terrace Apr 15, 2025, 8:15 PM

#

BTW, top comments on this Why you still use SM2 ?

https://www.reddit.com/r/Anki/comments/1h2k4m2/to_people_still_using_sm2_instead_of_fsrs_why/

I prefer to have a bit more control
I don't want to believe what strangers tell you as it is
Happy with retention in real-world, and not as percentage
...

Response from FSRS community : "Make it more blackboxy !" laughcry

From the Anki community on Reddit

Explore this post and more from the Anki community

#

While someone still has to pay its rent with his anki videos

unique salmon Apr 15, 2025, 8:17 PM

#

Ok but this kinda true

bold terrace Apr 15, 2025, 8:17 PM

#

Oh come on

#

you could have screenshoted one of your 9+ comments

#

A seamless experience, smooth and with no bad surprise

#

Oopsi

unique salmon Apr 15, 2025, 8:19 PM

#

Now THIS is the average Anki user

#

The perfect specimen

bold terrace Apr 15, 2025, 8:19 PM

#

unique salmon Now THIS is the average Anki user

Yeah that one I agree

#

When you think about i

#

I knew Anki through r/LearnJapanese

unique salmon Apr 15, 2025, 8:20 PM

#

bold terrace - https://www.reddit.com/r/Anki/comments/1bwxd22/even_with_retention_rate_set_to...

Yeah, TR<DR is something that has been popping up with alarming frequency

#

I have no idea why, honestly

bold terrace Apr 15, 2025, 8:20 PM

#

but actively following r/Anki is not something you do as an average guy

unique salmon Apr 15, 2025, 8:20 PM

#

I mean, I can think of a few explanations, such as not optimizing parameters and Hard misuse, but still

polar maple Apr 15, 2025, 8:21 PM

#

but who would complain about TR>DR?

bold terrace Apr 15, 2025, 8:22 PM

#

unique salmon Yeah, TR<DR is something that has been popping up with alarming frequency

I think Jarrett nailed it : Hyrum's Law. Through SM2, since there is no concept of "DR" or "R threshold", you reset cycles at every lapse, with always-decreasing intervals : People get higher and higher retention through lapses. Basically, even if SM2 does not specify R(Mature) > R(Younger), it is the side effect of its implementation
https://www.hyrumslaw.com/

unique salmon Apr 15, 2025, 8:22 PM

#

polar maple but who would complain about TR>DR?

Eh, I've seen a few people, though they were more confused rather than sad/angry

cursive badge Apr 15, 2025, 8:22 PM

#

I've only just caught up 😅
Evaluate can be useful to decide if splitting presets is a good idea. You don't need to understand what the numbers mean, just before > after = good
You might be able to make nice UI that tells you without showing the numbers, but until then having the numbers is useful.

bold terrace Apr 15, 2025, 8:23 PM

#

FSRS on the other hand is "expected retention"-based. You don't get "higher prediction" at every lapse per se, you might get higher D and lower I, but the prediction will still be a constant DR (not increasing R)

bold terrace Apr 15, 2025, 8:24 PM

#

cursive badge I've only just caught up 😅 Evaluate can be useful to decide if splitting prese...

You're too precious for this world ❤️

#

I wanted to switch one deck to SM2, and one to FSRS, but it's a global setting

#

so I need 2 user profiles

#

and I'm a bit lazy for that now

#

And IMO FSRS is superior at both task (precision AND increasing retention) if you know how to tackle it (which is extremely simple : Increase your DR, wether globally through Deck Options, or through Filtered Decks)

unique salmon Apr 15, 2025, 8:26 PM

#

Time for another spaced repetition civil war

bold terrace Apr 15, 2025, 8:26 PM

#

SM2 is "let's calibrate your ease factor for each card" which is less efficient than letting FSRS guess the profile of your card in 4-5 reviews

unique salmon Apr 15, 2025, 8:26 PM

#

https://tenor.com/view/peeporiot-peeporiot-havi-gif-23057486

Tenor

bold terrace Apr 15, 2025, 8:27 PM

#

And I know I can sound contradicting, but my point is : FSRS is truely the way forward, but we just need a way to hold the hand of the user

unique salmon Apr 15, 2025, 8:27 PM

#

Remove Evaluate side: me, David, sorata, jake
Keep Evaluate side: Sound, Danika, rossgb

bold terrace Apr 15, 2025, 8:27 PM

#

Seems you forgot @cursive badge , and all people in the Anki forum that were too dumb for this poll 1h ago 😄

unique salmon Apr 15, 2025, 8:28 PM

#

bold terrace Seems you forgot <@347088848854974465> , and all people in the Anki forum that w...

I edited my comment just before you made yours 😅

ashen light Apr 15, 2025, 8:28 PM

#

ultimately none of our opinions matter on the topic

#

since daes gonna do what daes gonna do

unique salmon Apr 15, 2025, 8:29 PM

#

Dae giveth and Dae taketh away

bold terrace Apr 15, 2025, 8:29 PM

#

ashen light since daes gonna do what daes gonna do

I ain't goin to blame him

cursive badge Apr 15, 2025, 8:29 PM

#

Well maybe I'm going to make my own SRS app with blackjack and hookers 😝

bold terrace Apr 15, 2025, 8:30 PM

#

cursive badge Well maybe I'm going to make my own SRS app with blackjack and hookers 😝

What about a clean and simple architecture 🥲 ?

ashen light Apr 15, 2025, 8:30 PM

#

cursive badge Well maybe I'm going to make my own SRS app with blackjack and hookers 😝

hey me too

cursive badge Apr 15, 2025, 8:30 PM

#

And 0 users because I'll probably get bored and go back to Anki ;p

ashen light Apr 15, 2025, 8:30 PM

#

how would you make anki clean and simple

unique salmon Apr 15, 2025, 8:30 PM

#

Unironically, why not just keep "Evaluate" in the Helper Add-on?

ashen light Apr 15, 2025, 8:31 PM

#

unique salmon Unironically, why not just keep "Evaluate" in the Helper Add-on?

this is what I'm saying

bold terrace Apr 15, 2025, 8:31 PM

#

ashen light how would you make anki clean and simple

No 50 languages meshed together

#

Maybe a full js memory-vore thing

ashen light Apr 15, 2025, 8:31 PM

#

thats like....the least complicated thing about it

bold terrace Apr 15, 2025, 8:31 PM

#

Or some kind of full-python stuff

unique salmon Apr 15, 2025, 8:31 PM

#

We could just hide the button, but keep the underlying code to be accessed by the add-on

ashen light Apr 15, 2025, 8:31 PM

#

anki used to be 90% python

bold terrace Apr 15, 2025, 8:31 PM

#

The rust-cool boys entered the chat ?

#

Rust sounds super super cool

cursive badge Apr 15, 2025, 8:31 PM

#

I still dream of WASM/JS runtime.

ashen light Apr 15, 2025, 8:31 PM

#

rust was done to allow the code to be used on all platforms

bold terrace Apr 15, 2025, 8:31 PM

#

But just like Ruby was before it

unique salmon Apr 15, 2025, 8:32 PM

#

Although Jarrett said that he doesn't plan to add any new features to the add-on FeelsBadAnki

ashen light Apr 15, 2025, 8:32 PM

#

can't really have a python backend running on ios

bold terrace Apr 15, 2025, 8:32 PM

#

oh

#

Python and iOS doesn't work well ?

ashen light Apr 15, 2025, 8:32 PM

#

there were real reasons for the rust move

cursive badge Apr 15, 2025, 8:32 PM

#

ashen light rust was done to allow the code to be used on all platforms

Why not Java? It runs on 3 billion devices you know. ;p

ashen light Apr 15, 2025, 8:32 PM

#

and the current multi-lang architecture

#

java could have been an option!

#

point is ankidroid is able to have feature-parity trivially due to the rust backend, before that it was constantly year(s) behind in terms of compatibility

bold terrace Apr 15, 2025, 8:33 PM

#

Also personally, in my little dream SRS, no concept of Notes/Cards, no cloze, searching deck and addon from within the app, just "Again/Good" by default

ashen light Apr 15, 2025, 8:34 PM

#

how do you handle cases where multiple cards naturally fall out of the same source data

unique salmon Apr 15, 2025, 8:34 PM

#

bold terrace Also personally, in my little dream SRS, no concept of Notes/Cards, no cloze, se...

No notes/cards would be ass though

bold terrace Apr 15, 2025, 8:34 PM

#

ashen light how do you handle cases where multiple cards naturally fall out of the same sour...

Different cards

ashen light Apr 15, 2025, 8:35 PM

#

note/cards is like one of the most convenient things, its just the ux is garbage

bold terrace Apr 15, 2025, 8:35 PM

#

I don't know I feel notes/cards is a good idea theoritically but in practice I don't think I could transform my Vocabulary deck into a Sentence deck just by mapping different fields in my front

#

The quality of the card would suffer

ashen light Apr 15, 2025, 8:35 PM

#

my own system wuold be very frustrating without it, especially as I bury my siblings

unique salmon Apr 15, 2025, 8:35 PM

#

I think notes vs cards is a necessary evil: it's confusing, but the alternative is having to make a lot more cards, not being able to edit all of them at once, and not having "bury siblings"

bold terrace Apr 15, 2025, 8:36 PM

#

For ex, I think people doing JP->EN and EN->JP from the same notes will always struggle with synonyms

#

Becuase you might need some context around the JP term to know which one you want in the EN->JP answer, and context for the EN in the JP->EN

unique salmon Apr 15, 2025, 8:37 PM

#

Same goes for deck vs preset: a necessary evil. Otherwise you will have to either

configure each deck separately
OR
have the same settings for all decks

cursive badge Apr 15, 2025, 8:37 PM

#

It would probably be terrible UX, but I kind of wish cards were decoupled more from notes. I like the idea of there being an explicit knowledge graph where cards can be based on multiple nodes.

bold terrace Apr 15, 2025, 8:37 PM

#

Oh !

cursive badge Apr 15, 2025, 8:37 PM

#

It's probably something that would work better hidden behind the scense in something like Duolingo

bold terrace Apr 15, 2025, 8:37 PM

#

@cursive badge you got me there, I was about to point of "F'cking card links"

#

Right now I have a field that help me tremendously : "Confusion". When I confuse card A with card B, I add "B" in "Confusion" of card "A"

#

So at every review of A, I also reinforce my ability to dissociate A and B

#

Having things completely atomic in Anki is not that great IMO

#

Being able to relate cards together would be a killer feature

ashen light Apr 15, 2025, 8:39 PM

#

what types of relations

#

how would relations affect reviews

bold terrace Apr 15, 2025, 8:39 PM

#

"Synonyms", "Antagonist", "Different form"...

unique salmon Apr 15, 2025, 8:39 PM

#

ashen light what types of relations

Like "Card A reminds me of card B"

#

FSRS could take that into account in theory

ashen light Apr 15, 2025, 8:39 PM

#

what do those relations do in terms of actually reviewing

bold terrace Apr 15, 2025, 8:40 PM

#

Let's say you got wrong a word because you mixed it with another, Anki could then put those 2 in alternance in the next upcoming days

unique salmon Apr 15, 2025, 8:40 PM

#

ashen light what do those relations do in terms of actually reviewing

For example, if you review card A and your memory stability for that card increases, stability for card B also increases

#

We tried that with siblings, but the improvement was too small

ashen light Apr 15, 2025, 8:40 PM

#

and people are gonna....draw lines between cards?

#

whats the ui for this

bold terrace Apr 15, 2025, 8:41 PM

#

Typical problematic scenario : A and B are very similar.
Day 1 : Review A (i=5)
Day 5 : Review A (i=10)
Day 15 : Review A (i=20)
Day 20 : Review B, got it wrong because you thought it was A
Day 21 : Review B, got it right
Day 26, Review B, got it right
...
Day 35 : Review A, got it wrong because now you thought it was B.
...
Day 65 : Review B, got it wrong because now you thought it was A

#

You do that "confusion dance" until A and B fall in the same day bin

#

With relationship, confusing B could reduce the interval of A, or sync it to B, so you review always them together, to be sure you can differentiate them

cursive badge Apr 15, 2025, 8:42 PM

#

A dream feature for me would would be to map answers to the knowledge graph and automate the creation of "scaffolding" cards when interference is detected.

#

The Math Academy FIRe stuff would be fun too.

bold terrace Apr 15, 2025, 8:43 PM

#

Yuup

ashen light Apr 15, 2025, 8:43 PM

#

the math academy stuff is carefully crafted decks and relations though?

bold terrace Apr 15, 2025, 8:44 PM

#

But IMO, this kind of graph might be easier to do if you do a tool speciallized to a domain (math, japanese ...)

ashen light Apr 15, 2025, 8:44 PM

#

like how much work goes into that

bold terrace Apr 15, 2025, 8:44 PM

#

if you do it agnostic, then it's a bit difficult to build all that model yourself

#

With a domain-specific approach, you could even use "community defined relationship"

cursive badge Apr 15, 2025, 8:44 PM

#

It would be really hard to create a good UX. That's why I said it makes more sense for a Duolingo-like. I can dream though.

bold terrace Apr 15, 2025, 8:44 PM

#

New users would directly benefit from some relationship like "Synonyms" etc

ashen light Apr 15, 2025, 8:44 PM

#

finally the peanut gallery can affect my anki reviews

unique salmon Apr 15, 2025, 8:45 PM

#

Model: FSRS-5-siblings
Total number of users: 9999
Total number of reviews: 349923850
Weighted average by reviews:
FSRS-5-dev LogLoss (mean±std): 0.3270±0.1525
FSRS-5-dev RMSE(bins) (mean±std): 0.0507±0.0325

Model: FSRS-5
Total number of users: 9999
Total number of reviews: 349923850
Weighted average by reviews:
FSRS-5 LogLoss (mean±std): 0.3276±0.1526
FSRS-5 RMSE(bins) (mean±std): 0.0518±0.0333

FeelsBadAnki

bold terrace Apr 15, 2025, 8:45 PM

#

unique salmon Model: FSRS-5-siblings Total number of users: 9999 Total number of reviews: 3499...

Yeah but this is completely different thing

unique salmon Apr 15, 2025, 8:45 PM

#

For the record, a neural net didn't do much better either

#

Though maybe Alex's nets are smarter

bold terrace Apr 15, 2025, 8:46 PM

#

here we're not really talking JP->EN vs EN->JP, but more like mesh of networks, with semantic relationship

polar maple Apr 15, 2025, 8:46 PM

#

it would be cool to get a dataset with actual information on the cards

bold terrace Apr 15, 2025, 8:46 PM

#

Also, the precision would not necessarly be the first benefit

#

The fact B entered the chat, even change the initial prediction of A

#

so prediction could change very organically based on what other things happen

#

"A card C has been introduced ? It has "Synonyms" links with cards A and B ? Let's sync their recall timing !"

#

You could even create "routes" of learning in that network

#

bulk things together, etc

ashen light Apr 15, 2025, 8:48 PM

#

bold terrace here we're not really talking JP->EN vs EN->JP, but more like mesh of networks, ...

building such a thing seems user and deck specific and would probably take more time making all the links than just bruteforcing your way through it 🍃

cursive badge Apr 15, 2025, 8:48 PM

#

polar maple it would be cool to get a dataset with actual information on the cards

I need a WaniKani DB heist.

ashen light Apr 15, 2025, 8:48 PM

#

what I have problems with some other person might just get

bold terrace Apr 15, 2025, 8:49 PM

#

ashen light building such a thing seems user and deck specific and would probably take more ...

Not really though !

#

I had cards with >100 reviews I kept getting wrong until I did the work to manually do that outside Anki

#

I think Anki is an extremely inefficient way to learn even vocabulary

ashen light Apr 15, 2025, 8:50 PM

#

and identifying and doing the work outside anki would be faster than creating some sort of relation graph in anki

bold terrace Apr 15, 2025, 8:50 PM

#

If you consider a language like Japanese has 500k word, let say you need to know 50k word, at a rate of 8 new/day, it's 17 years !

unique salmon Apr 15, 2025, 8:50 PM

#

polar maple it would be cool to get a dataset with actual information on the cards

bold terrace Apr 15, 2025, 8:50 PM

#

unique salmon

Yeah my decks is basically a big siterip of netflix lol

bold terrace Apr 15, 2025, 8:51 PM

#

ashen light and identifying and doing the work outside anki would be faster than creating so...

Sure thing ! But if you make it community-based, then it's a bit like if students would create their own students ressource

unique salmon Apr 15, 2025, 8:51 PM

#

bold terrace If you consider a language like Japanese has 500k word, let say you need to know...

bro is learning core 500k 💀

bold terrace Apr 15, 2025, 8:51 PM

#

unique salmon bro is learning core 500k 💀

The 17 years were for 50K lol

#

Before I had tht mindset : "Every word I know should be in Anki so I'll be able to really track my progress"

#

Now I realize how infeasible it is

ashen light Apr 15, 2025, 8:52 PM

#

bold terrace If you consider a language like Japanese has 500k word, let say you need to know...

because you need a card for ever word

#

every word should be in anki because anki is a grind

#

and are you grinding if you don't add everything you don't know to it?

#

how else are you gonna push the unwanted memories out

bold terrace Apr 15, 2025, 8:53 PM

#

That's the problematic mindset indeed

#

BUT

ashen light Apr 15, 2025, 8:53 PM

#

anki is a form of penance

unique salmon Apr 15, 2025, 8:54 PM

#

bold terrace Apr 15, 2025, 8:54 PM

#

In fact, many words overlap, so finding ways to batch them is really a nice trick to achieve higher new/day without having to wait to master 20k words before it goes faster

#

#

I batch my next new card by common kanji now

ashen light Apr 15, 2025, 8:55 PM

#

at least have the kanji for orange there

#

its so left out

unique salmon Apr 15, 2025, 8:55 PM

#

unique salmon

Also, I just realized that it's a tripple negative 🤣

bold terrace Apr 15, 2025, 8:55 PM

#

And to find the next batch, I added a sorting order "Unseen Card" in Kanji Grid (By Kuuuube)

#

so I take a mastered-kanji for which I have a lot of cards to learn

#

and I can make learnig new words way more easily

ashen light Apr 15, 2025, 8:57 PM

#

not even 100%

bold terrace Apr 15, 2025, 8:57 PM

#

IMO grinding better logloss/RMSE prediction already hit a threshold of diminishing returns

ashen light Apr 15, 2025, 8:57 PM

#

you need to grind harder clearly

cursive badge Apr 15, 2025, 8:57 PM

#

As a side topic: I reduced my DR from 90%->80% last month because I was struggling. It halved my reviews but somehow my TR for the month only dropped to 87% 😅

bold terrace Apr 15, 2025, 8:57 PM

#

cursive badge As a side topic: I reduced my DR from 90%->80% last month because I was struggli...

Lucky boy

unique salmon Apr 15, 2025, 8:57 PM

#

cursive badge As a side topic: I reduced my DR from 90%->80% last month because I was struggli...

That makes sense if you didn't use "reschedule cards on change"

bold terrace Apr 15, 2025, 8:57 PM

#

I dropped from 80->70% in December, went from 77% to 55%

#

never again

cursive badge Apr 15, 2025, 8:57 PM

#

unique salmon That makes sense if you didn't use "reschedule cards on change"

I did

bold terrace Apr 15, 2025, 8:58 PM

#

#Team90DRforevernow

cursive badge Apr 15, 2025, 8:58 PM

#

Well, the helper addon version.

bold terrace Apr 15, 2025, 8:58 PM

#

CMRR is full of lies and shadows

#

@cursive badge : But you still do higher DR for mature words no ?

#

So it pull the retention higher /

#

I remember you said you had Filtered Decks for mature

unique salmon Apr 15, 2025, 8:59 PM

#

CMRR be like:
This user has a deck with exactly 10*days_to_simulate cards in it. And he can learn an infinite number of new cards, but only as long as the overall studying time does not exceed 30 minutes

#

This is why I asked Luc to just hook it up to the simulator config

bold terrace Apr 15, 2025, 9:00 PM

#

CMRR be like : return 0.70

#

What about ... 0.70

cursive badge Apr 15, 2025, 9:00 PM

#

bold terrace <@347088848854974465> : But you still do higher DR for mature words no ?

I dropped doing that too after a while. It has not been a great month for me ☹️

bold terrace Apr 15, 2025, 9:00 PM

#

Gotta take some 0.70 ?

bold terrace Apr 15, 2025, 9:00 PM

#

cursive badge I dropped doing that too after a while. It has not been a great month for me ☹️

Sorry to hear that 😦

ashen light Apr 15, 2025, 9:00 PM

#

I like how it just returns the lowest valid value, "I mean it is the minimum recommended value"

unique salmon Apr 15, 2025, 9:01 PM

#

CMRR was intended to have as few settings as possible, but as a result you get this "The user has no learned cards, deck size=10*days_to_simulate, Easy Days don't exist, fuzz doesn't exist, sort order doesn't exist, new card limit is infinite" shit

ashen light Apr 15, 2025, 9:02 PM

#

sounds a bit halfbaked

unique salmon Apr 15, 2025, 9:02 PM

#

This is why I asked Luc to just hook it up to the simulator config

bold terrace Apr 15, 2025, 9:02 PM

#

unique salmon CMRR was intended to have as few settings as possible, but as a result you get t...

Also fundamentaly there is the issue of the exponential that goes slower and slower into low number, while those low number contribution are purely linear

unique salmon Apr 15, 2025, 9:03 PM

#

luc plz deliver

#

prayge

bold terrace Apr 15, 2025, 9:03 PM

#

So it's easier to have a full backlog of 30% Retention than to maintain cards at 80% R

polar maple Apr 15, 2025, 9:03 PM

#

cursive badge As a side topic: I reduced my DR from 90%->80% last month because I was struggli...

flat forgetting curve supremacy

cursive badge Apr 15, 2025, 9:04 PM

#

Secretly I was one of the low decay weirdos all along 😮

#

I'm definitely not remembering this stuff in 100 years though, so maybe not that weirdly low ;p

bold terrace Apr 15, 2025, 9:06 PM

#

Also IMO the default graph selection of Anki should be a bit more useful than "What was your Retention at 9PM".

Stability over Time
Memorized over Time f(R,S), not just sum(R)
Daily Load profiling (by lapse, repetitions...)

unique salmon Apr 15, 2025, 9:07 PM

#

bold terrace IMO grinding better logloss/RMSE prediction already hit a threshold of diminishi...

Kind of, but not in the sense that you mean. Rather, I think we picked all the low-hanging fruit and past FSRS-6 there just aren't any clear ways to improve FSRS
You could get much better results with a neural net though

unique salmon Apr 15, 2025, 9:08 PM

#

bold terrace Also IMO the default graph selection of Anki should be a bit more useful than "W...

Memorized over Time f(R,S), not just sum(R)
Oh come oooon, how many times have we discussed this...
Sum(R) has an intuitive interpretation, f(R, S) will almost certainly have none

bold terrace Apr 15, 2025, 9:08 PM

#

Like, my R is around 90% between noon and 6PM and very low after 8PM ...

No shit sherlock, I had DR=70/80% when I was doing my reviews at night

bold terrace Apr 15, 2025, 9:08 PM

#

unique salmon > Memorized over Time f(R,S), not just sum(R) Oh come oooon, how many times have...

?

#

f(R,S) means : a function depending on R and S

unique salmon Apr 15, 2025, 9:08 PM

#

Yep

bold terrace Apr 15, 2025, 9:08 PM

#

integral based or not, doesn't matter

#

you create fights where there is none

unique salmon Apr 15, 2025, 9:09 PM

#

Think of one that still has an intuitive interpretation

#

Oh, the integral one does, btw

bold terrace Apr 15, 2025, 9:09 PM

#

exactly

unique salmon Apr 15, 2025, 9:09 PM

#

R*(1-exp(-S)) or whatever doesn't

bold terrace Apr 15, 2025, 9:09 PM

#

whatever

unique salmon Apr 15, 2025, 9:09 PM

#

Or R×sqrt(S) or R×ln(S)

bold terrace Apr 15, 2025, 9:09 PM

#

Damn you really missunderstand everything 🥲

#

By F(R,S) I just mean any kind of function taking into account R and S

#

so if you like your integral so be it

#

I Don't care xD

#

Go jerk on vtubers

unique salmon Apr 15, 2025, 9:10 PM

#

There is a difference between "it has desirable mathematical properties" and "it can be explained in one sentence with <20 words to an average user"

#

You want something with desirable mathematical properties, but in this case it makes it very hard to come up with something that also has a simple, intuitive interpretation

#

And not just "higher number = better"

unique salmon Apr 15, 2025, 9:12 PM

#

bold terrace so if you like your integral so be it

For "memorised over time" just sum(R) is the best

#

I was planning to use the integral for CMRR

cursive badge Apr 15, 2025, 9:12 PM

#

Ross like line go up. Line go up make Ross feel good.

bold terrace Apr 15, 2025, 9:12 PM

#

sum(R) should be called "Expected Score at a test" IMO 😛

#

If you have 50% R for all your card, you might be able to get a 50% at a test with the same cosntraints than in Anki

#

So yeah, sum(R) can make sense

#

But c'mon, please people, have a bit more self-love than just trying to get x-% at a test 🥲

#

Try to remember it more than one day 😄

bold terrace Apr 15, 2025, 9:16 PM

#

cursive badge Ross like line go up. Line go up make Ross feel good.

I think it's even more important that normally, in a perfect world, TR=DR

#

Soooo if all you see is a retention line that stick at DR for month

#

you might feel you're getting nowhere

#

but in fact, Stability might increase, workload might be dropping ...

#

all those tiny positive things need to be brought up 🙂

#

That's also why I think sum(R) is a bit silly : If your DR is 80, then sum(R), without adding card ........... will always be somewhere around N*90% (N your number of active card)

unique salmon Apr 15, 2025, 9:19 PM

#

bold terrace That's also why I think sum(R) is a bit silly : If your DR is 80, then sum(R), w...

Well, yes. That's the point

cursive badge Apr 15, 2025, 9:19 PM

#

Kind of sad if you are looking for progress though.

bold terrace Apr 15, 2025, 9:20 PM

#

Yeah to me sum(R) is really just a "test score estimate" in some way

#

Well not even an estimate 😛

#

You know at 100% R 50% of your deck, the Memorized will tell you 50% N, but in practice you'll most definitely miss your test if the grading condition is 50% good answers

#

(if you're not lucky and pick questions you were at 0% in your deck)

#

So sum(R) is like "Your estimated score test result, if the test consist of all the cards of your deck" lol

cursive badge Apr 15, 2025, 9:26 PM

#

Maybe I need a Stability over time heatmap to show progress.

unique salmon Apr 15, 2025, 9:27 PM

#

You can plot sum(S), yes. Though I can't think of a nice interpretation

#

Average S is a bit better in that sense

cursive badge Apr 15, 2025, 9:27 PM

#

unique salmon You can plot sum(S), yes. Though I can't think of a nice interpretation

I mean this, but into the past:

bold terrace Apr 15, 2025, 9:28 PM

#

Yeah average S is nice, median too but is a bit less smooth in practice

cursive badge Apr 15, 2025, 9:28 PM

#

(N.B. the cut-off at 21 days is because of the filtered decks Sound was talking about earlier)

bold terrace Apr 15, 2025, 9:28 PM

#

I'm not entirely sure what the due dimension brings to the table though 😄

#

Could be useful to find "spike" of workload, but those are often the 1-5d stability that are accounted only for one rep in those things

#

I need to redo it, but Stability over Reps is the most depressing things I ever plotted

#

It's ... a declining function

#

The more you rep, the less the average stability

#

It's at that point I thought : Ok now my focus is to find where my workload goes XD

#

My higher lapses, represent 10% of workload for each slice of 5% cards 🥲

#

54% of my workload for 33% of my higher lapse cards 🥲

cursive badge Apr 15, 2025, 9:32 PM

#

I have not found the "due in" version very useful. It just lets you see a bit more of what is happening in "Future Due".
A version looking into the past would let you see how the card stabilities changed over time (hopefully lots of them increasing).

bold terrace Apr 15, 2025, 9:32 PM

#

cursive badge I have not found the "due in" version very useful. It just lets you see a bit mo...

Yeah with the past it could be cool 🙂

#

So now I even consider reducing my lapse normal -> hard to 4 and my hard to suspend to 8 lol

#

The overall idea would be : Discover as many easy words as possible, taking the low hanging fruits, and then build the more difficult words based on them

#

I have 1157 cards I never lapsed a single time xD

#

Over 3000

#

So when I see a word like 躱す (To dodge), that I reviewd 62 times in 4 months for a current stability of 4d... I'm like ... ok maybe I should just postpone it

cursive badge Apr 15, 2025, 9:41 PM

#

One of my worst is 靖 (109 reviews, 23 lapses). I kept on confusing it with 情.

bold terrace Apr 15, 2025, 9:46 PM

#

Yeah for those I really think brute force is not the key

#

So either you take time to really analyze it, or you just suspend it

cursive badge Apr 15, 2025, 9:49 PM

#

I finally got it after noticing the issue and spending some extra effort. Now the thing that usually gets me is answering "peace" instead of "peaceful".

ashen light Apr 15, 2025, 10:04 PM

#

I think you just aren't brute forcing hard enough

unique salmon Apr 15, 2025, 10:07 PM

#

bro just 500 more reviews and i will remember it bro

ashen light Apr 15, 2025, 10:07 PM

#

maybe meditate on the card for an hour

#

new preset: brute force. 20 learning steps and they're all 15 minutes

unique salmon Apr 15, 2025, 10:08 PM

#

ashen light new preset: brute force. 20 learning steps and they're all 15 minutes

I'm willing to bet someone is actually doing that

#

Though, finding the exact user among millions of users is a problem

ashen light Apr 15, 2025, 10:09 PM

#

its me

#

I'm doing it

unique salmon Apr 15, 2025, 10:09 PM

#

where poisson binomial jake

#

jake

#

we need to ~~cook~~ code

#

where leech detector

cosmic hedge Apr 15, 2025, 10:11 PM

#

cursive badge One of my worst is 靖 (109 reviews, 23 lapses). I kept on confusing it with 情.

bold terrace Apr 15, 2025, 10:11 PM

#

cosmic hedge

Holy moly

unique salmon Apr 15, 2025, 10:11 PM

#

cosmic hedge

https://tenor.com/view/gypsy-crusader-gif-23889828

Tenor

bold terrace Apr 15, 2025, 10:11 PM

#

Mother of all leeches

ashen light Apr 15, 2025, 10:12 PM

#

one day

bold terrace Apr 15, 2025, 10:12 PM

#

What were their stability ?

ashen light Apr 15, 2025, 10:12 PM

#

I'm waiting for dae to sign off on it before I touch code

cosmic hedge Apr 15, 2025, 10:12 PM

#

my screenshot software crashed 😭

cosmic hedge Apr 15, 2025, 10:12 PM

#

bold terrace What were their stability ?

3 days

bold terrace Apr 15, 2025, 10:13 PM

#

Nice

#

And the words ?

cosmic hedge Apr 15, 2025, 10:13 PM

#

i mean theres more than 3 XD

#

精進 is the top one

#

yeah I need a leech detector now tbf 😅

cursive badge Apr 15, 2025, 10:24 PM

#

cosmic hedge yeah I need a leech detector now tbf 😅

I think we can safely say without a detector that those 3 are leeches 😂

bold terrace Apr 15, 2025, 10:24 PM

#

Lapse>100 might already give you some insights 😂

sick moth Apr 15, 2025, 10:30 PM

#

unique salmon Remove Evaluate side: me, David, sorata, jake Keep Evaluate side: Sound, Danika,...

As stated a few times, I'm an extremist for minimalism: I'd consider removing everything but DR

cursive badge Apr 15, 2025, 10:30 PM

#

I still think there needs to be a component of looking at study time and passed intervals in a leech detector. The Poisson Binomial stuff is interesting, but I don't think it fully captures "leechness" on its own.

unique salmon Apr 15, 2025, 10:32 PM

#

sick moth As stated a few times, I'm an extremist for minimalism: I'd consider removing ev...

Based

#

Now let's see how Dae reacts to my issue 😅

cursive badge Apr 15, 2025, 10:36 PM

#

Maybe the solution is to remove all the advanced stuff but leave the APIs for an "Advanced Mode" addon that puts them back in. Then the burden of maintaining the separate UI is offloaded to the Addon maintainers instead of core Anki / Dae.

ashen light Apr 15, 2025, 10:38 PM

#

thats something I've been thinking on how it would sort of work, like anki could give a handful of hooks into say fsrs logic and addons could use that in a variety of ways

#

imagine: fsrs auto-optimize addon

#

conflcts are your own problem 🍃

#

the issue with this though is that mobile users get fucked

cursive badge Apr 15, 2025, 10:44 PM

#

My kingdom for a [cross platform addon system] 😂

ashen light Apr 15, 2025, 10:44 PM

#

if only apple didn't explictly forbid them

cursive badge Apr 15, 2025, 10:46 PM

#

We just have to get the EU to harass them harder until they submit. I saw something about them having caved on emulators recently because of EU pressure.

ashen light Apr 15, 2025, 10:47 PM

#

see you in 5 years

cursive badge Apr 15, 2025, 10:47 PM

#

Just in time for the Svelte migration to be finished! 😂

ashen light Apr 15, 2025, 10:48 PM

#

🍃

quasi shadow Apr 16, 2025, 2:04 AM

#

polar maple if AUC is less than 0.5, turn off FSRS

AUC is not a good metric for our optimization goal.

#

https://github.com/open-spaced-repetition/spaced-repetition-algorithm-metric/blob/main/metrics_research.ipynb

GitHub

spaced-repetition-algorithm-metric/metrics_research.ipynb at main ...

Contribute to open-spaced-repetition/spaced-repetition-algorithm-metric development by creating an account on GitHub.

polar maple Apr 16, 2025, 2:05 AM

#

quasi shadow AUC is not a good metric for our optimization goal.

then let's just leave only log loss

#

or none, expertium wants to remove the evaluate button

quasi shadow Apr 16, 2025, 2:06 AM

#

polar maple then let's just leave only log loss

doglaugh It's my initial position years ago.

#

Expertium hopes the metric human-readable, so we have RMSE(bins).

quasi shadow Apr 16, 2025, 2:08 AM

#

polar maple <@449662392314494987> please give fsrs wiki modify permissions (I think i need p...

Do you mean this page: https://github.com/open-spaced-repetition/fsrs4anki/wiki/The-Metric ?

GitHub

The Metric

A modern Anki custom scheduling based on Free Spaced Repetition Scheduler algorithm - open-spaced-repetition/fsrs4anki

polar maple Apr 16, 2025, 2:09 AM

#

unique salmon It will be too similar to log-loss, both in terms of absolute values and in term...

or how about we only display RMSE non-bins and not log loss? if we don't show log loss then we don't run into this similary problem

quasi shadow Apr 16, 2025, 2:09 AM

#

polar maple Apr 16, 2025, 2:09 AM

#

quasi shadow Do you mean this page: https://github.com/open-spaced-repetition/fsrs4anki/wiki/...

yeah

quasi shadow Apr 16, 2025, 2:09 AM

#

Now you can edit it.

polar maple Apr 16, 2025, 2:09 AM

#

ok thanks

quasi shadow Apr 16, 2025, 2:13 AM

#

@unique salmon pretraining decay doesn't work well.

Model: FSRS-rs-dev
Total number of users: 345
Total number of reviews: 10524780
Weighted average by reviews:
FSRS-rs-dev LogLoss (mean±std): 0.3275±0.1511
FSRS-rs-dev RMSE(bins) (mean±std): 0.0479±0.0311
FSRS-rs-dev AUC (mean±std): 0.7184±0.0831

Weighted average by log(reviews):
FSRS-rs-dev LogLoss (mean±std): 0.3496±0.1618
FSRS-rs-dev RMSE(bins) (mean±std): 0.0620±0.0389
FSRS-rs-dev AUC (mean±std): 0.7078±0.0884

Weighted average by users:
FSRS-rs-dev LogLoss (mean±std): 0.3517±0.1638
FSRS-rs-dev RMSE(bins) (mean±std): 0.0640±0.0399
FSRS-rs-dev AUC (mean±std): 0.7071±0.0909

parameters: [0.2027, 1.0535, 2.8078, 15.9455, 6.9865, 0.5577, 2.2141, 0.0069, 1.5326, 0.1223, 1.0383, 1.8223, 0.1175, 0.3022, 2.2859, 0.2162, 3.0055, 0.79, 0.2611, 0.1427, 0.2029]

Model: FSRS-rs
Total number of users: 345
Total number of reviews: 10524780
Weighted average by reviews:
FSRS-rs LogLoss (mean±std): 0.3273±0.1509
FSRS-rs RMSE(bins) (mean±std): 0.0479±0.0309
FSRS-rs AUC (mean±std): 0.7187±0.0838

Weighted average by log(reviews):
FSRS-rs LogLoss (mean±std): 0.3489±0.1607
FSRS-rs RMSE(bins) (mean±std): 0.0615±0.0374
FSRS-rs AUC (mean±std): 0.7079±0.0888

Weighted average by users:
FSRS-rs LogLoss (mean±std): 0.3509±0.1624
FSRS-rs RMSE(bins) (mean±std): 0.0634±0.0382
FSRS-rs AUC (mean±std): 0.7072±0.0913

parameters: [0.216, 1.1977, 2.8019, 15.7018, 6.9865, 0.5514, 2.2311, 0.007, 1.533, 0.1272, 1.0386, 1.8204, 0.1162, 0.2988, 2.2863, 0.2181, 3.0072, 0.8048, 0.2625, 0.1379, 0.1914]

quasi shadow Apr 16, 2025, 3:39 AM

#

ashen light the math academy stuff is carefully crafted decks and relations though?

There isn't any deck in math academy. I'm using it.

#

cosmic hedge Apr 16, 2025, 8:14 AM

#

quasi shadow Expertium hopes the metric human-readable, so we have RMSE(bins).

#1282005522513530952 message I still personally think that "a better fit/log loss than x% of users" would be the most readable option.

unique salmon Apr 16, 2025, 8:42 AM

#

quasi shadow <@530106856593424407> pretraining decay doesn't work well. ``` Model: FSRS-rs-de...

Crap. Oh well

quasi shadow Apr 16, 2025, 8:46 AM

#

cosmic hedge https://discord.com/channels/368267295601983490/1282005522513530952/133393357409...

It's meaningless to compare the log loss among users because it's related to the retention.

quasi shadow Apr 16, 2025, 9:38 AM

#

#

Good News: FSRS now outperforms SM-17 significantly.

bold terrace Apr 16, 2025, 9:46 AM

#

People complaining about long intervals, meanwhile my hard deck : 50 consecutive good rating, stability 28 lol

#

4 reviews for my normal deck to get to 25d stability

#

Hard : logloss 0.4353, RMSE 4.34%
Normal : logloss 0.3579, RMSE 3.29%
Merged : logloss 0.4203, RMSE 3.39%

#

But funny enough, a mistake in the normal one is more sanctionned than in terms of interval (but not in reps to recover) in the normal one

unique salmon Apr 16, 2025, 9:59 AM

#

quasi shadow Good News: FSRS now outperforms SM-17 significantly.

...based on 16 collections

#

You should add AUC btw, just for the sake of consistency with the other benchmark

#

Actually, once your PR is merged I'll make another one just to re-write some stuff in readme

quasi shadow Apr 16, 2025, 11:10 AM

#

If everything goes well, the benchmark will be done tomorrow.

unique salmon Apr 16, 2025, 12:47 PM

#

@quasi shadow would you move "Evaluate" to the Helper add-on if Dae was ok with it?

quasi shadow Apr 16, 2025, 12:57 PM

#

Nope

#

Just use the notebook optimizer.

unique salmon Apr 16, 2025, 12:58 PM

#

quasi shadow Nope

Why? FeelsBadAnki

#

It's better than removing it entirely

quasi shadow Apr 16, 2025, 1:00 PM

#

yeah

unique salmon Apr 16, 2025, 1:00 PM

#

Then Sound won't complain 🤣

robust hill Apr 16, 2025, 1:00 PM

#

if you remove my evaluate button

cursive badge Apr 16, 2025, 1:00 PM

#

Just let Jarrett retire 😂

robust hill Apr 16, 2025, 1:01 PM

#

https://tenor.com/view/스마일-조커-웃음-laughter-laugh-gif-4675782166802660076

Tenor

unique salmon Apr 16, 2025, 1:13 PM

#

cursive badge Just let Jarrett retire 😂

robust hill Apr 16, 2025, 1:21 PM

#

yes

bold terrace Apr 16, 2025, 1:55 PM

#

If R=50%, would be the cost of a "Good" be equal to the cost of a "Again" ?

unique salmon Apr 16, 2025, 1:56 PM

#

bold terrace If R=50%, would be the cost of a "Good" be equal to the cost of a "Again" ?

If you mean time per review, no. And Jarrett removed that "correction", if that's what you're talking about

bold terrace Apr 16, 2025, 1:57 PM

#

Ah no no in terms of optimization, to reduce logloss and RMSE

#

I was wondering if someone is strange enough to put as DR, 50%

#

What would be all the implications

unique salmon Apr 16, 2025, 1:58 PM

#

Ah, yes. At R=50% logloss is the same for any grade

bold terrace Apr 16, 2025, 1:59 PM

#

Could it be that the closer to 50% you are, the less precise FSRS could be then ?

#

For example for DR=60%, the cost of 3 or a 1 would be much more similar, so the optimization result might not be a model that target 60%, but a model "that just doesn't really care" 🤔

#

Of course depends on the subject, getting 50% if the questions are "Yes/No" would be different than "Type the year when this happened"

#

The "endgoal" question being : "Isn't because of that, that higher DR like 90-95% could just be easier to predict for FSRS than lower one like 70%"

unique salmon Apr 16, 2025, 2:04 PM

#

If your true R is 50% all the time, FSRS would still do its best to adapt to predict that. So it wouldn't "not care", it has to care to accurately predict R=50% all the time

#

It's not like it won't be penalized. If your true R=50% all the time and FSRS predicts 40%, it would be penalized

bold terrace Apr 16, 2025, 2:07 PM

#

Yep you would get very volatile parameters only if the answers would be a flip coin "Yes / No"

#

(By nature you'd have a R of 50%)

#

For the DR=95% vs 70% though

robust hill Apr 16, 2025, 2:07 PM

#

what if ur true R is at 2%

unique salmon Apr 16, 2025, 2:19 PM

#

bold terrace Yep you would get very volatile parameters only if the answers would be a flip c...

I dunno if they would be volatile

unique salmon Apr 16, 2025, 5:28 PM

#

@polar maple ok man, I didn't want to bother you, but I REALLY hope you can release RWKV soon. My benchmarking article has been in the making for months and I want to finish it 😅

#

https://docs.google.com/forms/d/1Uy8zr9QOS6u-oLVRwVCuQfyFUiSwKxt9pWEvlTGdn9k/viewanalytics
Regarding Evaluate, things may change as I gather more responses, but so far a very strange pattern emerges: there are a lot of users who use Evaluate without understanding where the numbers come from or even what values are sane.
As of right now:

~60% of users use Evaluate regularly
Only ~15% of users can give a range of log-loss/RMSE values that are good. Out of the remaining 85%, most don't know what values are reasonable at all, not even roughly
~88% of users don't know the math behind the metrics
Yet only ~40% of users are confident that their Anki routine would not be negatively affected by the removal of Evaluate

That's...strange. It means that a lot of users are using Evaluate on a regular basis without knowing what values are good or how they are calculated, and those users feel like removing numbers that they don't understand would (somehow) disturb their way of using Anki.

@bold terrace thoughts?

Google Docs

FSRS and "Evaluate"

#

TLDR: 85-90% users have no idea what the numbers mean or what values are good, but 60% of users use Evaluate regularly anyway

#

I'm not sure how to reconcile these two facts

polar maple Apr 16, 2025, 6:31 PM

#

bold terrace People complaining about long intervals, meanwhile my hard deck : 50 consecutive...

maybe in the hard deck there doesn't exist cards that have gotten out of the leech zone, so fsrs can't learn how to predict stability for these cards

hasty fractal Apr 16, 2025, 6:34 PM

#

unique salmon https://docs.google.com/forms/d/1Uy8zr9QOS6u-oLVRwVCuQfyFUiSwKxt9pWEvlTGdn9k/vie...

ask people what's the function of evaluate. imo some people think you should press it "for the algo" or it's just stats porn for them.

#

and anki is educational software, we gotta remove all porn!

#

ah wait, u didn't give them enough options. I personally don't fit into any of those groups.

#

I used evaluate a lot at one time (for presets) but have completely stopped using it.

#

I only use it now if a new update comes (stats porn).

cursive badge Apr 16, 2025, 6:49 PM

#

unique salmon I'm not sure how to reconcile these two facts

I think you are still not considering that knowing exactly what the numbers mean / how they are calculated does not matter.
Knowing that number goes down = good is enough for people to:

See that FSRS is improving over time
Check if splitting / reorganising Presets is worth it

Knowing what range is "good" would be useful, but could be replaced with a simple traffic light Good/Ok/Bad (if we actually know what ranges are "good").

unique salmon Apr 16, 2025, 6:51 PM

#

cursive badge I think you are still not considering that knowing exactly what the numbers mean...

Ok, but even if knowing exactly how the numbers are calculated isn't important and only knowing the range of good values is, 85% (80% as of now, the results have changed a bit) of users don't even know the range

#

So we're still left with a situation where the majority of users don't know what values are good, but keep using Evaluate anyway

cursive badge Apr 16, 2025, 6:54 PM

#

To be fair I don't know what ranges are technically "good". If I created a new preset and saw it had massively larger values than existing presets it still helps me know something might be wrong, even if I don't know exactly what range I should be expecting.

unique salmon Apr 16, 2025, 6:56 PM

#

I like the idea of a "health check", but Evaluate is really poorly suited for that. Evaluate is like a health check that tells you "You have fatal organ failure" when it's already too late AND doesn't tell you which organs are shutting down or why

cursive badge Apr 16, 2025, 6:57 PM

#

In an ideal world I do agree that they would be debugging/advanced values and we would have nice "Health Check" tools.

unique salmon Apr 16, 2025, 6:57 PM

#

We don't have good tools to diagnose Hard misuse

#

sadge

#

FeelsBadAnki

cursive badge Apr 16, 2025, 6:57 PM

#

Is it even possible to detect Hard misuse?

#

As in ever

unique salmon Apr 16, 2025, 6:58 PM

#

I've proposed detecting it based on one of parameters, but Jarrett said it's a bad idea

bold terrace Apr 16, 2025, 6:58 PM

#

unique salmon https://docs.google.com/forms/d/1Uy8zr9QOS6u-oLVRwVCuQfyFUiSwKxt9pWEvlTGdn9k/vie...

I'm surprised 19 has checked the formulas for RMSE, I didn't, I just asked you how to interpret it once or twice and that was it

bold terrace Apr 16, 2025, 6:58 PM

#

cursive badge To be fair *I* don't know what ranges are technically "good". If I created a new...

For the rest, I align myself on this interpretation by @cursive badge

cursive badge Apr 16, 2025, 6:59 PM

#

I assumed it was impossible to detect Hard misuse because you are effectively lying and we have no way of knowing the objective truth apart from what the user tells us.

unique salmon Apr 16, 2025, 6:59 PM

#

Yep

bold terrace Apr 16, 2025, 7:00 PM

#

However, if there is something I think could be simplified ... or even ... removed... would be to use both logloss and RMSE in the screen. You know lower is better, but what if RMSE goes down but not logloss, etc

unique salmon Apr 16, 2025, 7:00 PM

#

We can kind of assume that the user is misusing Hard if FSRS decided to set their SInc(Hard) to 1 aka S doesn't increase with Hard, but again, Jarrett said it wouldn't work well

bold terrace Apr 16, 2025, 7:00 PM

#

Even to this day, something I'm like "Ok now I have 0.40 logloss instead of 0.60, but I get a bigger RMSE by splitting the deck... so what do I chose ? Lower logloss ? Lower RMSE ?"

polar maple Apr 16, 2025, 7:00 PM

#

have we tried something like treating 'hard' as 'again' and checking if the metrics look better after?

unique salmon Apr 16, 2025, 7:01 PM

#

polar maple have we tried something like treating 'hard' as 'again' and checking if the metr...

Nope

bold terrace Apr 16, 2025, 7:01 PM

#

polar maple have we tried something like treating 'hard' as 'again' and checking if the metr...

Yeah agree that would be basically a constant factor of 2 and at least we could suggest the user "Hey, guess you might have better time treating your Hard as Again (or even just ignore those)"

#

ah shit can't ignore those

cursive badge Apr 16, 2025, 7:01 PM

#

polar maple have we tried something like treating 'hard' as 'again' and checking if the metr...

Then you break it for weirdos like me 😂

bold terrace Apr 16, 2025, 7:02 PM

#

cursive badge Then you break it for weirdos like me 😂

Not necessarly, the optimizer would run twice, one with Hard=Hard, one with Hard=Again, and you take the best fit

#

Definitely more something for the addon though ?

unique salmon Apr 16, 2025, 7:02 PM

#

The best solution is to have a "I use Hard as fail" toggle

#

That's it

#

Simple and no false positives/negatives

bold terrace Apr 16, 2025, 7:02 PM

#

Wouldn't hurt either

#

at least the user can check how different results would be

unique salmon Apr 16, 2025, 7:03 PM

#

It would require maintaining two versions of FSRS though, that sucks

bold terrace Apr 16, 2025, 7:03 PM

#

I'm fucking surprised at 97% FSRS though

polar maple Apr 16, 2025, 7:03 PM

#

apparently 'Remedy Hard Misuse' just does this Hard -> Again relabelling, why isn't this just automatically done?

cursive badge Apr 16, 2025, 7:03 PM

#

bold terrace Not necessarly, the optimizer would run twice, one with Hard=Hard, one with Hard...

But what if Hard=Fail fits better if you assume that's what I meant when I did not.

unique salmon Apr 16, 2025, 7:03 PM

#

bold terrace I'm fucking surprised at 97% FSRS though

Considering that this a survey about FSRS, I wouldn't take that particular % seriously

bold terrace Apr 16, 2025, 7:03 PM

#

I'm really curious how much this fit "a more broad" population like the 500k people that watched the "anki introduction" where the guy still tweak SM2 in 2024

bold terrace Apr 16, 2025, 7:03 PM

#

unique salmon Considering that this a survey *about* FSRS, I wouldn't take that particular % s...

ah indeed

robust hill Apr 16, 2025, 7:04 PM

#

cursive badge Then you break it for weirdos like me 😂

what

unique salmon Apr 16, 2025, 7:04 PM

#

That's like making a survey asking "What's you favorite anime?" and being surprised that 95% of participants watch any anime at all

cursive badge Apr 16, 2025, 7:04 PM

#

robust hill what

I think we have had this reaction before ;p

bold terrace Apr 16, 2025, 7:04 PM

#

cursive badge But what if Hard=Fail fits better if you assume that's what I meant when I did n...

Well to be honest if your memory model fits better with Hard=Fail, why not use that model 😄 ?

bold terrace Apr 16, 2025, 7:05 PM

#

cursive badge Then you break it for weirdos like me 😂

OH you know what ?? Do you think it would be possible to have an Anki addon that would remove Hard/Easy, but would input those instead of "Good" when time used to answer is lesser or greater than certain thresholds ???

cursive badge Apr 16, 2025, 7:05 PM

#

bold terrace Well to be honest if your memory model fits better with Hard=Fail, why not use t...

but I didn't mean Hard=Fail when I was grading, so FSRS would push all the intervals a lot shorter in trying to get me to match my DR

bold terrace Apr 16, 2025, 7:05 PM

#

That would be DOPE

unique salmon Apr 16, 2025, 7:06 PM

#

bold terrace OH you know what ?? Do you think it would be possible to have an Anki addon that...

please no
https://expertium.github.io/Buttons.html

Expertium’s Blog

Button usage and review time of Anki users

Spaced repetition stuff

bold terrace Apr 16, 2025, 7:06 PM

#

Wait

#

We could take the 10K user dataset

unique salmon Apr 16, 2025, 7:07 PM

#

I hope nobody will interpret this article as “It’s ok to use review time to automatically select the answer button for the user”.

Time to answer varies not only between different people but also between different types of material. So Anki will have to estimate what time corresponds to Again-Hard-Good-Easy for this specific user and for this specific material.

average_t(Again) > average_t(Hard) > average_t(Good) > average_t(Easy) is true only for 40% of users.

There will be outliers if the user went to the toilet or got distracted by a phone call or something.

It’s WAY easier to just use self-reported grades. There are a lot of arguments about using 2 vs 4 buttons, and those arguments will likely last as long as Anki itself, but using time as a proxy for the answer button will be worse than either of those options. Using time as a proxy will work reliably only for about 40% of users, will be prone to outliers, and the exact cutoffs will have to be adjusted for each user individually and for different decks.

Compare that to just asking the user to click a button.

bold terrace Apr 16, 2025, 7:07 PM

#

Force all <X sec to be "hard', all >Y to be "easy", run optimization on it, and see if FSRS fit better ?

#

That would be proof

unique salmon Apr 16, 2025, 7:07 PM

#

In case you are confused: for example, Again > Hard > Good > Easy means “Average time for ‘Again’ is greater than the average time for ‘Hard’, which in turn is greater than the average time for ‘Good’, which in turn is greater than the average time for ‘Easy’”. But that’s too long, so I just wrote it as Again > Hard > Good > Easy.

ashen light Apr 16, 2025, 7:08 PM

#

unique salmon That's like making a survey asking "What's you favorite anime?" and being surpri...

whats your fav fsrs param

polar maple Apr 16, 2025, 7:09 PM

#

bold terrace Even to this day, something I'm like "Ok now I have 0.40 logloss instead of 0.60...

choose log loss most of the time

cursive badge Apr 16, 2025, 7:09 PM

#

bold terrace Force all <X sec to be "hard', all >Y to be "easy", run optimization on it, and ...

You would probably want to do it based on the response distribution rather than fixed thresholds. Also Anki only records total study time, not time looking at front of card which I think taints the data.

bold terrace Apr 16, 2025, 7:09 PM

#

unique salmon In case you are confused: for example, Again > Hard > Good > Easy means “Average...

Let's run FSRS on the 40% that respect the Again > Hard > Good > Easy and change all their ratings based on their time answering

#

And see how well it improve their rating

#

their prediction*(

#

Also

#

Hard > Good > Easy is the only thing we need to take into account

unique salmon Apr 16, 2025, 7:10 PM

#

Using both time and grades would be neat, but idk how to do it in practice with FSRS

bold terrace Apr 16, 2025, 7:11 PM

#

so we have the blue, orange part that match

#

60% user fit perfectly !

unique salmon Apr 16, 2025, 7:11 PM

#

I tried it once (time + number of reviews done on that day) and it didn't do shit

#

So either I'm dumb or it's just hard to do

bold terrace Apr 16, 2025, 7:13 PM

#

Well it's true that time taken to answer, is already somewhat captured in the Retention info... So long answers already weight more on the "fail" side

#

And, since people didn't use themselve hard/good/easy, fitting a model on those "faked entries" in the benchmark means if they press "Good" for everything, they won't benefit from tit

#

So we'd need to take users that already respect that pattern

#

can't do that on people not using Hard/Easy consistently in the first place like me

#

But if an Addon was forcing those Hard/Easy, and the user was just pressing "Good", it would solve that

unique salmon Apr 16, 2025, 7:15 PM

#

Going back to Evaluate

66% of users who use FSRS use Evaluate regularly
Only 23% of them can give ranges of sane values
Only 21% of them know the math (that's actually surprisingly high, I thought it will be like 2%)
Only 30% of them believe that removing Evaluate will not be bad for them

bold terrace Apr 16, 2025, 7:15 PM

#

But FSRS optimizer wouldn't have to change at all since the time info is captured in those 3 values

polar maple Apr 16, 2025, 7:17 PM

#

iirc i did small tests with LSTM and excluding duration information affected log loss by ~0.001 and treating hard = good = easy affected log loss by ~0.003

bold terrace Apr 16, 2025, 7:18 PM

#

Hope crushed

unique salmon Apr 16, 2025, 7:18 PM

#

Speaking of which
binary means "hard = good = easy"

#

I'm surprised by how not-shit it is

#

And better than FSRS-5 btw

cursive badge Apr 16, 2025, 7:19 PM

#

I still suspect time-to-flip could be useful.

unique salmon Apr 16, 2025, 7:19 PM

#

So FSRS-6 with "pretend that Hard = Good = Easy" is still better than FSRS-5

#

(marginally)

#

Maybe 4 buttons really are placebo

bold terrace Apr 16, 2025, 7:21 PM

#

polar maple iirc i did small tests with LSTM and excluding duration information affected log...

Another idea, another hope ! Engineering a feature that would represent how much the card front info is represented in the deck !

For example, if the front of cards is : A, AB, B, C, D, E, you'd have higher featuer for A, B, AB than for C D E

unique salmon Apr 16, 2025, 7:21 PM

#

bold terrace Another idea, another hope ! Engineering a feature that would represent how much...

Nope, no can do

#

Anything that involves the content of the card is a "no"

#

Only soulless numbers and IDs

bold terrace Apr 16, 2025, 7:21 PM

#

What do you think @polar maple 😄 ?

polar maple Apr 16, 2025, 7:21 PM

#

yeah we just don't have the info available to us

unique salmon Apr 16, 2025, 7:21 PM

#

As in "you LITERALLY can't", not "Expertium is telling you it's bad"

bold terrace Apr 16, 2025, 7:22 PM

#

aah

#

User Card data is not shared ?

cursive badge Apr 16, 2025, 7:22 PM

#

Not in the benchmark dataset

polar maple Apr 16, 2025, 7:22 PM

#

the dream is that we get some vocabulary deck data so i can throw it at a nn to let it figure it out

#

use some word/sentence embedding nn to encode the card info

bold terrace Apr 16, 2025, 7:23 PM

#

Would 74K reviews suffice ?

polar maple Apr 16, 2025, 7:23 PM

#

prob not

unique salmon Apr 16, 2025, 7:23 PM

#

bold terrace User Card data is not shared ?

The dataset is anonymized. No text, no audio, no images. Only deck IDs, preset IDs, card IDs and note IDs

cursive badge Apr 16, 2025, 7:23 PM

#

bold terrace User Card data is not shared ?

You can see all the available columns on the huggingface page: https://huggingface.co/datasets/open-spaced-repetition/anki-revlogs-10k

open-spaced-repetition/anki-revlogs-10k · Datasets at Hugging Face

bold terrace Apr 16, 2025, 7:26 PM

#

Sad, there are few things that could be useful while being anonymous (glossary, front, ...)

#

I even started noting the words I confused with other when I did

#

Imagine this on NN

unique salmon Apr 16, 2025, 7:27 PM

#

unique salmon Going back to Evaluate 1) 66% of users who use FSRS use Evaluate regularly 2) On...

Going back to Evaluate

66% of users who use FSRS use Evaluate regularly

Only 23% of them can give ranges of sane values

Only 21% of them know the math (that's actually surprisingly high, I thought it will be like 2%)

Only 30% of them believe that removing Evaluate will not be bad for them

So...now what?

#

https://tenor.com/view/finding-nemo-bags-floating-stuck-now-what-gif-5473087

Tenor

now what

▶ Play video

cursive badge Apr 16, 2025, 7:28 PM

#

bold terrace Sad, there are few things that could be useful while being anonymous (glossary, ...

Until someone syncs their "memorizing people's phone numbers" deck with AnkiWeb...

bold terrace Apr 16, 2025, 7:28 PM

#

cursive badge Until someone syncs their "memorizing people's phone numbers" deck with AnkiWeb....

Could do some anonimzation on those !

unique salmon Apr 16, 2025, 7:28 PM

#

Deck1::Subdeck2::NuclearCodes

bold terrace Apr 16, 2025, 7:28 PM

#

For example if you Remove "Screenshot" and "Sentence" in mine, you won't see my dirty talk

polar maple Apr 16, 2025, 7:29 PM

#

unique salmon > Going back to Evaluate > 1) 66% of users who use FSRS use Evaluate regularly >...

do nothing, seems like a significant portion of users get something useful out of evaluate

cursive badge Apr 16, 2025, 7:29 PM

#

I'm assuming Dae does not want to hand check 10k users worth of decks for sensitive data.

bold terrace Apr 16, 2025, 7:29 PM

#

cursive badge I'm assuming Dae does not want to hand check 10k users worth of decks for sensit...

Sure but those 10K are set in stone, but what about the future !

ashen light Apr 16, 2025, 7:30 PM

#

just go on r/anki and ask for volunteers

#

won't get 10k but hey you might get 100

bold terrace Apr 16, 2025, 7:31 PM

#

I'm sure 90% of people don't even mine their own card

ashen light Apr 16, 2025, 7:31 PM

#

make the fsrs helper addon have a button to upload a deck to allow science to be done on it

bold terrace Apr 16, 2025, 7:31 PM

#

they just download a shared core deck

cursive badge Apr 16, 2025, 7:31 PM

#

Have an "I donate my decks to science" setting inside Anki ;p

ashen light Apr 16, 2025, 7:31 PM

#

insidious: make the fsrs helper addon just do that with no prompting 🍃

bold terrace Apr 16, 2025, 7:31 PM

#

With shared decks, infering card relationship would even be easier since we'd have huge amount of data

ashen light Apr 16, 2025, 7:31 PM

#

you'll kill your reputation but who cares when you have all that fresh real data

bold terrace Apr 16, 2025, 7:32 PM

#

ashen light you'll kill your reputation but who cares when you have all that fresh real data

Didn't stop Facebook/Twitter/Google to be where they at

ashen light Apr 16, 2025, 7:32 PM

#

yeah but they ahve money

bold terrace Apr 16, 2025, 7:32 PM

#

Because they didn't care about their reputation first

ashen light Apr 16, 2025, 7:32 PM

#

I mean when you control the flow of info you can just hide flows that make you look bad

#

¯_(ツ)_/¯

unique salmon Apr 16, 2025, 7:32 PM

#

https://www.theverge.com/2021/5/29/22459869/us-soldiers-leaked-nuclear-info-online-flashcard-apps

The Verge

US soldiers reportedly leaked nuclear info online accidentally, by ...

The information has since been removed from the sites

ashen light Apr 16, 2025, 7:33 PM

#

what I'm hearing is we need fsrs incorporated first, then we can use it to steal all the decks 🍃

#

hahah you weren't even memeing about nuclear codes

bold terrace Apr 16, 2025, 7:33 PM

#

unique salmon https://www.theverge.com/2021/5/29/22459869/us-soldiers-leaked-nuclear-info-onli...

Not even in Anki, damn ... on "Chegg"

#

cursive badge Apr 16, 2025, 7:33 PM

#

I really want WaniKani to donate their dataset to science.

#

They have a massive dataset and all the "cards" already have nice links showing how they are related.

bold terrace Apr 16, 2025, 7:35 PM

#

cursive badge I really want WaniKani to donate their dataset to science.

Imagine DuoLingo

#

Never used it

#

But when I read that I'm like maybe it's not too late

cursive badge Apr 16, 2025, 7:36 PM

#

Unfortunately the WK SRS is terrible 😦

#

They just use fixed intervals. Not even SM2 levels of adapting to the user 😦

unique salmon Apr 16, 2025, 7:43 PM

#

bold terrace Imagine DuoLingo

Don't have to imagine: https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/N8XJME
Only 13 million reviews though, it's way smaller than our 10k dataset

#

Excerpt
p_recall,timestamp,delta,user_id,learning_language,ui_language,lexeme_id,lexeme_string,history_seen,history_correct,session_seen,session_correct 1.0,1362076081,27649635,u:FO,de,en,76390c1350a8dac31186187e2fe1e178,lernt/lernen<vblex><pri><p3><sg>,6,4,2,2 0.5,1362076081,27649635,u:FO,de,en,7dfd7086f3671685e2cf1c1da72796d7,die/die<det><def><f><sg><nom>,4,4,2,1 1.0,1362076081,27649635,u:FO,de,en,35a54c25a2cda8127343f6a82e6f6b7d,mann/mann<n><m><sg><nom>,5,4,1,1 0.5,1362076081,27649635,u:FO,de,en,0cf63ffe3dda158bc3dbd55682b355ae,frau/frau<n><f><sg><nom>,6,5,2,1 1.0,1362076081,27649635,u:FO,de,en,84920990d78044db53c1b012f5bf9ab5,das/das<det><def><nt><sg><nom>,4,4,1,1 1.0,1362076081,27649635,u:FO,de,en,56429751fdaedb6e491f4795c770f5a4,der/der<det><def><m><sg><nom>,4,3,1,1 1.0,1362076081,27649635,u:FO,de,en,1bacf218eaaf9f944e525f7be9b31899,kind/kind<n><nt><sg><nom>,4,4,1,1 1.0,1362082032,444407,u:dDwF,es,en,73eecb492ca758ddab5371cf7b5cca32,bajo/bajo<pr>,3,3,1,1 1.0,1362082044,5963,u:FO,de,en,76390c1350a8dac31186187e2fe1e178,lernt/lernen<vblex><pri><p3><sg>,8,6,6,6 0.75,1362082044,5963,u:FO,de,en,7dfd7086f3671685e2cf1c1da72796d7,die/die<det><def><f><sg><nom>,6,5,4,3 0.888888888889,1362082044,5963,u:FO,de,en,35a54c25a2cda8127343f6a82e6f6b7d,mann/mann<n><m><sg><nom>,6,5,9,8 0.8,1362082044,5963,u:FO,de,en,0cf63ffe3dda158bc3dbd55682b355ae,frau/frau<n><f><sg><nom>,8,6,5,4 0.8,1362082044,5963,u:FO,de,en,84920990d78044db53c1b012f5bf9ab5,das/das<det><def><nt><sg><nom>,5,5,5,4 1.0,1362082044,5963,u:FO,de,en,56429751fdaedb6e491f4795c770f5a4,der/der<det><def><m><sg><nom>,5,4,5,5 1.0,1362082044,5963,u:FO,de,en,1bacf218eaaf9f944e525f7be9b31899,kind/kind<n><nt><sg><nom>,5,5,3,3 1.0,1362082130,77,u:dDwF,es,en,73eecb492ca758ddab5371cf7b5cca32,bajo/bajo<pr>,5,5,1,1 0.0,1362082194,150,u:FO,de,en,84920990d78044db53c1b012f5bf9ab5,das/das<det><def><nt><sg><nom>,10,9,1,0 1.0,1362082194,150,u:FO,de,en,35a54c25a2cda8127343f6a82e6f6b7d,mann/mann<n><m><sg><nom>,15,13,1,1

#

No idea what some of these mean, but whatever

polar maple Apr 16, 2025, 7:49 PM

#

what could p_recall be?

unique salmon Apr 16, 2025, 7:49 PM

#

No, that's the easiest one 🤣

polar maple Apr 16, 2025, 7:50 PM

#

it's not 0/1

#

did they measure over a session or over a set of users or something

#

for the same item

#

or did they include their HLR predictions into the dataset itself

unique salmon Apr 16, 2025, 7:50 PM

#

Per session, it seems

polar maple Apr 16, 2025, 7:51 PM

#

unlucky

#

per session data is not useful

unique salmon Apr 16, 2025, 7:51 PM

#

polar maple or did they include their HLR predictions into the dataset itself

No, based on the last two column names

#

0.888888888889,1362082044,5963,u:FO,de,en,35a54c25a2cda8127343f6a82e6f6b7d,mann/mann<n><m><sg><nom>,6,5,9,8
If session_seen=9 and session_correct=8, that gives us 0.888888888889

#

So yeah, checks out

polar maple Apr 16, 2025, 7:52 PM

#

yeah this isn't usable for us

unique salmon Apr 16, 2025, 8:06 PM

#

polar maple or did they include their HLR predictions into the dataset itself

Btw, I find it funny that Duolingo reports lower AUC on their own dataset than we on our
https://github.com/open-spaced-repetition/srs-benchmark
HLR 3 0.41±0.012 0.105±0.0030 0.633±0.0050
https://github.com/duolingo/halflife-regression/blob/master/settles.acl16.pdf

GitHub

GitHub - open-spaced-repetition/srs-benchmark: A benchmark for spac...

A benchmark for spaced repetition schedulers/algorithms - open-spaced-repetition/srs-benchmark

GitHub

halflife-regression/settles.acl16.pdf at master · duolingo/halflif...

Contribute to duolingo/halflife-regression development by creating an account on GitHub.

polar maple Apr 16, 2025, 8:11 PM

#

maybe they have a bug in their implementation

#

jk but 0.538 is very low

unique salmon Apr 16, 2025, 8:13 PM

#

I'm not joking when I'm telling Jarrett to contact Duolingo and just straight up tell them "HLR sucks, use FSRS instead"

south lodge Apr 16, 2025, 8:14 PM

#

Is there enough outcome reporting to make that claim convincing?

unique salmon Apr 16, 2025, 8:16 PM

#

Considering that we have a dataset with ~700 million reviews and Duolingo thought that 13 million reviews was good enough for their paper - yes

south lodge Apr 16, 2025, 8:18 PM

#

Review count and external testing results might not necessarily correlate with each other

bold terrace Apr 16, 2025, 8:25 PM

#

unique salmon I'm not joking when I'm telling Jarrett to contact Duolingo and just straight up...

They will hire him and forbid him from contributing on FSRS anymore 😦

#

Well, maybe right now US-China relationship are not that great for international hiring though

quasi shadow Apr 17, 2025, 2:23 AM

#

#

polar maple Apr 17, 2025, 2:33 AM

#

50.1% wow

quasi shadow Apr 17, 2025, 3:46 AM

#

Should I remove L2 regularization when changing the default value?

#

#

😅I guess we don't need to check the distribution.

#

After I change the default value of w[11] from 1.8 to 4.0, the median of optimized value of w[11] is 3.77.

#

Notice that median values of w[12] and w[13] aer also changed significantly.

#

#

When w[12] increases, S_fail decreases.

#

When w[13] decreases, S_fail decreases, too.

#

When w[11] increases, S_fail increases.

#

So, in some degree, the changes of w[12] and w[13] compensate the change of w[11].

quasi shadow Apr 17, 2025, 4:12 AM

#

polar maple maybe they have a bug in their implementation

Their paper has a several problem.

#

They uses the in-day correct rate of a word as the P(recall).

#

It assumes the trials of the same word in the session are iid but it's not true.

#

#

This one is also problematic when p = 0 or 1.

#

😅 I don't know whether they were aware of these problems. But the paper was accepted. That's why I thought the peer review makes nonsense when the peer knows nothing about the niche domain.

south lodge Apr 17, 2025, 4:49 AM

#

Note that to prevent computational overflow and under-
flow errors, we bound ˆp_Θ ∈ [0.0001, 0.9999] and
ˆh_Θ ∈ [15 min, 9 months] in practice.
(fwiw, in A.3)

quasi shadow Apr 17, 2025, 6:29 AM

#

south lodge > Note that to prevent computational overflow and under- > flow errors, we bound...

Yeah, they were aware of that. But the solution was werid...

unique salmon Apr 17, 2025, 8:43 AM

#

quasi shadow Should I remove L2 regularization when changing the default value?

For testing the distribution issue, yez

quasi shadow Apr 17, 2025, 9:20 AM

#

the result is similiar

west whale Apr 17, 2025, 10:30 PM

#

Good work FSRS friends ❤️

Feat/FSRS-6 #3929

hasty fractal Apr 17, 2025, 10:37 PM

#

I thought Jarrett will stop at FSRS-5 lol.

#

Lunar new year is long gone, Jarrett is still here.

bold terrace Apr 17, 2025, 10:44 PM

#

Can't wait for it to reach Anki 🥲 Good job

quasi shadow Apr 18, 2025, 3:03 AM

#

hasty fractal I thought Jarrett will stop at FSRS-5 lol.

FSRS-5 has some severe problems in the formula of the same-day stability, so I have to fix it anyway.

#

😅 That's why I planned to release FSRS-5.5.

#

But I accepted more improvement ideas so we have FSRS-6.

#

#

@cosmic hedge I have a problem.

#

After I modify the Easy Days Config in the Options screen, the Easy Days Config in the Simulator screen doesn't keep sync with it.

clever cargo Apr 18, 2025, 3:46 AM

#

quasi shadow <@388069992660205588> I have a problem.

try this

diff --git a/ts/routes/deck-options/SimulatorModal.svelte b/ts/routes/deck-options/SimulatorModal.svelte
index 1b587c985..afd0d1eb2 100644
--- a/ts/routes/deck-options/SimulatorModal.svelte
+++ b/ts/routes/deck-options/SimulatorModal.svelte
@@ -178,7 +178,7 @@ License: GNU AGPL, version 3 or later; http://www.gnu.org/licenses/agpl.html
         );
     }
 
-    let easyDayPercentages = [...$config.easyDaysPercentages];
+    $: easyDayPercentages = [...$config.easyDaysPercentages];
 </script>
 
 <div class="modal" class:show={shown} class:d-block={shown} tabindex="-1">

quasi shadow Apr 18, 2025, 4:32 AM

#

It works!

#

Thank you.

cosmic hedge Apr 18, 2025, 6:23 AM

#

clever cargo try this ```diff diff --git a/ts/routes/deck-options/SimulatorModal.svelte b/ts/...

I remember removing this on purpose for some reason
https://github.com/ankitects/anki/pull/3837/commits/8086edca5e19f8d02cf97072d4c3453142fd2bdd
I think it might have been save to preset options. Maybe it was that I used a subscribe for some reason. idk so long as it works. 😂

robust hill Apr 18, 2025, 8:29 AM

#

when fsrs 6 coming

#

how do i put it inside my anki

lapis hearth Apr 18, 2025, 8:33 AM

#

robust hill how do i put it inside my anki

dae still has to review and merge it

robust hill Apr 18, 2025, 8:35 AM

#

noooooo

lapis hearth Apr 18, 2025, 8:36 AM

#

But dae takes 10 working days to respond to smth

#

and 10 more days to make a new build

#

Could someone bring this to daes attention.

quasi shadow Apr 18, 2025, 8:55 AM

#

My colleagues have finished the refactoring of our App's scheduling module recently, so I will take over the rest of work (refactoring the long-term scheduling algorithm). So I won't have time to improve FSRS in the next several months.

robust hill Apr 18, 2025, 9:02 AM

#

damn

#

went out with a bang i see

unique salmon Apr 18, 2025, 9:02 AM

#

cosmic hedge I remember removing this on purpose for some reason https://github.com/ankitect...

Btw, other than CMRR, there's this: https://forums.ankiweb.net/t/desired-retention-ui-overhaul/57678/33?u=expertium
But it hasn't got an explicit ok from Dae

And there's also this: https://forums.ankiweb.net/t/ideas-to-make-deck-preset-interactions-more-clear/58773/5?u=expertium
Which also hasn't got an explicit ok from Dae
FeelsBadAnki

Anki Forums

Desired Retention UI Overhaul

Ok, how about an idea suggested by Brayan: answer buttons that show interval lengths The interval lengths above answer buttons would change instantly when desired retention is changed More from Brayan: put the fsrs parameters at the bottom of the FSRS section and add some title to the “query input” (idk what is called the form below...

Anki Forums

Ideas to make deck/preset interactions more clear?

Yeah, that’s easier to implement and works better with 20+ presets.

robust hill Apr 18, 2025, 9:02 AM

#

dont worry

#

@lapis hearth will replace you

lapis hearth Apr 18, 2025, 9:07 AM

#

If I could I would

#

But I dont

unique salmon Apr 18, 2025, 9:55 AM

#

The realest answer (from my survey on Evaluate)

#

Btw, results: https://docs.google.com/forms/d/1Uy8zr9QOS6u-oLVRwVCuQfyFUiSwKxt9pWEvlTGdn9k/viewanalytics

53% of users use Evaluate regularly
Only 15% can give a range of reasonable values
Only 14% know the mathematical formulas used to calculate the values
33% believe that removing Evaluate would have a negative impact on their Anki routine, 30% are unsure, and 37% believe it wouldn't have a negative impact

Google Docs

FSRS and "Evaluate"

robust hill Apr 18, 2025, 12:54 PM

#

also question

#

why doesnt hard count as a good for first time cards?

#

as fulfilling one of the learning steps

#

curious why it was that way

unique salmon Apr 18, 2025, 12:54 PM

#

Because learning steps are shit

robust hill Apr 18, 2025, 12:55 PM

#

alright

unique salmon Apr 18, 2025, 12:55 PM

#

I've said this many times - the whole thing with learning steps shouldn't exist in the first place

robust hill Apr 18, 2025, 12:55 PM

#

how should it work then

unique salmon Apr 18, 2025, 12:55 PM

#

It's a mess

unique salmon Apr 18, 2025, 12:55 PM

#

robust hill how should it work then

Just the same algorithm for all intervals, from minutes to years

robust hill Apr 18, 2025, 12:56 PM

#

i see

#

well

#

doesnt that mean when u learn new things u have to be on anki the whole day

#

if u learn it in the morning

#

does retention decrease equally

#

e.g. over 8 waking hours vs 8 sleeping hours

unique salmon Apr 18, 2025, 12:58 PM

#

That's a very good question. I don't know 🤷‍♂️

robust hill Apr 18, 2025, 1:00 PM

#

no and the reason is

#

sleep is when memory consolidation begins (oversimplified)

#

boom cooked

#

unfortunately i cannt bring myself to do new cards at the end of the day 💔

#

billions must nap

robust hill Apr 18, 2025, 1:24 PM

#

@unique salmon after starting a new deck

#

with lets say 30 new cards a day

#

when would you recommend to start the first optimization

#

i use default fsrs parameters

#

havent optimized yet

unique salmon Apr 18, 2025, 1:25 PM

#

Whenever you want, really

robust hill Apr 18, 2025, 1:29 PM

#

alright

#

sounds good

unique salmon Apr 18, 2025, 1:57 PM

#

https://www.reddit.com/r/Anki/comments/1k23tvn/atrocious_true_retention/

I also haven't optimized FSRS yet because I didn't know this was an option.

From the Anki community on Reddit

Explore this post and more from the Anki community

#

😭

#

#

I swear, if FSRS only had one toggle, people would still find ways to not use it properly

#

So this guy didn't realize that optimization is a thing AND he also didn't realize that he can control interval lengths by adjusting desired retention

ashen light Apr 18, 2025, 2:04 PM

#

maybe the problem is that people have to hit secret hidden buttons

unique salmon Apr 18, 2025, 2:05 PM

#

ashen light maybe the problem is that people have to hit secret hidden buttons

secret
a giant-ass blue button in the middle of the screen

ashen light Apr 18, 2025, 2:05 PM

#

its in a corner with like 50 other things to also look at

unique salmon Apr 18, 2025, 2:05 PM

#

Like, I can see not realizing that DR affects interval lengths if you have never changed DR, but not realizing that optimization is a thing...

#

We need an interactive tutorial so bad, man

ashen light Apr 18, 2025, 2:08 PM

#

no one would use it

#

or, the type of person who would is also the type to not have this problem in the first place

#

whats needed is basically an exam when you open anki the first time, you gotta answer a bunch of questions that shows you read the manual

#

only then can you use the program

unique salmon Apr 18, 2025, 2:10 PM

#

kek

#

Just make the interactive tutorial unskippable

ashen light Apr 18, 2025, 2:11 PM

#

have fun implementing such a thing

soft skiff Apr 18, 2025, 2:26 PM

#

Hi, guys, how many new cards can i learn every day, which is the upper limit of human cognition?

unique salmon Apr 18, 2025, 2:33 PM

#

soft skiff Hi, guys, how many new cards can i learn every day, which is the upper limit of ...

That's a difficult question, and there aren't a lot of good estimates
https://supermemo.guru/wiki/How_much_knowledge_can_human_brain_hold

The upper limit of knowledge for a human brain may amount to 300,000 stable items of knowledge as consolidated in spaced repetition
Items = cards in Anki
Over 50 years, that's 6000 cards per year aka ~16 new cards per day

How much knowledge can human brain hold

robust hill Apr 18, 2025, 2:33 PM

#

lol

#

thats amazing

#FSRS Megathread