quaint wagon Aug 11, 2024, 1:09 PM

#

https://gregdan3.github.io/ilo-muni/

if you look up word data in ilo Muni, share your method and graphs in this channel!

ilo Muni

Watch toki pona grow and change- now with graphs!

edgy cradle Aug 11, 2024, 1:10 PM

#

wan

quaint wagon Aug 11, 2024, 1:11 PM

#

tan seme

edgy cradle Aug 11, 2024, 1:11 PM

#

here's something i noticed

#

the relative mode is less accurate further back in time

wicked stratus Aug 11, 2024, 1:12 PM

#

san

quaint wagon Aug 11, 2024, 1:12 PM

#

https://gregdan3.github.io/ilo-muni/?query=sina+seme+-+sina+seme_3%2C+sina+pali+e+seme+-+sina+pali+e+seme_5&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2

wicked stratus Aug 11, 2024, 1:12 PM

#

I actually have cool graphs I'm just not at home rn 😔

quaint wagon Aug 11, 2024, 1:13 PM

#

edgy cradle the relative mode is less accurate further back in time

yes, that's true
there's less toki pona going on that far back
the data actually goes to 2010, but barely so, so i ignored it lol

#

this is why there are multiple scales offered tho

icy turret Aug 11, 2024, 1:13 PM

#

nanpa wan is roughly twice as common as nanpa tu

whole epoch Aug 11, 2024, 1:14 PM

#

https://gregdan3.github.io/ilo-muni/?query=nanpa+open%2C+nanpa+pini&minSentLen=1&scale=rel&start=1533081600&end=1722470400&smoothing=1
it's interesting how nanpa open and nanpa pini only started to be used around 2020, with nanpa pini being way more common than nanpa open (probably cause nanpa wan expresses a similar idea)

icy turret Aug 11, 2024, 1:14 PM

#

indirect evidence for unpopularity of sona-preverb

wicked stratus Aug 11, 2024, 1:15 PM

#

edgy cradle Aug 11, 2024, 1:15 PM

#

quaint wagon yes, that's true there's less toki pona going on that far back the data actually...

if 20 people say something and there's 1,000 texts in the month, then that's 2%
but if 20 people say something and there's 100,000 texts in the month, then that's 0.02%
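
A minimal sketch of the arithmetic above, assuming the relative scale is simply occurrences divided by the month's total. The function name and the divisor are assumptions for illustration, not ilo Muni's actual code.

```python
def relative_frequency(occurrences: int, total: int) -> float:
    """Share of the month's total, as a percentage (assumed definition)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * occurrences / total


# Same 20 occurrences, very different months.
small_month = relative_frequency(20, 1_000)
large_month = relative_frequency(20, 100_000)
print(f"{small_month:.2f}% vs {large_month:.2f}%")
```

With a small total, a handful of messages moves the line a lot, which is why the early years look noisier in relative mode.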

whole epoch Aug 11, 2024, 1:16 PM

#

icy turret indirect evidence for unpopularity of sona-preverb

indirect evidence for the popularity(?) of preverb open and preverb pini

#

novel lintel Aug 11, 2024, 1:16 PM

#

mi anpa lon e kokosila :3 https://gregdan3.github.io/ilo-muni/?query=penpo%2C+penpo+-+ilo+penpo+o+lukin+ala%2C+kokosila&minSentLen=1&scale=rel&start=1659312000&end=1722470400&smoothing=2

quaint wagon Aug 11, 2024, 1:16 PM

#

whole epoch https://gregdan3.github.io/ilo-muni/?query=nanpa+open%2C+nanpa+pini&minSentLen=1...

there's a related phenomenon where the frequency of the most used words is going down relative to less used words since anywhere from march 2020 to may 2021, continuing to today
(you can see it in that graph of pona above)
i... don't know what this means tho :P

quaint wagon Aug 11, 2024, 1:16 PM

#

whole epoch

psst, check the absolute mode
there's like 30 occurrences there :P

icy turret Aug 11, 2024, 1:16 PM

#

damn, its not often that you can attribute a word to one month and then essentially never again

wicked stratus Aug 11, 2024, 1:17 PM

#

novel lintel mi anpa lon e kokosila :3 <https://gregdan3.github.io/ilo-muni/?query=penpo%2C+p...

oh gosh it's The Dip™️ mk. 2 /musi

whole epoch Aug 11, 2024, 1:17 PM

#

quaint wagon psst, check the absolute mode there's like 30 occurrences there :P

i will present my data in the most biased way as to make my opinion seem more correct
-# /musi
but yea fair lol

edgy cradle Aug 11, 2024, 1:17 PM

#

quaint wagon Aug 11, 2024, 1:18 PM

#

in general, take anything that has fewer than ~400 occurrences all-time (you can check with cumulative) with a grain of salt
that data can be relevant, but it can be much more easily swayed by errors, or affected by an individual speaker

#

if you wanna know how powerful exactly one speaker is, look up ilo o ken e toki ni

whole epoch Aug 11, 2024, 1:19 PM

#

quaint wagon there's a related phenomenon where the frequency of the most used words is going...

do you think it might be cause the community in general are speaking about more diverse topics? pona, toki and other common words are also more common among beginners/the start of conversations, so them decreasing in usage might imply that people are having more in-depth conversations/that "beginner conversations" are getting more uncommon

#

idk im thinking out loud

quaint wagon Aug 11, 2024, 1:20 PM

#

whole epoch do you think it might be cause the community in general are speaking about more ...

essentially yes! i said a similar thing yesterday, altho i described it as more complex topics rather than more diverse topics, but same principle

whole epoch Aug 11, 2024, 1:20 PM

#

gotcha

#

that might just be cause the toki pona community is bigger and has gotten less,,, meta for lack of a better term

quaint wagon Aug 11, 2024, 1:20 PM

#

ehehhe, yep

whole epoch Aug 11, 2024, 1:20 PM

#

using tp as the medium of conversation instead of as the topic

quaint wagon Aug 11, 2024, 1:21 PM

#

i am surprised i haven't seen anyone look up ki yet

to be clear, ki's results are nonsense and unavoidably so lmao

#

there are at least 7 things i'm aware of which ki appears in that are not uses of the toki pona word ki

whole epoch Aug 11, 2024, 1:22 PM

#

whole epoch indirect evidence for the popularity(?) of preverb open and preverb pini

indirect evidence of the unpopularity of preverb open/pini 😔

quaint wagon Aug 11, 2024, 1:23 PM

#

eeeheheh

novel lintel Aug 11, 2024, 1:23 PM

#

quaint wagon there are at least 7 things i'm aware of which ki appears in that are not uses o...

which toki pona word ki? :p · there are two of them

icy turret Aug 11, 2024, 1:25 PM

#

@quaint wagon feature suggestion: create a list of commentary matching particular search requests
use it to comment on searches that could be misleading

#

e.g. if someone looks up san remind them the search is partially "spoiled" by your presence

#

the counterargument to adding this is we cant possibly account for every misleading search so it might be better done in conversation than in gui

#

the countercounterargument is that we can account for the most potentially frequent of those

arctic yoke Aug 11, 2024, 1:29 PM

#

a lot of people, i have noticed, seem to think my name is a toki pona question word /musi

drifting breach Aug 11, 2024, 1:35 PM

#

icy turret indirect evidence for unpopularity of sona-preverb

i guess most of the speakers (at least the beginner ones) don't even know about this preverb🦦

round mural Aug 11, 2024, 1:37 PM

#

icy turret damn, its not often that you can attribute a word to *one month* and then essent...

it's almost like this word died along with the meme

tall rune Aug 11, 2024, 1:38 PM

#

what was itomi

whole epoch Aug 11, 2024, 1:38 PM

#

tall rune Aug 11, 2024, 1:39 PM

#

ah

mint knoll Aug 11, 2024, 2:29 PM

#

https://gregdan3.github.io/ilo-muni/?query=linluwi%2C+majuna%2C+jami&minSentLen=6&scale=abs&start=1470009600&end=1722470400&smoothing=2 compared the 3 finalists of utala pi nimi sin

thin rose Aug 11, 2024, 2:55 PM

#

mint knoll Aug 11, 2024, 3:14 PM

#

https://gregdan3.github.io/ilo-muni/?query=ojuta%2C+mijun&minSentLen=1&scale=abs&start=1596240000&end=1722470400&smoothing=2 ojuta vs mijun

icy turret Aug 11, 2024, 3:25 PM

#

alasa is growing in relative terms, both as a "standalone" predicate and as a preverb

fresh sentinel Aug 11, 2024, 3:34 PM

#

how does that compare to lukin preverb?

drifting breach Aug 11, 2024, 3:37 PM

#

isn't there a special syntax for marking only preverbs or only verbs or only smth?

icy turret Aug 11, 2024, 3:37 PM

#

there isn't

#

lukin is harder to estimate for this exact reason

verbal marten Aug 11, 2024, 4:26 PM

#

meager steeple Aug 11, 2024, 4:43 PM

#

#1187212477155528804

#

la nimi ona li awen lon tenpo mute

quaint wagon Aug 11, 2024, 5:13 PM

#

^ in the future i'll remove this channel from ilo Muni's data

brittle mulch Aug 11, 2024, 5:45 PM

#

[image attachment: graph, PNG]

quaint wagon Aug 11, 2024, 5:51 PM

#

woah how's it transparent

brittle mulch Aug 11, 2024, 5:52 PM

#

right click on the graph

#

and press copy image

quaint wagon Aug 11, 2024, 5:52 PM

#

WUH????

#

never knew that was a thing

brittle mulch Aug 11, 2024, 5:52 PM

#

SINA LI IJO E NI

icy turret Aug 11, 2024, 5:53 PM

#

yep thats a thing

quaint wagon Aug 11, 2024, 5:53 PM

#

THE GRAPH COMES FROM ChartJS

#

NOT FROM ME

brittle mulch Aug 11, 2024, 5:53 PM

#

a

quaint wagon Aug 11, 2024, 5:53 PM

#

I ONLY PROVIDE THE NUMBERS AND THE METHOD, NOT THE GRAPHICS

brittle mulch Aug 11, 2024, 5:53 PM

#

peli pani

quaint wagon Aug 11, 2024, 5:53 PM

#

seme

#

are you saying
very funny
perhaps

tall rune Aug 11, 2024, 5:53 PM

#

you can see when i joined the community :3

#

also it strips double letters :c

quaint wagon Aug 11, 2024, 5:55 PM

#

tall rune also it strips double letters :c

gonna be real, this will probably never not be a feature (necessity) of ilo muni
there's just Way too many ways to write words if i don't collapse duplicate letters
which makes the database too massive to deliver as i'm currently doing it

tall rune Aug 11, 2024, 5:55 PM

#

yeah thats fair

quaint wagon Aug 11, 2024, 5:55 PM

#

there's a similar but less massive problem with capitals

tall rune Aug 11, 2024, 5:55 PM

#

mhm

quaint wagon Aug 11, 2024, 5:57 PM

#

fwiw, in processing it's more like

collapse duplicates (preserving the first cap)
score
lowercase
frequency count
that is to say, i score things with their caps, then remove caps later for space/counting reasons
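
The four steps above can be sketched roughly like this. Everything here is a stand-in: `VOCAB`, `score`, and the 0.8 threshold are hypothetical, and the real scorer is not described in this chat. The only point is the ordering, where scoring sees capitals and lowercasing happens just before counting.

```python
from collections import Counter
from itertools import groupby

VOCAB = {"mi", "sina", "toki", "pona", "li", "e", "jan", "a"}  # tiny illustrative subset


def collapse_duplicates(word: str) -> str:
    """Collapse runs of the same letter, keeping the first character (and its case)."""
    return "".join(next(run) for _, run in groupby(word, key=str.lower))


def score(tokens: list[str]) -> float:
    """Hypothetical scorer: fraction of tokens that are known words or capitalized names."""
    if not tokens:
        return 0.0
    ok = sum(1 for t in tokens if t in VOCAB or t[:1].isupper())
    return ok / len(tokens)


def count_sentence(sentence: str, counts: Counter, threshold: float = 0.8) -> bool:
    tokens = [collapse_duplicates(t) for t in sentence.split()]  # 1. collapse duplicates
    if score(tokens) < threshold:                                # 2. score, caps intact
        return False
    counts.update(t.lower() for t in tokens)                     # 3. lowercase, 4. count
    return True


counts = Counter()
accepted = count_sentence("jan Sonja li tokii ponaaa", counts)
rejected = count_sentence("hello there friend", counts)
```

Scoring before lowercasing lets a capitalized name like `Sonja` count as valid, while the stored key is still the lowercase `sonja`.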

icy turret Aug 11, 2024, 5:59 PM

#

quaint wagon gonna be real, this will probably never not be a feature (necessity) of ilo muni...

you could solve that by first reading an english dictionary, then refusing to deduplicate letters if they match an english word
but thats frankly silly and unhelpful

quaint wagon Aug 11, 2024, 5:59 PM

#

icy turret you could solve that by first reading an english dictionary, then refusing to de...

other languages
phonomatches in english
phonomatches in other languages

#

these are really all the same problem

#

this also assumes the english words are, themselves, rendered appropriately

icy turret Aug 11, 2024, 6:01 PM

#

quaint wagon 1. other languages 2. phonomatches in english 3. phonomatches in other languages

okay. replace "solve" with "improve"

quaint wagon Aug 11, 2024, 6:01 PM

#

i'm not honestly sure it would even do that

#

not without a pretty sizeable manual processing step, anyway

tall rune Aug 11, 2024, 6:02 PM

#

apparently there was a little stella bump in june 2021

icy turret Aug 11, 2024, 6:02 PM

#

my hypothesis is that 1 is essentially not affected, and 2 3 are fewer than the number of english words that are correctly preserved by this

#

but like yeah

#

its so far removed from the point of the tool that its just not worth your time

merry walrus Aug 11, 2024, 6:58 PM

#

meager steeple Aug 11, 2024, 6:59 PM

#

"mi li" and "sina li" can still be found in sentences like "soweli mi li pona"

icy turret Aug 11, 2024, 6:59 PM

#

merry walrus

unfortunately this can't be used to draw conclusions about the frequency of ungrammatical li because what ilo tani said yeah

meager steeple Aug 11, 2024, 7:00 PM

#

as far as i know, mun wants to make this possible: with ^ mi li, it would search only at the start of a sentence

icy turret Aug 11, 2024, 7:10 PM

#

@quaint wagon check this out

#

two bits of insight from this

#

90% of toki pona is those words (i think is how youre meant to read it???)

quaint wagon Aug 11, 2024, 7:10 PM

#

nope the scale is wrong

icy turret Aug 11, 2024, 7:10 PM

#

fuc

quaint wagon Aug 11, 2024, 7:10 PM

#

the shape is right but the numbers are wrong

#

my bad

icy turret Aug 11, 2024, 7:11 PM

#

right, its the same scale problem

quaint wagon Aug 11, 2024, 7:11 PM

#

check again in relative; you'll have exactly one line anyway

icy turret Aug 11, 2024, 7:11 PM

#

anyway insight two is not affected by that issue

quaint wagon Aug 11, 2024, 7:12 PM

#

20% of toki pona is those words, btw
roughly

icy turret Aug 11, 2024, 7:12 PM

#

assuming early tokiponists weren't using really fancy toki pona which they weren't, the fact that these words add up to less probably means that early toki pona data has more non toki pona noise in it

quaint wagon Aug 11, 2024, 7:12 PM

#

oh yeah no doubt

#

to help a bit, move up to sentlen of 2

#

or even 3

icy turret Aug 11, 2024, 7:13 PM

#

minor gui nuisance: you might not want to allow users to do this

quaint wagon Aug 11, 2024, 7:13 PM

#

the math will be done against the total number of words, not sents of len 3+, but more words = more opportunities to score correctly

icy turret Aug 11, 2024, 7:14 PM

#

ye

#

@quaint wagon vertical lines grid in the graph background. are they hardcoded

quaint wagon Aug 11, 2024, 7:22 PM

#

nah they're a default of chart js that I did not investigate

#

i could do a ton more presentation wise

icy turret Aug 11, 2024, 7:23 PM

#

noted

#

@quaint wagon whats the correct way to compare the frequency of a vs a a vs a a a

#

i tried a - a a, a a - a a a, a a a - a a a a but from the vibe of the chart i feel like ive not considered something

#

misali has been declining in mentions

fresh sentinel Aug 11, 2024, 7:31 PM

#

oh, the

icy turret Aug 11, 2024, 7:32 PM

#

apeja li mi peaks during every sptp because we're a crowd and we shout it

#

pretty expected. had to use log scale to see toki Inli taso

#

#

drifting breach Aug 11, 2024, 7:46 PM

#

the unpa season

merry walrus Aug 11, 2024, 7:52 PM

#

I am going to listen and see what you are trying to say

#

Oh I've listened to þis before

merry walrus Aug 11, 2024, 8:09 PM

#

I listened to þe whole song really trying to interpret þe lyrics in a sexual way and it feels so much like a stretch þat I don't þink þis is what you're talking about or maybe I am just really misunderstanding what she is saying

#

Oh þere are lyrics

#

I'll come back

icy turret Aug 11, 2024, 8:12 PM

#

merry walrus I listened to þe whole song really trying to interpret þe lyrics in a sexual way...

in case youre in doubt, read what acronym the album name "utala ni (li) pona a" spells out :p

quaint wagon Aug 11, 2024, 8:13 PM

#

icy turret @quaint wagon whats the correct way to compare the frequency of a vs a a...

the best you can do is limit the sentence length and deal with that limited perspective
subtracting the number of a two up is actually more accurate, but more limited
otherwise, i still errantly double count in this context

#

this is a mistake google makes too, if their docs are to be trusted
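
A small sketch of why overlapping counts make `a` vs `a a` vs `a a a` hard to compare, assuming n-grams are counted with a sliding window (an assumption, the counting code isn't shown here). `run_lengths` is a hypothetical alternative that counts maximal runs instead.

```python
from collections import Counter
from itertools import groupby


def ngram_counts(tokens: list[str], max_n: int = 3) -> Counter:
    """Count every n-gram window up to max_n; overlapping windows all count."""
    counts = Counter()
    for n in range(1, max_n + 1):
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def run_lengths(tokens: list[str], word: str) -> Counter:
    """Count maximal runs of `word`, keyed by run length."""
    runs = Counter()
    for key, group in groupby(tokens):
        if key == word:
            runs[len(list(group))] += 1
    return runs


tokens = "a a a".split()
counts = ngram_counts(tokens)
runs = run_lengths(tokens, "a")
print(counts["a"], counts["a a"], counts["a a a"])
print(dict(runs))
```

One sentence of `a a a` yields three `a` and two `a a`, so `a - a a` comes out as 1 even though nobody said a lone `a`. Limiting sentence length sidesteps this because each sentence then holds a single run.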

icy turret Aug 11, 2024, 8:14 PM

#

sona pona

merry walrus Aug 11, 2024, 8:14 PM

#

icy turret in case youre in doubt, read what acronym the album name "utala ni (li) pona a" ...

Þis is really ugh

#

I really misunderstood her

#

Like misheard her words a lot

#

I loved þis song because it sounded like

#

two lovers who are apart woefully missing each oþer

#

But now þat I'm reading þe lyrics

#

It's just like

#

not þat

#

ugh

#

Anyways I want to find out what Majeka was being referred to here

#

Oh I found it

#

No?

#

Þere are instances of majeka as a magic nimisin in 2021

#

Like 3 times which ig aren't here

#

or maybe I'm dumb because I don't know how þis þing works

drifting breach Aug 11, 2024, 8:30 PM

#

do these spikes really exist?

merry walrus Aug 11, 2024, 8:31 PM

#

mi ni e mama sina li is powe despair

novel lintel Aug 11, 2024, 9:48 PM

#

icy turret you could solve that by first reading an english dictionary, then refusing to de...

nimi "AAA" li lon toki [🇨🇦] :p

short narwhal Aug 11, 2024, 9:49 PM

#

a tomo ni li lon

novel lintel Aug 11, 2024, 9:49 PM

#

drifting breach do these spikes really exist?

the numbers are really tiny · look at it in [Absolute] mode

short narwhal Aug 11, 2024, 10:00 PM

#

i turned the akesi graph into sound
(there are slight rounding errors due to how i made the wave in audacity: 1, normalizing to be within +/-1; 2, rounding to 1 decimal point; 3, each month was 1 sample point of a wave, so i slowed it down in audacity to auto smooth it)

brittle mulch Aug 12, 2024, 2:57 AM

#

#

all spiked oct 2021 hmm

edgy cradle Aug 12, 2024, 3:00 AM

#

merry walrus Aug 12, 2024, 4:33 AM

#

It seems þat (ik pi li is an old grammatical þing) people's grammar's getting better

uneven prairie Aug 12, 2024, 5:33 AM

#

icy turret nanpa wan is roughly twice as common as nanpa tu

but this nanpa tu includes nanpa tu wan and nanpa tu tu

fresh sentinel Aug 12, 2024, 7:38 AM

#

i'm also curious about

pi * e
li pi

icy turret Aug 12, 2024, 7:44 AM

#

fresh sentinel i'm also curious about - pi * e - li pi

data volume too low to be reliable

#

but for li pi we can at least say it was probably more used pre pandemic

#

and pre pandemic you legitimately had a lot of people learn from jan Pije so

#

then again jan Pije's site already purged weird usages after pu came out

fresh sentinel Aug 12, 2024, 7:45 AM

#

pandemic probably meant that lots of people were learning for the first time with good resources

icy turret Aug 12, 2024, 7:46 AM

#

not necessarily

#

jan lentan's came out in like what 2021? i forget

#

someone should check tbh

fresh sentinel Aug 12, 2024, 7:46 AM

#

good = avoiding li pi

#

what about pi * e? like "soweli pi moku e kala"

#

(i can't run Muni because it doesn't work in my browser at this time, i've filed a GitHub issue)

icy turret Aug 12, 2024, 7:48 AM

#

fresh sentinel what about `pi * e`? like "soweli pi moku e kala"

the thing is that ilo Muni doesn't do a straightup regex-like search over the text, it finds appropriate trigrams

#

and there are few trigrams that fit pi * e

#

most are too infrequent to be included

#

so the result is unrepresentative

#

and you cant sum them up, it shows as different lines

fresh sentinel Aug 12, 2024, 7:49 AM

#

ahhh

icy turret Aug 12, 2024, 7:49 AM

#

so for example

#

if there were 2 results for pi walo e

#

in all texts

#

ilo Muni doesn't store that trigram because not enough data

#

and won't display it here at all

#

it also only displays the top 10 ish(?) when doing a wildcard search
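
Roughly what that behaviour looks like, as a sketch. The cutoff of 3, the counts, and the function names are all made up for illustration; only the "drop rare trigrams, then show the top 10 or so matches" shape comes from the messages above.

```python
from collections import Counter

MIN_OCCURRENCES = 3  # hypothetical cutoff; the real one isn't stated here
TOP_K = 10           # "top 10 ish"


def build_store(raw_counts: Counter, min_occurrences: int = MIN_OCCURRENCES) -> dict:
    """Keep only n-grams seen often enough; rarer ones are never stored."""
    return {gram: n for gram, n in raw_counts.items() if n >= min_occurrences}


def wildcard_search(store: dict, pattern: str, top_k: int = TOP_K) -> list:
    """Match stored n-grams of the same length, where * stands for exactly one word."""
    want = pattern.split()
    hits = []
    for gram, n in store.items():
        words = gram.split()
        if len(words) == len(want) and all(w in ("*", g) for w, g in zip(want, words)):
            hits.append((gram, n))
    hits.sort(key=lambda hit: (-hit[1], hit[0]))
    return hits[:top_k]


# Invented counts, purely for the example.
raw = Counter({"pi walo e": 2, "pi moku e": 5, "pi pali e": 3, "li moku e": 40})
store = build_store(raw)
results = wildcard_search(store, "pi * e")
print(results)
```

`pi walo e` never reaches the store, so it can't show up in the graph, and each surviving match is its own line rather than one summed `pi * e` line.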

icy turret Aug 12, 2024, 8:13 AM

#

@quaint wagon smoothing suggestion: try a kernel with soft edges

#

google ngrams doesn't do that but it feels like itd be better at literal smoothing of the graph

#

so instead of peaks becoming plateaus, they would become more spread out peaks

#

-# ofc this comes with the disclaimer that this feature request is for whenever you feel like working on muni again
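
To make the plateau vs peak point concrete, here's a sketch comparing a flat moving average with a triangular kernel. The triangular weights are just one example of "soft edges", picked for illustration; nothing here is ilo Muni's current smoothing code.

```python
def smooth(values: list[float], weights: list[float]) -> list[float]:
    """Weighted moving average; at the series edges the window shrinks and renormalizes."""
    half = len(weights) // 2  # weights should have odd length
    out = []
    for i in range(len(values)):
        total = 0.0
        weight_sum = 0.0
        for k, w in enumerate(weights):
            j = i + k - half
            if 0 <= j < len(values):
                total += w * values[j]
                weight_sum += w
        out.append(total / weight_sum)
    return out


series = [0, 0, 0, 9, 0, 0, 0]   # one spiky month
box = smooth(series, [1, 1, 1])  # flat kernel
tri = smooth(series, [1, 2, 1])  # soft-edged kernel
print(box)
print(tri)
```

The flat kernel turns the spike into a three-month plateau, while the triangular one keeps a single, lower and wider peak.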

quaint wagon Aug 12, 2024, 12:05 PM

#

fresh sentinel (i can't run Muni because it doesn't work in my browser at this time, i've filed...

I replied btw, with a few sites you can test to investigate what component is the broken one

pseudo smelt Aug 12, 2024, 12:33 PM

#

https://gregdan3.github.io/ilo-muni/?query=mu%2C+kalama&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2
mu is growing relative to kalama

mint knoll Aug 12, 2024, 12:42 PM

#

https://gregdan3.github.io/ilo-muni/?query=pu%2C+ku%2C+su&minSentLen=1&scale=abs&start=1470009600&end=1722470400&smoothing=2

#

pu ku su

quaint wagon Aug 12, 2024, 1:34 PM

#

I wonder if I could deliver the given graph as the open graph image for the site, if the URL params are filled in

#

That's... Probably irresponsible with my database? I actually don't know what it looks like networking wise when you fetch the metadata of a link
Is discord doing that for you and then sending you the result? Or is it each individual who sees the link?

tall rune Aug 12, 2024, 1:36 PM

#

it does it for everyone individually

#

theres this one website whose embed is a simple math problem, except its randomly generated every time, so when you post it, everyone sees something different

tall rune Aug 12, 2024, 1:41 PM

#

icy turret the thing is that ilo Muni doesn't do a straightup regex-like search over the te...

its still an interesting graph tho

short narwhal Aug 12, 2024, 2:21 PM

#

pi * e = pie 🥧 👍

fresh sentinel Aug 12, 2024, 2:45 PM

#

tall rune its still an interesting graph tho

ooooh

quaint wagon Aug 12, 2024, 3:05 PM

#

tall rune theres this one website whose embed is a simple math problem, except its randoml...

Ooo that's super neat
And actually brings up a different question:
I probably need to have an actual server to do this trick, huh?
All my JS is client side lmao, so that idea is not happening
Although it does occur to me that an alternate way for me to deliver the app would be to have the entire thing be hosted on like, Vercel? And keep the entire DB in memory, using just sql.js on the server side
Something to investigate for later

tall rune Aug 12, 2024, 3:08 PM

#

https://txnor.com/mathchallenge

#

this is the thing

quaint wagon Aug 12, 2024, 3:23 PM

#

LMAO

mint knoll Aug 12, 2024, 3:26 PM

#

tall rune https://txnor.com/mathchallenge

7

tall rune Aug 12, 2024, 3:26 PM

#

nah its 19 trust

quaint wagon Aug 12, 2024, 3:42 PM

#

fresh sentinel what about `pi * e`? like "soweli pi moku e kala"

actually, this is a search i could allow
the reason I limit it to standing in for a single word is because if you search for the top 10 matches of a given form like this, you'll only get back 10 phrases of the minimum matchable length for the search; shorter phrases are also more numerous.

#

granted there are some places around funky grammatical features that could be exceptions, but there won't be many of them

fresh sentinel Aug 12, 2024, 3:43 PM

#

ooh

quaint wagon Aug 12, 2024, 3:45 PM

#

what i could do is performance testing on allowing multiple wildcards; there are few enough terms available that it could work

whole epoch Aug 12, 2024, 3:49 PM

#

it's interesting how, despite interjection lon becoming more unpopular, interjection ni has stayed at a steady level

#

also cool to see how "mi mute" for "we" is on a steady decline
back when i learnt tp originally way back when, it was almost universal to use it (as i remember it anyways)

icy turret Aug 12, 2024, 4:01 PM

#

whole epoch it's interesting how, despite interjection lon becoming more unpopular, interjec...

lon-yes was quiiite commonplace

#

its been backlashed

whole epoch Aug 12, 2024, 4:01 PM

#

yeye

#

i just expected ni to have gotten more popular

quaint wagon Aug 12, 2024, 4:15 PM

#

@icy turret i think it was you who posted it but i can't find it; you posted the comparison between nimisin and a few of the other "standard low use but around" type words like linluwi and majuna? and i have thoughts about that as well, actually
well, one thought really:
a user of the word nimisin is much more likely to talk about lower use words; conversely, those who don't use the word nimisin talk about newer words less, resulting in the phrase nimi sin being about as used as the word nimisin.
in fact, this is something i could probably demonstrate in my primary db? by counting the number of distinct authors who have sentences containing nimisin, versus the number of distinct authors who have sentences containing the phrase nimi sin
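
A sketch of that distinct-author comparison on toy data. The `(author, text)` tuples and author ids are invented, and the real primary db is not queried like this; it only shows the counting idea.

```python
def distinct_authors(sentences: list[tuple[str, str]], phrase: str) -> set[str]:
    """Authors with at least one sentence containing `phrase` as whole consecutive words."""
    target = phrase.split()
    n = len(target)
    authors = set()
    for author, text in sentences:
        words = text.lower().split()
        if any(words[i:i + n] == target for i in range(len(words) - n + 1)):
            authors.add(author)
    return authors


# Invented sentences, purely for the example.
sentences = [
    ("author-1", "nimisin li pona"),
    ("author-1", "mi kepeken nimisin mute"),
    ("author-2", "nimi sin li ike"),
    ("author-3", "mi sona e nimi sin"),
]
nimisin_authors = distinct_authors(sentences, "nimisin")
nimi_sin_authors = distinct_authors(sentences, "nimi sin")
print(len(nimisin_authors), len(nimi_sin_authors))
```

In this toy set both terms appear twice, but `nimisin` comes from one author and `nimi sin` from two, which is the kind of split frequency alone can't show.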

fresh sentinel Aug 12, 2024, 4:31 PM

#

i am generally curious about like
if Muni describes a word as popular
is it because many people are using the word, or because spiders Georg is using the word a lot

quaint wagon Aug 12, 2024, 4:36 PM

#

i know for a fact this is happening to lipamanka and nano, because their names are primarily said by themselves
but determining that currently requires checking manually

#

you can be extremely confident that words ranked in the top 150 are actually in use by a variety of speakers, but confidence drops significantly after that, since you go from several thousand uses at rank 150 to several hundred uses at rank 200; that's an amount one person could swing in a single day of being silly

fresh sentinel Aug 12, 2024, 4:39 PM

#

idk the privacy implications of this but
if Muni could say whether a word is being used by under 10, 100, 1000, 10000 people
and then lines could have different thickness or opacity depending on the category
that would maybe make it easy to visualize if a spike is a silly spike

#

this wouldn't catch mu mu mu

quaint wagon Aug 12, 2024, 4:40 PM

#

ooooo wait that actually is interesting and i don't think it would be hard to check

#

uagh it would be a lot of queries tho lmao

#

well, not on ilo muni's side
db generation side

#

i'm unsure how to represent it on the graph tho; line thickness gets difficult to judge if there are more than 3 distinct thicknesses

fresh sentinel Aug 12, 2024, 4:41 PM

#

3 might be enough? for 10, 100, 1000

quaint wagon Aug 12, 2024, 4:41 PM

#

also, for reference, distinct author count is imprecise bc of pluralkit and more generally bc i can't combine authors across platforms

fresh sentinel Aug 12, 2024, 4:41 PM

#

idk how often a word will be used by more than 1000 people

short narwhal Aug 12, 2024, 4:41 PM

#

fresh sentinel i am generally curious about like if Muni describes a word as popular is it beca...

not many people use the word nimisin. that makes for bad numbers, because if nimisin Georg uses the word a lot, the numbers get skewed, so they should be left out of the count

fresh sentinel Aug 12, 2024, 4:41 PM

#

in one month

quaint wagon Aug 12, 2024, 4:42 PM

#

oh i see, you want that info on a monthly basis

#

that could be harder

#

that makes sense tho

fresh sentinel Aug 12, 2024, 4:42 PM

#

not necessarily! this is just the first idea that came to me

#

and wouldn't change the UI too much anu seme

tall rune Aug 12, 2024, 4:43 PM

#

we found a spiders moment yesterday

icy turret Aug 12, 2024, 4:43 PM

#

@quaint wagon you could create some kind of measure for how concentrated a word's use is in one person vs many people

tall rune Aug 12, 2024, 4:43 PM

#

icy turret Aug 12, 2024, 4:43 PM

#

this sounds vaguely similar to the gini coefficient
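
For reference, a minimal Gini coefficient over per-author use counts. Treating each author's count for a word in a month as the "income" is an assumption about how it would be applied here, and authors who never used the word would need to be included as zeros for the result to mean much.

```python
def gini(counts: list[int]) -> float:
    """Gini coefficient of non-negative counts: 0 is perfectly even, higher is more concentrated."""
    values = sorted(counts)
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum(rank * value for rank, value in enumerate(values, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


even_use = gini([10, 10, 10, 10])  # four authors, equal use
one_spider = gini([0, 0, 0, 40])   # same total, all from one author
print(even_use, one_spider)
```

With n authors the maximum is (n - 1) / n, so the single-author case above tops out at 0.75 rather than 1.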

tall rune Aug 12, 2024, 4:43 PM

#

tall rune

quaint wagon Aug 12, 2024, 4:44 PM

#

my god

#

i have no way to fix something like this

#

like, not reasonably

tall rune Aug 12, 2024, 4:44 PM

#

o tonsi tawa ale owe

quaint wagon Aug 12, 2024, 4:44 PM

#

idk at least it's clear it's a spike of silliness

icy turret Aug 12, 2024, 4:44 PM

#

o tonsi tawa ale

whole epoch Aug 12, 2024, 4:44 PM

#

tall rune

same thing for lonsi but a group of spiders

short narwhal Aug 12, 2024, 4:44 PM

#

sina alasa e pipi Georg

fresh sentinel Aug 12, 2024, 4:44 PM

#

this would show up as an under-10-people line in my proposal, which may help

quaint wagon Aug 12, 2024, 4:45 PM

#

right

tall rune Aug 12, 2024, 4:45 PM

#

whole epoch same thing for lonsi but a group of spiders

makes sense

icy turret Aug 12, 2024, 4:45 PM

#

you know, i can ruin your data by posting procedurally generated text at least once a month in large quantities

quaint wagon Aug 12, 2024, 4:45 PM

#

i would omit you, personally, from the data in that case

icy turret Aug 12, 2024, 4:46 PM

#

the correct solution is ofc to ban me from ilo Muni

#

ye

quaint wagon Aug 12, 2024, 4:46 PM

#

actually i think i neglected to mention this entirely but i do omit #jaki from the data

icy turret Aug 12, 2024, 4:46 PM

#

yeye

fresh sentinel Aug 12, 2024, 4:46 PM

#

reasonable

quaint wagon Aug 12, 2024, 4:46 PM

#

which i think is completely understandable ye

#

uagh i've been fiddling with postgres all morning at work and now i'm looking at the sqlite db

#

explodes due to slightly different keywords

fresh sentinel Aug 12, 2024, 4:47 PM

#

starting a public server and calling the general chat #jaki /utala

tall rune Aug 12, 2024, 4:47 PM

#

other stuff i did yesterday was making these sorts of graphs to compare the relative popularity of two words, pretty fun to look at

quaint wagon Aug 12, 2024, 4:47 PM

#

fresh sentinel starting a public server and calling the general chat #jaki /uta...

you can do that! ma pona's jaki is the only bot channel i omit bc it's the largest channel on the server and it isn't even close

icy turret Aug 12, 2024, 4:47 PM

#

fresh sentinel starting a public server and calling the general chat #jaki /uta...

worse you can create a thread and invite ten people to spam it with plausibly real text

quaint wagon Aug 12, 2024, 4:48 PM

#

that channel, in text, is 4gb of the 56gb of the entire raw discord dataset

fresh sentinel Aug 12, 2024, 4:48 PM

#

holy shit lmao

tall rune Aug 12, 2024, 4:48 PM

#

quaint wagon Aug 12, 2024, 4:48 PM

#

i'll grant that text is a lot of spare discord fluff i.e. authors and their roles and metadata on a per message basis
but.

tall rune Aug 12, 2024, 4:48 PM

#

i also learned that nobody said mije in november 2016

quaint wagon Aug 12, 2024, 4:49 PM

#

why do you have a-a on the graph?

tall rune Aug 12, 2024, 4:49 PM

#

on the previous ones it was to highlight the 0 line

tall rune Aug 12, 2024, 4:49 PM

#

tall rune i also learned that nobody said mije in november 2016

i just didnt bother removing it for this one

icy turret Aug 12, 2024, 4:49 PM

#

tall rune i also learned that nobody said mije in november 2016

no one on reddit did cause reddit was small and discord communities didn't exist yet

tall rune Aug 12, 2024, 4:49 PM

#

makes sense

quaint wagon Aug 12, 2024, 4:49 PM

#

yeah, until i think march 2017? the data is all telegram and reddit

fresh sentinel Aug 12, 2024, 4:49 PM

#

a time when men didn't exist

quaint wagon Aug 12, 2024, 4:50 PM

#

mije is reliably less used than meli which is very amusing to me

short narwhal Aug 12, 2024, 4:50 PM

#

a time without men

tall rune Aug 12, 2024, 4:51 PM

#

tall rune other stuff i did yesterday was making these sorts of graphs to compare the rela...

this graph shows meli vs mije and yeah, meli is almost always more popular except for the weird spikiness at the beginning

icy turret Aug 12, 2024, 4:51 PM

#

quaint wagon mije is reliably less used than meli which is very amusing to me

im guessing this is cause demographics

tall rune Aug 12, 2024, 4:51 PM

#

tall rune this graph shows meli vs mije and yeah, meli is almost always more popular excep...

which is probably just because of the small volume of data

#

there was a time in early 2023 tho where mije was more popular

icy turret Aug 12, 2024, 4:52 PM

#

i just saw someone say minisin and this is making gears turn in my head

short narwhal Aug 12, 2024, 4:52 PM

#

misinin

quaint wagon Aug 12, 2024, 4:53 PM

#

gears grinding violently, making a terrible screeching noise, but just barely turning

icy turret Aug 12, 2024, 4:53 PM

#

what would a minisin look like

short narwhal Aug 12, 2024, 4:53 PM

#

me when i do this again

#

mi ni sin

quaint wagon Aug 12, 2024, 4:54 PM

#

ok i think i can do author counts on a per ngram basis

icy turret Aug 12, 2024, 4:54 PM

#

quaint wagon ok i think i can do author counts on a per ngram basis

look into the gini coeff genuinely

#

it might be better

quaint wagon Aug 12, 2024, 4:55 PM

#

would i not need author count in order to get that info

#

the sqlite db side of it would be easier than i thought initially, since i can just add another column, "num_authors", to the frequency table; the number of authors of a given word or phrase is always containable in the same dimensions as a frequency entry i.e. some phrase, some min sent len, and some date (representing a range)

#

in order to count authorship, i need a way to know what distinct authors have said a given term, for which i can use their edgedb generated uuid (note: authors are joined by their combo of platform and platform id+name, and only may be entered when i enter a new message) and probably stuff that in a per-entry set for the lifetime of the ngram counter, which is the time it takes me to count a month of data
when i query sentences, i just tack on the additional info of sentence.message.author.id and use that to update the authorship sets corresponding to the phrase i'm updating
and at the end of a month, write back the length of each set side by side with the frequency data
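
The plan above can be sketched roughly as follows. This is an illustration only: `count_month` and its input shape are hypothetical, the min sentence length dimension is left out for brevity, and the author ids stand in for the database-generated uuids.

```python
from collections import defaultdict


def count_month(sentences):
    """Count hits and distinct authors per ngram for one month of data.

    `sentences` is an iterable of (ngrams, author_id) pairs, one per sentence.
    Returns {ngram: (hits, num_authors)}.
    """
    hits = defaultdict(int)
    authors = defaultdict(set)  # per-ngram author sets, alive only for this month
    for ngrams, author_id in sentences:
        for ngram in ngrams:
            hits[ngram] += 1
            authors[ngram].add(author_id)
    # At the end of the month only the set sizes are written back.
    return {ngram: (hits[ngram], len(authors[ngram])) for ngram in hits}


month = [
    (["toki", "toki pona"], "author-a"),
    (["toki"], "author-a"),
    (["toki", "pona"], "author-b"),
]
rows = count_month(month)
```

Here `rows["toki"]` is `(3, 2)`: three hits from two distinct authors, which is the pair that would sit side by side in the frequency table.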

icy turret Aug 12, 2024, 5:00 PM

#

i think it might just be num authors of all ngrams per month? if you count authors for every ngram individually you lose the 0 case

#

not sure tho, maybe thats stupid

quaint wagon Aug 12, 2024, 5:00 PM

#

i am intent on getting num authors of all ngrams per month and num authors for each ngram

#

the frequency table and ranks table are nearly identical for a reason

#

that reason is, while you can count the ranks info from the frequency table, it takes over 20x the reads

#

i'm essentially packing a more specific type of query into that table

quaint wagon Aug 12, 2024, 6:20 PM

#

question:
if i implement a selector for smoothing method, would that be overwhelming
there's already a lot of options lmao

#

@icy turret @fresh sentinel @whole epoch @tall rune

fresh sentinel Aug 12, 2024, 6:21 PM

#

i haven't actually used the tool yet ehehe

quaint wagon Aug 12, 2024, 6:21 PM

#

ah fair

#

did you see my suggested search btw
it really shouldn't take more than 3s to resolve a simple text search even with the default settings
and since those other tools all work, it must be something I'm doing that's breaking on your browser
idk what tho, and you can't pull up dev tools on mobile, aaaaaaa

icy turret Aug 12, 2024, 6:25 PM

#

quaint wagon question: if i implement a selector for smoothing method, would that be overwhel...

i was the one to propose it so i have to say no
I would call the options soft kernel (smooth) and hard kernel (google ngrams style) which should indicate to people why theyd ever want to use the hard kernel (they don't), but maybe you have better naming conventions in mind

quaint wagon Aug 12, 2024, 6:26 PM

#

uh, my naming method would have been to use the name of the smoothing method as i discover it on Wikipedia pages about stats w.r.t. timeseries data

whole epoch Aug 12, 2024, 6:31 PM

#

quaint wagon question: if i implement a selector for smoothing method, would that be overwhel...

wouldnt mind it so go for it if you wanna do that
it like, wouldnt get in the way of anything except a tiny bit of screen real estate on the top

#

and if it's too much you can just not use it (assuming the default is good)

quaint wagon Aug 12, 2024, 6:32 PM

#

@wanton shoal (hi) suggested exponential smoothing and it's pretty good bc it mostly preserves changes in direction while moving the peaks and troughs nearer to one another; it also passes the tonsi and misikeke tests, correctly showing 0 for any point prior to the initial non-zero point of those graphs
i have also found gaussian and median, which are respectively extra curvy and extra blocky; gaussian fails the tonsi/misikeke tests but is good for trend analysis; median does decently at those tests, passing them up to 30 smoothing

#

actually median technically fails the tonsi test bc of the 3 occurrences in July 2019, 0 in Aug/sep, and 21 in Oct; it omits the July data for a while
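
A small sketch of the "tonsi test" as described here, using the counts quoted in the message (3 in July 2019, 0 in Aug/Sep, 21 in Oct). The two leading zero months, the `alpha` value, and the window radius are assumptions for illustration, not the tool's actual parameters.

```python
from statistics import median


def exponential(values, alpha):
    """One-sided simple exponential smoothing: each point only sees the past."""
    out = []
    prev = None
    for x in values:
        prev = x if prev is None else alpha * x + (1 - alpha) * prev
        out.append(prev)
    return out


def centered_median(values, radius):
    """Median over a window centered on each point, truncated at the edges."""
    return [
        median(values[max(0, i - radius): i + radius + 1])
        for i in range(len(values))
    ]


# May..Oct 2019; May and June are assumed to be zero.
tonsi = [0, 0, 3, 0, 0, 21]
exp = exponential(tonsi, 0.5)
med = centered_median(tonsi, 1)
```

The exponential result stays at 0 for the months before the first non-zero point and becomes 1.5 in July, so it passes the test. The median result is 0 for July, which is the omission of the early data mentioned above.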

wanton shoal Aug 12, 2024, 6:34 PM

#

might be good to add a dropdown where u can choose from multiple methods? and maybe do a little explainer on what they do and don’t do, for the uninitiated?

quaint wagon Aug 12, 2024, 6:35 PM

#

quaint wagon question: if i implement a selector for smoothing method, would that be overwhel...

ye, a drop-down was what i was suggesting here

whole epoch Aug 12, 2024, 6:35 PM

#

wanton shoal might be good to add a dropdown where u can choose from multiple methods? and ma...

that's basically what's done for the different charting modes and i assume it'd be the same for smoothing

#

in the help page there are explanations

wanton shoal Aug 12, 2024, 6:35 PM

#

oh am on phone and didn’t scroll up that far soz xd

quaint wagon Aug 12, 2024, 6:35 PM

#

np!

quaint wagon Aug 12, 2024, 6:35 PM

#

quaint wagon *actually* median technically fails the tonsi test bc of the 3 occurrences in Ju...

granted I think omitting tiny early data is a better failure mode than extending early data to earlier than the word existed

wanton shoal Aug 12, 2024, 6:39 PM

#

i played around yesterday with only including data up to time T, but not beyond it, so data wouldn’t get extended into the past

#

however it didn’t seem to work all that well tbh
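
A minimal sketch of that idea, assuming a plain trailing mean over the last `window` points (the function name and the sample series are hypothetical):

```python
def trailing_mean(values, window):
    """Mean of the current point and up to window - 1 points before it."""
    out = []
    for i in range(len(values)):
        chunk = values[max(0, i - window + 1): i + 1]
        out.append(sum(chunk) / len(chunk))
    return out


series = [0, 0, 0, 12, 0, 0]
smoothed = trailing_mean(series, 3)
```

The result is `[0.0, 0.0, 0.0, 4.0, 4.0, 4.0]`: nothing leaks into the months before the spike, but the spike itself is flattened and smeared forward in time.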

quaint wagon Aug 12, 2024, 6:41 PM

#

exponential?

#

or median?

novel lintel Aug 12, 2024, 6:43 PM

#

icy turret what would a minisin look like

nimis in pi toki [Mini] anu seme

wanton shoal Aug 12, 2024, 9:41 PM

#

quaint wagon exponential?

normal sliding avg

#

the thing is, it works to some extent, but imo the data only barely resembles the input haha
here's the input:

#

#

and 5 smoothing applied

#

i can see the resemblance, but in the original the peaks aren't nearly the same height (i suppose the second peak is flattened a lot)

#

(also, "the tonsi test" is a very funny phrase to me for some reason)

wanton shoal Aug 12, 2024, 10:30 PM

#

median smoothing also sucks

#

destroys the data, yet suffers from the same issues as simple avg

#

i don't know. my feeling is this: there are two options.

1. the current simple smoother stays. it's not good, but it matches the Ngram tool
2. the one-sided exponential smoother comes into use.

... 3. some data analyst helps out :D

#

lots of research into things that deal well with cyclic data, passing only certain bands for signal processing, whatever
but i feel like just stat vis isn't really a focus of any research. very thin info online

#

but i'm only talking. jan Kekan San, you decide what gets done :D

quaint wagon Aug 12, 2024, 10:34 PM

#

wanton shoal mi sona ala. pilin mi li ni: ken tu li lon. 1. ilo pona ni li awen. ona li pona ...

i want to offer several possible methods and put the Exponential smoother first, because the new line resembles the original line

wanton shoal Aug 12, 2024, 10:34 PM

#

lon

quaint wagon Aug 12, 2024, 10:34 PM

#

most likely its method is the best one for looking at the past

#

but i'm still researching and want to talk to people who know math, haha

wanton shoal Aug 12, 2024, 10:35 PM

#

true. i think this: tool one is good at thing number one. tool two isn't good at that and is good at thing number two.

quaint wagon Aug 12, 2024, 10:36 PM

#

aaaaa ken

wanton shoal Aug 12, 2024, 10:36 PM

#

looks like a tool that can do everything doesn't exist :D

#

like everything else hahaha

quaint wagon Aug 12, 2024, 10:39 PM

#

a a a, lon

merry walrus Aug 12, 2024, 10:39 PM

#

Out here doing þe Lord's work 🙏

#

Ignore þe Belarusians

#

hijksafkjajg

quaint wagon Aug 12, 2024, 10:40 PM

#

.... belarusians?

merry walrus Aug 12, 2024, 10:40 PM

#

Þe only oþer example of pokasi I've seen is it being used for Belarus

pseudo smelt Aug 12, 2024, 10:40 PM

#

… pokasi?

quaint wagon Aug 12, 2024, 10:40 PM

#

anyway this increases the necessity of unique author tracking ehehhehe

merry walrus Aug 12, 2024, 10:40 PM

#

Yes

#

How much have I contributed to þe Pokasi population

#

Probably most because I made it up

#

Þe greatest þing I ever did was make a nimisin in my first SP sentence

#

I didn't even know what a nimisin was

wanton shoal Aug 12, 2024, 10:41 PM

#

power move

pseudo smelt Aug 12, 2024, 10:41 PM

#

wawa

wanton shoal Aug 12, 2024, 11:14 PM

#

https://github.com/gregdan3/ilo-muni/pull/15 hehe

GitHub

Exponential smoothing and smoother selection by SarahIsWeird · Pull...

This PR is more or less to propose an implementation. I've implemented a simple exponential smoother, as I've talked about. The current smoother remains the default.

quaint wagon Aug 12, 2024, 11:17 PM

#

i am halfway through writing this as we speak LMAO

wanton shoal Aug 12, 2024, 11:17 PM

#

also re: issue #13 regex having look-behind and look-ahead is cursed anyways, makes it a CFG

#

lmao

quaint wagon Aug 12, 2024, 11:17 PM

#

fair enough tho, this is easy to slot into my work!

wanton shoal Aug 12, 2024, 11:18 PM

#

i was faster >:)

quaint wagon Aug 12, 2024, 11:18 PM

#

ehehe

wanton shoal Aug 12, 2024, 11:18 PM

#

would have been a good 20 mins faster still if i didn't accidentally work on the unforked branch lmao

quaint wagon Aug 12, 2024, 11:18 PM

#

lmao

#

i should probably do my work on branches instead of directly on main now huh

wanton shoal Aug 12, 2024, 11:19 PM

#

👏

#

preach

#

even when working alone, i've noticed that as soon as a project grows in size and i wanna do some quick fixes, PRs are still a godsend

#

cuz manually selecting what to include in the diff is annoying, when you've refactored half a file :D

quaint wagon Aug 12, 2024, 11:22 PM

#

true, ehe

#

lazygit makes it smooth when necessary tho

#

i very rarely have to dip into my git plugin's diff view to fix things

wanton shoal Aug 12, 2024, 11:23 PM

#

i guess it's also a result of me refusing to use the command line for git anymore, since i only work in IDEs nowadays, and it's just so much easier for me xd

#

so i'm kinda accustomed to select all, commit, push

#

but that does hide the fact i forgor to checkout another branch haha

quaint wagon Aug 12, 2024, 11:42 PM

#

ehehe, np

#

i've merged your branch to a separate branch merge-smoother which i wish i just called smoother; gonna add a smoother url param and some logic to disable the button as necessary!

#

aside: oh god i have no idea if my js is any good, i am not good at computer

#

ah jk i just missed that you did so in reading the github diff!

wanton shoal Aug 13, 2024, 12:39 AM

#

quaint wagon aside: oh god i have no idea if my js is any good, i am not good at computer

u made a thing, which is more than a lot of people can say ab themselves! :D

#

...but yes, some refactoring might be good haha

quaint wagon Aug 13, 2024, 12:40 AM

#

yeahhhhh

#

ehehhehe

#

see the thing is
sona toki is the best code i've ever written
aaaand this whole project was downhill from there :D

#

i do know at a minimum that the sqlite file is a mess bc i've mixed the responsibilities of querying and mutating the data there
and that the input file sucks bc i'm doing a bunch of crappy splitting instead of like, actually parsing user input

meager steeple Aug 13, 2024, 12:45 AM

#

btw, are you analyzing each message or sentences within a message?

#

if i put a full stop or semicolon would that turn the message into two sentences

quaint wagon Aug 13, 2024, 12:47 AM

#

meager steeple btw, are you analyzing each message or sentences within a message?

sentences within a message

#

my sentence tokenizer isn't perfect
quotes count for it too
altho i am considering removing them honestly

meager steeple Aug 13, 2024, 12:58 AM

#

sona

quaint wagon Aug 13, 2024, 1:06 AM

#

i wanna sit down and do some analysis to that end
on one hand, quotes, double and single, are definitely used to distinguish sentences
on the other hand, in toki pona, they much more often already are marked as sentences based on the other punctuation
ofc, both of these are anecdotal statements- i wanna know which is more true and when

#

also, i wanna do intra-word punctuation in the word tokenizer for isn't don't
not because i need that to tokenize toki pona, but because doing so would slightly increase my accuracy of detecting toki pona
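
A rough sketch of what intra-word punctuation handling could look like with a regex. Both patterns are illustrative assumptions, not the tokenizer actually in use.

```python
import re

# Naive tokenizer: any run of word characters is a word.
NAIVE = re.compile(r"\w+")
# Allow an apostrophe (straight or curly) between word characters.
INTRA = re.compile(r"\w+(?:['’]\w+)*")

text = "mi sona ala. isn't that what you don't want?"
naive = NAIVE.findall(text)
intra = INTRA.findall(text)
```

The naive pattern breaks the contractions into fragments such as `isn`, `don`, and `t`, which then get scored as separate words. The second pattern keeps `isn't` and `don't` whole, so they can be recognized as English in one piece.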

quaint wagon Aug 13, 2024, 1:56 AM

#

@wanton shoal turns out that exponential is misleading for peak-y data! Fook

merry walrus Aug 13, 2024, 2:03 AM

#

I don't get what þe smooþing does so I'll assume it's doing good here

quaint wagon Aug 13, 2024, 3:10 AM

#

merry walrus I don't get what þe smooþing does so I'll assume it's doing good here

the data is really noisy to begin with, so smoothing attempts to reduce that noise while still representing the data faithfully

but the problem is, under some circumstances the smoothing is inaccurate

if you have data with peaks at specific times, the peaks can be spread out in a misleading way (center averaging turns them into plateaus, and exponential smoothing turns them into slopes)
if you have data that is zero for a while then starts, smoothing can spread the data to before there was a real data point
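
Both effects can be seen on a single isolated spike. This is a sketch under assumed parameters (window radius 1, `alpha` of 0.5), with hypothetical function names:

```python
def centered_mean(values, radius):
    """Mean over a window centered on each point, truncated at the edges."""
    out = []
    for i in range(len(values)):
        chunk = values[max(0, i - radius): i + radius + 1]
        out.append(sum(chunk) / len(chunk))
    return out


def exponential(values, alpha):
    """One-sided simple exponential smoothing."""
    out = []
    prev = None
    for x in values:
        prev = x if prev is None else alpha * x + (1 - alpha) * prev
        out.append(prev)
    return out


spike = [0, 0, 0, 9, 0, 0, 0]
plateau = centered_mean(spike, 1)
slope = exponential(spike, 0.5)
```

`plateau` is `[0, 0, 3, 3, 3, 0, 0]`: the peak becomes a flat run that starts one step before the real data point. `slope` is `[0, 0, 0, 4.5, 2.25, 1.125, 0.5625]`: nothing appears early, but the peak is halved and decays gradually afterwards.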

#

ok having the realization that i can't really produce an appropriate smoothing algorithm without knowing what it means for there to be a "signal"

#

really, the smoothing necessary just depends on the input
and different terms will have different signals

#

median works extremely well on continuous data, but demolishes seasonal data instantly
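
A quick illustration with made-up series, assuming a centered median with radius 1:

```python
from statistics import median


def centered_median(values, radius):
    """Median over a window centered on each point, truncated at the edges."""
    return [
        median(values[max(0, i - radius): i + radius + 1])
        for i in range(len(values))
    ]


seasonal = [1, 1, 9, 1, 1, 1, 9, 1, 1]  # hypothetical term with recurring one-month spikes
steady = [1, 2, 3, 4, 5, 6, 7]          # hypothetical steadily growing term

flat = centered_median(seasonal, 1)
kept = centered_median(steady, 1)
```

Every value in `flat` is 1, so the recurring spikes vanish entirely, while the interior of `kept` is identical to the steady input.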

#

extremely funny

#

no smoothing for comparison

#

exponential is reliably the least silly of all the smoothing methods

#

relative minmax is nice for comparing multiple test cases at once

merry walrus Aug 13, 2024, 3:25 AM

#

damn misikeke barely edges it out

#

RIP tonsi

quaint wagon Aug 13, 2024, 3:25 AM

#

lmao

tall rune Aug 13, 2024, 9:15 AM

#

im skeptical of smoothing and just turn it off when i use the tool usually

icy turret Aug 13, 2024, 11:00 AM

#

hidden tide Aug 13, 2024, 2:00 PM

#

tenpo Mopiju li lon

#

as you can observe, in times of tenpo pi kama sona la troughs form

#

here's the usage trend of various video games that were played on ma pona, i ought to add gartic phone but i don't know what the common tokiponization of it is

mild glacier Aug 13, 2024, 2:19 PM

#

hey

#

hows this?:

#

hidden tide Aug 13, 2024, 2:21 PM

#

each instance of a larger string of letters adds to the count of the smaller strings here

mild glacier Aug 13, 2024, 2:23 PM

#

ah okay, cool!

#

hows this?

hidden tide Aug 13, 2024, 2:30 PM

#

snazzy

meager steeple Aug 13, 2024, 2:35 PM

#

mild glacier hows this?

oo that little jump in lu near the end is because of april fools day when everybody spoke tuki tiki

pseudo smelt Aug 13, 2024, 2:35 PM

#

a!

#

muti

hidden tide Aug 13, 2024, 2:37 PM

#

muti mute

#

wait no i'm blending languages

#

muti a

pseudo smelt Aug 13, 2024, 2:37 PM

#

muti muta
(sona mi la “muta” li nimi majuna pi toki sike)

meager steeple Aug 13, 2024, 2:38 PM

#

muti tu

#

anu seme

hidden tide Aug 13, 2024, 2:38 PM

#

i mean i guess

#

but i think just "a" afterwards sounds more natural

#

anyways, here's jan Wano's graph that looks at each string individually without accumulation

#

take away the first line and we can see the trend of how many as are used for laughter

meager steeple Aug 13, 2024, 4:01 PM

#

quaint wagon Aug 13, 2024, 4:04 PM

#

A AA A A A A

merry walrus Aug 13, 2024, 4:20 PM

#

gyuhjkeafafd

#

Damn þe rest of tp þrough L just barely beat out þe particles

quaint wagon Aug 13, 2024, 4:32 PM

#

LMAO YOU DID ALL OF THEM?
send link? @merry walrus

meager steeple Aug 13, 2024, 4:32 PM

#

sina wan a li suli e "ijo sama"

quaint wagon Aug 13, 2024, 4:32 PM

#

MI WAWA

merry walrus Aug 13, 2024, 4:33 PM

#

quaint wagon LMAO YOU DID ALL OF THEM? send link? @merry walrus

It's only þrough L, I just wanted to see how long until þe particles got outpaced (ignoring a because it would give any side an unfair advantage)

#

https://gregdan3.github.io/ilo-muni/?query=li+%2B+e+%2B+o+%2B+pi+%2B+la+%2B+en+%2B+kin%2C+akesi+%2B+ala+%2B+alasa+%2B+anpa+%2B+ante+%2B+anu+%2B+awen+%2B+esun+%2B+ijo+%2B+ilo+%2B+ike+%2B+insa+%2B+jan+%2B+jaki+%2B+jelo+%2B+jo+%2B+kala+%2B+kalama+%2B+kasi+%2B+kama+%2B+ken+%2B+kepeken+%2B+kili+%2B+kiwen+%2B+ko+%2B+kon+%2B+kute+%2B+kulupu+%2B+lape+%2B+laso+%2B+lawa+%2B+len+%2B+lete+%2B+linja+%2B+lili+%2B+lipu+%2B+loje+%2B+lon+%2B+luka+%2B+lukin+%2B+lupa&minSentLen=1&scale=cmsum&start=1280620800&end=1722470400

meager steeple Aug 13, 2024, 4:34 PM

#

wawa...

#

nimi ale li lon ala a a

quaint wagon Aug 13, 2024, 4:34 PM

#

i know this for a fact: the pure particles are only about 20% of toki pona

merry walrus Aug 13, 2024, 4:34 PM

#

I did it from memory so I might be missing a couple words in L

quaint wagon Aug 13, 2024, 4:35 PM

#

you haven't included "toki" or "pona" or "sona", or any of the pronouns for that matter which will ofc be a huge portion of all toki pona

#

i get that was the point to be clear :P

merry walrus Aug 13, 2024, 4:35 PM

#

Oh I missed kule and laso damn

quaint wagon Aug 13, 2024, 4:36 PM

#

those won't make much of a difference ehehe

merry walrus Aug 13, 2024, 4:36 PM

#

Critical words

quaint wagon Aug 13, 2024, 4:38 PM

#

i mean number wise

quaint wagon Aug 14, 2024, 2:39 AM

#

i found the source of the wan/tu/mute/luka/ale spike in oct 2021

#

there is a channel in one server where people genuinely counted to the high thousands in the pu numbering system

#

well, i say people, but it seems to have been almost entirely one person

meager steeple Aug 14, 2024, 2:40 AM

#

merry walrus Aug 14, 2024, 2:40 AM

#

based

meager steeple Aug 14, 2024, 2:40 AM

#

quaint wagon well, i say people, but it seems to have been almost entirely one person

aaa musi

quaint wagon Aug 14, 2024, 2:40 AM

#

this brings on an entirely new question
like, do i exclude that

#

how much is that reflective of "using the language"

merry walrus Aug 14, 2024, 2:41 AM

#

Anoþer reason for a unique auþor feature

quaint wagon Aug 14, 2024, 2:41 AM

#

you know, that's true tbh

#

i'll leave it be for now

placid crow Aug 14, 2024, 3:06 AM

#

@quaint wagon how did you get the data for Muni? Ik it was through online tp communities, but how did you download the data itself?

merry walrus Aug 14, 2024, 3:24 AM

#

Illegal meþods /idk

quaint wagon Aug 14, 2024, 3:42 AM

#

placid crow @quaint wagon how did you get the data for Muni? Ik it was through onlin...

i talk about that on the about page! In short, Reddit has an archival project, Telegram lets you export chats, and for Discord you'll have to read about it on the site

low timber Aug 14, 2024, 7:10 AM

#

toki pono vs pono

Screenshot_2024-08-14_at_12.10.28_AM.png

#

Screenshot_2024-08-14_at_12.16.59_AM.png

verbal marten Aug 14, 2024, 7:51 AM

#

nasa

mild glacier Aug 14, 2024, 11:50 AM

#

meager steeple oo that little jump in lu near the end is because of april fools day when everyb...

ah mi sona :D

#

what are the little numbers that people add?

#

like "a a a_6"

whole epoch Aug 14, 2024, 11:53 AM

#

that's the minimum length needed

mild glacier Aug 14, 2024, 11:53 AM

#

ah okay

whole epoch Aug 14, 2024, 11:53 AM

#

so like
toki_4 only shows toki from phrases with 4 or more words

mild glacier Aug 14, 2024, 11:53 AM

#

ahhhh... okay!

#

i get it!

#

gants :3

whole epoch Aug 14, 2024, 11:54 AM

#

it's very useful!

tall rune Aug 14, 2024, 11:56 AM

#

i found another interesting spike

whole epoch Aug 14, 2024, 11:56 AM

#

lmao??

tall rune Aug 14, 2024, 11:57 AM

#

(this is a relative graph, the absolute graph is less extreme)

#

still there tho

mild glacier Aug 14, 2024, 11:57 AM

#

hows this?

whole epoch Aug 14, 2024, 11:58 AM

#

too little to say anything tbh, only a handful of uses

mild glacier Aug 14, 2024, 11:58 AM

#

yeah

#

lol :3

#

(i have a weird urge to say sorry after every problem i kinda make)

tall rune Aug 14, 2024, 12:00 PM

#

for some reason the tool isnt letting me search "unpa li ken *"

#

i wanna find out what the next word isssss

whole epoch Aug 14, 2024, 12:00 PM

#

just unpa li ken has no results found so that's probably why

tall rune Aug 14, 2024, 12:02 PM

#

thats incorrect

#

maybe youre in a too specific timeframe

#

"unpa li ken" is the big spike in march 2016

mild glacier Aug 14, 2024, 12:02 PM

#

musi la:

whole epoch Aug 14, 2024, 12:02 PM

#

tall rune thats incorrect

huh, yea now it works for some reason??

#

nasa a

tall rune Aug 14, 2024, 12:06 PM

#

#

the spike is some 6 word phrase that starts with unpa li ken

mild glacier Aug 14, 2024, 12:06 PM

#

what is it?

tall rune Aug 14, 2024, 12:07 PM

#

idk

#

its not letting me do the *

mild glacier Aug 14, 2024, 12:07 PM

#

unpa li ken pakala e sina e sina kin

#

:D

tall rune Aug 14, 2024, 12:07 PM

#

its not pakala

mild glacier Aug 14, 2024, 12:07 PM

#

ah

tall rune Aug 14, 2024, 12:11 PM

#

i cant search for it because its not from this server

whole epoch Aug 14, 2024, 12:12 PM

#

since it's 2016 i assume it must've been like, a meme or running gag on telegram?

#

or maybe reddit

tall rune Aug 14, 2024, 12:12 PM

#

i guess

quaint wagon Aug 14, 2024, 12:13 PM

#

tall rune still there tho

that's a "spike" up to 5 occurrences tbf :P

tall rune Aug 14, 2024, 12:13 PM

#

ah

#

it just sticks out funkily

quaint wagon Aug 14, 2024, 12:14 PM

#

granted it's odd it happened all at once, but probably makes sense to whatever discussion happened there

#

yeah, there was just a lot less conversation going on at the time

hidden tide Aug 14, 2024, 3:47 PM

#

despair

hidden tide Aug 14, 2024, 4:57 PM

#

big spaghetti time:
here's my relative minmax graph attempting to convey the trends in focus over time of various meme phrases throughout ma pona's history. by "meme" phrases, i mean any word or phrase that is often repeated among speakers or has some sort of server addon like an emoji or a sticker. obviously incomplete. o lukin pona a!
https://gregdan3.github.io/ilo-muni/?query=a+-+a_2%2C+omekapo%2C+omekalike%2C+ale+li+pona%2C+mu%2C+kijetesantakalu%2C+ikea%2C+ilo+nanpa%2C+kekan+san%2C+jan+telakoman+li%2C+misali%2C+lonsi%2C+mi+tawa+tomo%2C+mi+tawa+e+tomo%2C+lon+-+lon+a+-+lon+ala%2C+lon+ala%2C+mi+sona+ala+a%2C+pingo%2C+kasi+ike+mute%2C+usawi%2C+nimi+sin%2C+nimisin%2C+tonsi%2C+su%2C+owe%2C+kamalawala%2C+wawa%2C+waken%2C+akesi+kule%2C+lon+a&minSentLen=1&scale=normrel&start=1470009600&end=1722470400&smoothing=0&smoother=cwin

#

something interesting that can be found by max smoothing it out is how, depending on what shape is portrayed, "wawa" is in a steady trend upward and either has passed over or is on the verge of passing over "ale li pona"

meager steeple Aug 14, 2024, 5:14 PM

#

for standalone sentences it's been surpassed

hidden tide Aug 14, 2024, 5:16 PM

#

fun

fresh sentinel Aug 14, 2024, 5:35 PM

#

https://gregdan3.github.io/ilo-muni/?query=wawa+-+wawa_2%2C+epiku+-+epiku_2&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin

#

i feel like wawa interjection is maybe the closest pu equivalent to epiku interjection

#

that's how i use it anyway

icy turret Aug 14, 2024, 5:37 PM

#

fresh sentinel i feel like wawa interjection is maybe the closest pu equivalent to epiku interj...

also pona a

fresh sentinel Aug 14, 2024, 5:37 PM

#

oh good point

#

is it possible to search for a two-word interjection?

icy turret Aug 14, 2024, 5:37 PM

#

pona a - pona a_3

river shard Aug 14, 2024, 5:38 PM

#

uh
wawa_2 ?

meager steeple Aug 14, 2024, 5:39 PM

#

wawa_2 is all messages that contain wawa with length 2 or more

fresh sentinel Aug 14, 2024, 5:39 PM

#

https://gregdan3.github.io/ilo-muni/?query=wawa+-+wawa_2%2C+epiku+-+epiku_2%2C+pona+a+-+pona+a_3&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin

meager steeple Aug 14, 2024, 5:39 PM

#

so if i do wawa - wawa_2, it's the number of sentences containing wawa minus the number of sentences containing wawa of length 2 or more; aka how many times wawa was a standalone sentence/message
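
The subtraction trick can be sketched like this. It is a toy model only: `count_phrase` is a hypothetical helper that counts occurrences of a phrase in sentences of at least `min_len` words, which is an assumption about how the `_N` suffix behaves.

```python
def count_phrase(sentences, phrase, min_len=1):
    """Occurrences of `phrase` in sentences that have at least `min_len` words."""
    target = phrase.split()
    n = len(target)
    total = 0
    for sentence in sentences:
        words = sentence.split()
        if len(words) < min_len:
            continue
        # Slide a window of n words across the sentence and count matches.
        total += sum(words[i:i + n] == target for i in range(len(words) - n + 1))
    return total


sentences = ["wawa", "wawa", "sina wawa", "pona a", "ni li pona a"]

# "wawa - wawa_2": all uses minus uses in sentences of 2+ words.
standalone_wawa = count_phrase(sentences, "wawa") - count_phrase(sentences, "wawa", 2)
# "pona a - pona a_3": the same idea for a two-word interjection.
standalone_pona_a = count_phrase(sentences, "pona a") - count_phrase(sentences, "pona a", 3)
```

In this sample `standalone_wawa` is 2 and `standalone_pona_a` is 1: what remains after the subtraction is exactly the sentences that consist of nothing but the phrase.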

hidden tide Aug 14, 2024, 5:43 PM

#

i added a few more memes and changed the wawa parameters to have it be standalone, it drastically drops in terms of relativity (naturally because it is a word that people say outside of commending someone)
https://gregdan3.github.io/ilo-muni/?query=a+-+a_2%2C+omekapo%2C+omekalike%2C+jipi%2C+slape%2C+jans%2C+ale+li+pona%2C+mu%2C+kijetesantakalu%2C+ikea%2C+ku%2C+nja%2C+yutu%2C+osu%2C+ilo+nanpa%2C+epiku%2C+tuki%2C+kekan+san%2C+jan+telakoman+li%2C+misali%2C+lonsi%2C+mi+tawa+tomo%2C+mi+tawa+e+tomo%2C+lon+-+lon_2%2C+lon+ala+-+lon+ala_3%2C+mi+sona+ala+a%2C+pingo%2C+kasi+ike+mute%2C+usawi%2C+nimi+sin%2C+ojuta%2C+a+pilin+ike%2C+o+uta+e+sike+mi%2C+toki+a+-+toki+a_3%2C+moku+wapa%2C+apeja+li+mi%2C+mi+unpa+e+mama+sina%2C+o+luka+e+kasi%2C+ma+masina%2C+mijun%2C+nimisin%2C+o+moku+e+kala+pona%2C+sutopatikuna%2C+o+moku+e+kala+ike%2C+o+wawa%2C+tonsi%2C+su%2C+owe%2C+kamalawala%2C+wawa+-+wawa_2%2C+waken%2C+akesi+kule%2C+lon+a&minSentLen=1&scale=normrel&start=1470009600&end=1722470400&smoothing=0&smoother=cwin

winter thicket Aug 14, 2024, 5:44 PM

#

jsyk a link in þe help article is missing a slash

hidden tide Aug 14, 2024, 5:48 PM

#

for whatever reason, August 2022 was the month of mu

#

absolute chart tells it better

quaint wagon Aug 14, 2024, 5:56 PM

#

hidden tide for whatever reason, August 2022 was the month of mu

this is because of sptp!

quaint wagon Aug 14, 2024, 5:56 PM

#

winter thicket jsyk a link in þe help article is missing a slash

thanks! fixed it

#

well, fix is publishing in like 1 minute

tidal knoll Aug 14, 2024, 6:07 PM

#

https://gregdan3.github.io/ilo-muni/?query=kepeken+pi%2C+kepeken+e%2C+kepeken+lon%2C+lon+kepeken&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin
is the "kepeken e" style becoming less common, or what?

quaint wagon Aug 14, 2024, 6:08 PM

#

tidal knoll https://gregdan3.github.io/ilo-muni/?query=kepeken+pi%2C+kepeken+e%2C+kepeken+lo...

it is very small, but your search doesn't show that

tidal knoll Aug 14, 2024, 6:09 PM

#

the "kepeken pi" style,
the "kepeken lon" style, and
the "lon kepeken" style are all still weird

#

^ why the fuck did the toki-pona-only bot bug me?!

quaint wagon Aug 14, 2024, 6:11 PM

#

search for this: `kepeken *, kepeken ala *`
then you can see this: the style exists, and it's big compared to many of the other individual styles. but it's one thing next to many other styles, and it's very small compared to all of them together.

quaint wagon Aug 14, 2024, 6:12 PM

#

tidal knoll ^ tan fucking seme la ilo pi toki pona taso li pipi e mi?!

the words te and to don't exist as far as the toki-pona-only bot is concerned, because Linku lists them as low-usage.
they're also short in writing, so the bot doesn't catch them any other way
(note: te exists for ilo Muni, but to doesn't, because of English)

tidal knoll Aug 14, 2024, 6:13 PM

#

quaint wagon o alasa e ni: `kepeken *, kepeken ala *` ni la sina ken lukin e ni: nasin ni li ...

https://gregdan3.github.io/ilo-muni/?query=kepeken+toki%2C+kepeken+e%2C+kepeken+ilo%2C+kepeken+nimi%2C+kepeken+nasin%2C+kepeken+ala%2C+kepeken+tenpo%2C+kepeken+sitelen%2C+kepeken+ona%2C+kepeken+ni%2C+kepeken+ala+e%2C+kepeken+ala+nimi%2C+kepeken+ala+kepeken%2C+kepeken+ala+ilo%2C+kepeken+ala+ona%2C+kepeken+ala+nasin%2C+kepeken+ala+toki%2C+kepeken+ala+sitelen%2C+kepeken+ala+ni%2C+kepeken+ala+la&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin kepeken e li suli nanpa tu

quaint wagon Aug 14, 2024, 6:14 PM

#

true! it's the second biggest, but it only opens the slot. it's second biggest next to entries that all contain both the slot opener and the thing in the slot.

#

you want to search for this:
for the word kepeken this isn't everything. but for the "kepeken e" style, this is everything.
https://gregdan3.github.io/ilo-muni/?query=kepeken+e+%2B+kepeken+ala+e%2C+kepeken+toki+%2B+kepeken+ilo+%2B+kepeken+nimi+%2B+kepeken+nasin+%2B+kepeken+tenpo+%2B+kepeken+sitelen+%2B+kepeken+ona+%2B+kepeken+ni+%2B+kepeken+ala+nimi+%2B+kepeken+ala+ilo+%2B+kepeken+ala+ona+%2B+kepeken+ala+nasin+%2B+kepeken+ala+toki+%2B+kepeken+ala+sitelen&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin

#

from this you can tell:

the style with "e" is shrinking
the style without "e" is growing
the style without "e" is much bigger than the style with "e"

tidal knoll Aug 14, 2024, 6:19 PM

#

wawa

#

https://gregdan3.github.io/ilo-muni/?query=mi+e%2C+sina+e%2C+ona+e&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin
mi e, sina e, ona e
transitive pronouns exist

#

https://gregdan3.github.io/ilo-muni/?query=li+mi+e%2C+li+sina+e%2C+li+ona+e&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin though nobody has ever said li sina e

fresh sentinel Aug 14, 2024, 6:25 PM

#

ilo penpo li pipi tu e sina

quaint wagon Aug 14, 2024, 6:57 PM

#

seme a?

fresh sentinel Aug 14, 2024, 6:59 PM

#

pipi utala tu li lon li sama

#

pipi kala wan li lon li wile

plucky gazelle Aug 14, 2024, 9:31 PM

#

hidden tide i added a few more memes and changed the wawa parameters to have it be standalon...

(cw: flashing lights) mmh rendering soup

wicked stratus Aug 14, 2024, 9:36 PM

#

what did that one person say about forbidden spaghetti

river shard Aug 14, 2024, 9:37 PM

#

Hmmm something about HAVING HIGH STANDARDS?

#

but I try not to be upsetti

quaint wagon Aug 14, 2024, 10:16 PM

#

i should Probably prevent people from doing this for exactly the reason you're seeing
but I also don't think it matters enough lmao
maybe it does to avoid bandwidth problems

hidden tide Aug 15, 2024, 2:45 AM

#

so why did mun Kekan San choose to say walo loje rather than loje walo?

quaint wagon Aug 15, 2024, 2:52 AM

#

hidden tide so why did mun Kekan San choose to say walo loje rather than loje walo?

ona li kule ante

low timber Aug 15, 2024, 5:06 AM

#

Screenshot_2024-08-14_at_10.02.29_PM.png

#

nimi suli li suli
nimi sewi li sewi lili
taso, nimi suwi li suwi e sewi ona
nimi seli li awen lili sama seli moli

#

pakala ni li seme a

Screenshot_2024-08-14_at_10.12.23_PM.png

#

ni kin li nasa

Screenshot_2024-08-14_at_10.13.16_PM.png

tall rune Aug 15, 2024, 8:11 AM

#

low timber pakala ni li seme a

#

theres a lot of these

#

both in #sitelen-pona and in the now archived #spt-jaki

#

@quaint wagon would it be worth excluding that channel?

#

i remember we talked about how identifying webhooks (which is what these are) didnt work out for you

#

but excluding spt-jaki from the data seems like a good idea

edgy cradle Aug 15, 2024, 10:37 AM

#

tall rune Aug 15, 2024, 10:40 AM

#

cumulativeeeee

quaint wagon Aug 15, 2024, 12:32 PM

#

tall rune @quaint wagon would it be worth excluding that channel?

hmm!
this is a case where i think the following two improvements would be better:

scanning through all my webhook messages for pluralkit messages specifically, so i don't count webhooks
scoring sentences with consideration for the message they're a part of

i would like to avoid poking more channels out of the list, preferring a code based solution that more correctly separates good and bad sentences, so that i can keep the intended fairness of the system

in this case, a document level score would ideally mark that first sentence low because it's among a bunch of non-tp words
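
One way document-level scoring could work, purely as a sketch: everything here is hypothetical, including the tiny word list, the ratio-based score, and the choice to multiply the sentence score by the message score.

```python
# Tiny stand-in word list; a real scorer would use the full vocabulary.
TOKI_PONA_WORDS = {"mi", "sina", "ona", "li", "e", "toki", "pona", "ala", "sona", "ni"}


def word_ratio(words):
    """Fraction of words found in the word list (0.0 for empty input)."""
    if not words:
        return 0.0
    return sum(w in TOKI_PONA_WORDS for w in words) / len(words)


def score_sentences(message_sentences):
    """Scale each sentence's own score by the score of the whole message."""
    split = [s.lower().split() for s in message_sentences]
    all_words = [w for words in split for w in words]
    doc_score = word_ratio(all_words)
    return [word_ratio(words) * doc_score for words in split]


clean = score_sentences(["mi sona ala", "toki pona li pona"])
noisy = score_sentences([
    "mi",
    "want to go home now please",
    "see you tomorrow at the station",
])
```

On its own the sentence "mi" would score 1.0, but inside the mostly non-toki-pona message it drops below 0.1, while the sentences of the all-toki-pona message keep a score of 1.0.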

#

also, there are other weird spikes that would be fixed by document level scoring! like pipi Kewapi saying "ilo o ken e toki ni a" genuinely some thousand times which i think is absolutely hilarious

tall rune Aug 15, 2024, 12:34 PM

#

if you can do that, great!

#

i just thought cutting out that channel would be an easy solution

#

it feels like a channel where not much would be lost anyway

quaint wagon Aug 15, 2024, 12:35 PM

#

that's probably true, yes
but i need an excuse to do the webhook check and the document scorer anyway :P

tall rune Aug 15, 2024, 12:35 PM

#

fair

royal orchid Aug 15, 2024, 2:21 PM

#

mild glacier Aug 15, 2024, 2:25 PM

#

sitelen pi ilo Muni /musi /literal

pseudo smelt Aug 15, 2024, 2:26 PM

#

musi a
seme a la ona li ilo tawa

mild glacier Aug 15, 2024, 2:26 PM

#

mi sona ala /shrug

#

¯_(ツ)_/¯

icy turret Aug 15, 2024, 2:29 PM

#

@quaint wagon sona pona wants the ilo muni logo cc-licensed btw

#

talk to em for details idk shit

mild glacier Aug 15, 2024, 2:30 PM

#

couldnt they just ask for permission to use the picture?

#

sorry for asking

#

i dont know a lot of things

pseudo smelt Aug 15, 2024, 2:30 PM

#

mi kama lukin e sitelen ilo… la

meager steeple Aug 15, 2024, 2:30 PM

#

pakala...

mild glacier Aug 15, 2024, 2:30 PM

#

among ku!!!

novel lintel Aug 15, 2024, 3:06 PM

#

mild glacier couldnt they just ask for permission to use the picture?

ken a · taso jan [Juwan] li wile cc e ale tan,, ijo pi sona mi ala

fresh sentinel Aug 15, 2024, 3:32 PM

#

ona li wile pana e sitelen ale tawa poki Wikimilija

hidden tide Aug 15, 2024, 3:49 PM

#

i believe in waso supremacy (not accounting for soweli which is giant in comparison)

meager steeple Aug 15, 2024, 3:53 PM

#

aa seme la kijetesantakalu li suli sama akesi sama pipi

quaint wagon Aug 15, 2024, 3:55 PM

#

hidden tide i believe in waso supremacy (not accounting for soweli which is giant in compari...

psst, the relative log is a bit misleading rn because it shows the actual logarithm of the value rather than changing the y axis to be logarithmic
(it's like this because i was running out of time)
you should post the actual relative graph of this

#

the shape of the relative log graph is ultimately correct, but the numbers don't mean much

hidden tide Aug 15, 2024, 3:59 PM

#

(wink wink)

plucky gazelle Aug 15, 2024, 8:42 PM

#

pretty linjas

royal orchid Aug 15, 2024, 9:28 PM

#

tan tomo https://discord.com/channels/301377942062366741/1187212477155528804 la, ni:

royal orchid Aug 15, 2024, 9:55 PM

#

over the past year, anu seme has been, on average, 65% of all instances of anu

quaint wagon Aug 16, 2024, 1:27 AM

#

sonja put ilo muni on tokipona.org! which is super exciting.

quaint wagon Aug 16, 2024, 1:29 AM

#

royal orchid over the past year, anu seme has been, on average, 65% of all instances of anu

nasin ni kin la sina ken lukin e ni!

royal orchid Aug 16, 2024, 1:30 AM

#

sona a

mild glacier Aug 16, 2024, 11:56 AM

#

novel lintel ken a · taso jan [Juwan] li wile cc e ale tan,, ijo pi sona mi ala

a okie :3

neat bramble Aug 16, 2024, 3:16 PM

#

tall rune https://txnor.com/mathchallenge

32/4+15 = ~~19~~ 23
(i can haz more?)

short narwhal Aug 16, 2024, 3:20 PM

#

lmao i got 24/4+13=19

#

wait fym 32/4 + 15 = 19 thats 23

neat bramble Aug 16, 2024, 3:29 PM

#

oh i'm silly

#

~~that's what they mean when they say 99% fail~~

#

hey look it's txnor the home of the unpa hack

edgy cradle Aug 17, 2024, 9:10 AM

#

quaint wagon nasin ni kin la sina ken lukin e ni!

woa what's with that random jump in anu toki

royal orchid Aug 17, 2024, 2:05 PM

#

sama nimi wawa la, nimi sona kin li kama mute:

royal orchid Aug 17, 2024, 2:32 PM

#

#

wan la, ni

#

musi la, ni

pseudo smelt Aug 17, 2024, 2:45 PM

#

seme a

quaint wagon Aug 17, 2024, 3:40 PM

#

ilo li toki e ni:
"tenpo wan la... toki ni li lon."

drifting breach Aug 17, 2024, 5:44 PM

#

royal orchid musi la, ni

what is "XXX_n" ("a_3", "sona_2", etc.)?

#

what are + and -?

#

@quaint wagon

quaint wagon Aug 17, 2024, 5:52 PM

#

drifting breach what is "XXX_n" ("a_3", "sona_2", etc.)?

the number after the underscore means to only count the times that word occurred in sentences with that many words or more! e.g. toki_2 counts only times where toki was in a sentence with at least 2 words.
and the plus and minus are literal- add or subtract two or more things. the math being done is, for every period of time that is represented, add together all the words. or, add/subtract, from left to right, to be exact.
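
A minimal Python sketch of the arithmetic described above, not ilo Muni's actual code. The `counts` table, its keys, and the `parse_term` and `evaluate` helpers are hypothetical; the sketch assumes `term_n` means "occurrences in sentences of at least n words" and that operators are separated from terms by spaces.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical data: (term, minimum sentence length) -> occurrences per time period.
Counts = Dict[Tuple[str, int], List[int]]

def parse_term(raw: str) -> Tuple[str, int]:
    """Split 'toki_2' into ('toki', 2); a bare term defaults to minimum length 1."""
    term, sep, n = raw.rpartition("_")
    if sep and n.isdigit():
        return term.strip(), int(n)
    return raw.strip(), 1

def evaluate(query: str, counts: Counts) -> List[int]:
    """Add or subtract the term series from left to right, period by period."""
    parts = re.split(r"\s+([+-])\s+", query.strip())
    total = list(counts[parse_term(parts[0])])
    for op, raw in zip(parts[1::2], parts[2::2]):
        series = counts[parse_term(raw)]
        sign = 1 if op == "+" else -1
        total = [t + sign * s for t, s in zip(total, series)]
    return total

# Made-up counts for three periods.
counts = {
    ("sina seme", 1): [10, 20, 30],
    ("sina seme", 3): [4, 5, 6],
}
result = evaluate("sina seme - sina seme_3", counts)
```

With these made-up numbers, `sina seme - sina seme_3` leaves only the occurrences in sentences shorter than 3 words for each period.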

merry walrus Aug 18, 2024, 5:18 AM

#

Kokosila isn't ÞAT far ahead

#

Huh

#

Stupid þought

#

but I wonder how much effect I had on þat huge isipin spike in Oct and Dec

#

Oh it's a bot reposting some nimisin

#

And maybe Giggity Mantis yapping about using isipin

fresh sentinel Aug 18, 2024, 6:23 AM

#

bots aren't counted in ilo Muni

pseudo drift Aug 18, 2024, 8:24 AM

#

lmaoo

#

is there any info on how the data is handled here, by the way? I couldn't see any info about how or if it was anonymised

royal orchid Aug 18, 2024, 9:44 AM

#

ni li tan seme a?!

#

kalama lon sptp???? toki jaki ala la, mi sona ala e tan ken ante

whole epoch Aug 18, 2024, 10:30 AM

#

didnt notice this until now but you should probably add somewhere that wildcard ignores e and li

royal orchid Aug 18, 2024, 10:40 AM

#

whole epoch didnt notice this until now but you should probably add somewhere that wildcard ...

nnnnn...
nasa mute a

whole epoch Aug 18, 2024, 10:41 AM

#

erm seme a

royal orchid Aug 18, 2024, 10:45 AM

#

pakala nasa a

#

ona li pana e nimi pi mute nanpa 11-19??????? tan seme a??????? 😵‍💫

#

quaint wagon Aug 18, 2024, 12:06 PM

#

whole epoch didnt notice this until now but you should probably add somewhere that wildcard ...

Uhhhhhh
It should not
What the fuck

whole epoch Aug 18, 2024, 12:06 PM

#

nasa a,,

#

ok now it doesnt but it did before

quaint wagon Aug 18, 2024, 12:08 PM

#

That's a lil cursed and given the query I'm doing is incredibly simple and the processing I'm doing after that is also simple, I think I can only blame the library I'm using to query the DB?

#

And it's a funny one so fair

whole epoch Aug 18, 2024, 12:08 PM

#

royal orchid ona li pana e nimi pi mute nanpa 11-19??????? tan seme a??????? 😵‍💫

know why it would skip the top 10?

quaint wagon Aug 18, 2024, 12:19 PM

#

Uh, not really, no
The query as written should fetch the top 10 items, because it searches the ranks table and orders that by occurrences
The only way that could return something other than the actual top 10 would be if the query were convinced of having the top 10 items early/incorrectly
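
A rough sketch of the kind of query described above, using Python's built-in `sqlite3`. The `ranks` schema, the `top_terms` helper, and the `LIKE` pattern standing in for the wildcard are all assumptions for illustration; the real database and query library may differ.

```python
import sqlite3

def top_terms(conn: sqlite3.Connection, pattern: str, limit: int = 10):
    """Return the `limit` most frequent terms matching a LIKE pattern."""
    return conn.execute(
        "SELECT term, occurrences FROM ranks "
        "WHERE term LIKE ? "
        "ORDER BY occurrences DESC, term ASC "
        "LIMIT ?",
        (pattern, limit),
    ).fetchall()

# Tiny in-memory demo with placeholder terms and made-up counts.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ranks (term TEXT PRIMARY KEY, occurrences INTEGER)")
conn.executemany(
    "INSERT INTO ranks VALUES (?, ?)",
    [(f"tenpo w{i}", i) for i in range(1, 21)],
)
top = top_terms(conn, "tenpo %")
```

Ordering by occurrences before applying the limit is what should guarantee the true top 10, so a result covering ranks 11-19 would point at something outside a query of this shape.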

low timber Aug 18, 2024, 2:10 PM

#

royal orchid kalama lon sptp???? toki jaki ala la, mi sona ala e tan ken ante

ni li ken musi Mapo

#

musi Mapo la ilo li toki e ni

quaint wagon Aug 18, 2024, 2:13 PM

#

ILO

quaint wagon Aug 18, 2024, 4:31 PM

#

royal orchid ni li tan seme a?!

I KNOW WHY THIS HAPPENED

there's a funny but unavoidable oversight in how i detect bots vs webhooks
i don't actually have specific info about whether a given author is a webhook
all i know is whether they're a bot, and what roles they have
webhooks never have roles, while bots do if they're in the server
this is all i have to distinguish them as far as i'm aware?
so if i encounter a bot user with no roles, i mark them as a webhook

anyway mappo is not in the server!
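
The heuristic above can be sketched roughly like this in Python. The `Author` record and `classify` function are hypothetical names; the only inputs assumed are the two facts mentioned (a bot flag and a role list).

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Author:
    name: str
    is_bot: bool
    roles: List[str] = field(default_factory=list)

def classify(author: Author) -> str:
    """Guess the author type from the bot flag and roles alone."""
    if not author.is_bot:
        return "human"
    # Bots in the server always have at least their own role,
    # so a bot-flagged author with no roles is assumed to be a webhook.
    return "bot" if author.roles else "webhook"

in_server_bot = Author("some bot", is_bot=True, roles=["some bot"])
webhook = Author("a proxied message", is_bot=True)
absent_bot = Author("mappo", is_bot=True)  # a real bot, but not in the server

labels = [classify(a) for a in (in_server_bot, webhook, absent_bot)]
```

The last case shows the oversight: a bot that is not in the server has no roles, so it gets the same label as a webhook.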

#

if i just do the pluralkit message check thing i can fix this but i have not yet

royal orchid Aug 18, 2024, 4:33 PM

#

a, sona!

quaint wagon Aug 18, 2024, 4:33 PM

#

quaint wagon I KNOW WHY THIS HAPPENED there's a funny but unavoidable oversight in how i det...

note that when a bot is in the server, they're obligated to have at least one role- their own, which they create on joining, and which cannot be assigned to any user besides that bot

merry walrus Aug 18, 2024, 7:36 PM

#

fresh sentinel bots aren't counted in ilo Muni

pog

novel lintel Aug 18, 2024, 7:39 PM

#

mi alasa e linja sama kepeken ilo Nanpa

📎 ilo_muni_correlation_detection.py

#

mute la nasin [UCSUR] en nanpa suli li lon · taso ni kin · seme a

#

https://gregdan3.github.io/ilo-muni/?query=o+sitelen+lon+lipu+sina%2C+o+sitelen+lon%2C+lipu+sina&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin

#

suno wan la toki tu wan ni li lon · ma ni ala · ma pi kama sona ala · la ma seme

#

ni li musi wile mi

#

wile mi la mi ni ala https://xkcd.com/1138/

xkcd: Heatmap

#

mi kepeken nanpa pi mute kepeken la linja ale li sama mute tan ante tenpo pi suli kulupu · taso mi alasa e linja pi sama mute mute · la ni ala anu seme

quaint wagon Aug 18, 2024, 10:33 PM

#

novel lintel mute la nasin [UCSUR] en nanpa suli li lon · taso ni kin · seme a

ma wan li lon tawa ni: sina sitelen lon lipu sina
ona la ilo li kama lon tenpo suno ale li toki e ni

quaint wagon Aug 18, 2024, 10:33 PM

#

novel lintel mi alasa e linja sama kepeken ilo Nanpa

mi lukin e ni lon tenpo ni: mi kama sin lon tomo!

quaint wagon Aug 18, 2024, 10:35 PM

#

quaint wagon ma wan li lon tawa ni: sina sitelen lon lipu sina ona la ilo li kama lon tenpo s...

ilo ni li lon nasin Webhook
ilo Pulaki kin li ni la mi ken ala sona e ante
nasin tu wan li ken:

toki pi nasin Webhook o lon
jan pi ilo Pulaki o lon ala
mi pali e ilo la mi pana e toki ale pi nasin Webhook tawa ilo Pulaki li alasa e sona ni: ni li tan sijelo seme pi ilo Siko?

#

mi sin e sona lon tenpo kama la nasin nanpa tu wan o lon anu seme...

fresh sentinel Aug 19, 2024, 12:19 AM

#

sona mi la, ilo Pulaki li weka e sona mama pi toki majuna

#

mi sona ala e suli tenpo

#

taso mi alasa e sona pi toki majuna la ilo Pulaki li toki ala

quaint wagon Aug 19, 2024, 2:24 AM

#

fresh sentinel sona mi la, ilo Pulaki li weka e sona mama pi toki majuna

ona li ni ala
tenpo ale la sina ken pana e sitelen ❓ tawa toki pi ilo Pulaki
ni la ona li toki e sijelo open tawa sina

fresh sentinel Aug 19, 2024, 4:43 AM

#

n, pakala mi li ken tan ijo ante

novel lintel Aug 19, 2024, 9:14 AM

#

mi ante e ilo li weka e nasin [UCSUR] e nimi lili e toki pi nanpa taso

📎 ilo_muni_correlation_detection.py

#

mi kama lukin e ijo pi tuki tiki e ijo musi mute ala · ni li musi lili

novel lintel Aug 19, 2024, 11:20 AM

#

ken la ilo o moku e toki anpa tan sitelen pi ma [YouTube] · pali li lili tan ilo sewi [yt-dlp] · taso mi sona ala e ni → mute li pona ala pona

low timber Aug 19, 2024, 11:32 AM

#

sona kiwen

quaint wagon Aug 19, 2024, 11:46 AM

#

low timber sona kiwen

ni, kin, li sona kiwen

quaint wagon Aug 19, 2024, 11:48 AM

#

novel lintel ken la ilo o moku e toki anpa tan sitelen pi ma [YouTube] · pali li lili ta...

ni li ken wile, taso pali li suli ike
sona mi la mi wile ni la mi o kama jo e toki tan sitelen ale
ni la mi o jo e sitelen ale...

novel lintel Aug 19, 2024, 11:48 AM

#

sina ken jo ala a e sitelen

#

sina o sona e sitelen wile · taso poki pi kalama musi li lon

#

mi ni

yt-dlp https://www.youtube.com/playlist?list=PL3meDZ0v1E3e5hwSyfz9Os9ZUsMw4ecwz --get-comments --exec pre_process:"del %(id)s.comments.json" --print-to-file "%(comments)j" "%(id)s.comments.json" --exec "type %(id)s.comments.json" --skip-download

la ale li pona

#

a taso kalama pi mute ala li lon poki · la n

#

"ytsearchall:toki pona" li ken

quaint wagon Aug 19, 2024, 11:54 AM

#

mi toki ala e sitelen suli li toki e sitelen wile
mi o jo e ona ale
alasa mute a

novel lintel Aug 19, 2024, 11:55 AM

#

sina jo e ale pi ma mute anu seme · la o alasa e nimi linjuwi pi ma [YouTube] lon ona

quaint wagon Aug 19, 2024, 11:56 AM

#

aaaaaAAAAA

#

wawa

icy turret Aug 19, 2024, 11:58 AM

#

quaint wagon ni, kin, li sona kiwen

i still don't know the sona kiwen joke

novel lintel Aug 19, 2024, 12:00 PM

#

-a mi toki ike · mi wile toki e toki anpa pi jan ante e toki pilin · e ni ala → jan li sitelen nimi e toki pi sitelen ona

#

ni nanpa tu li pona kin · taso lili a

plucky gazelle Aug 19, 2024, 12:02 PM

#

quaint wagon ni, kin, li sona kiwen

ni kin li olin Juli

quaint wagon Aug 19, 2024, 12:05 PM

#

icy turret i still don't know the sona kiwen joke

there is not even a joke anymore, it's basically a way to irritate lipamanka
at two separate times, I defended the idea that somebody could use the phrase sona kiwen to refer to something which is difficult to understand
once in toki pona, where the response was middling
the second time in English, the true place where nasin discussion lives, and my defense bugged lipamanka so it argued against my point for a few Hours
mind you after like 20 minutes i was barely involved anymore, but as these things go the discussion remained for a while
eventually most of the participants were so annoyed that bringing up sona kiwen genuinely irritated them
naturally pipi Kewapi and I think this is hilarious

quaint wagon Aug 19, 2024, 12:06 PM

#

plucky gazelle ni kin li olin Juli

sina lukin e toki pi kili Juli!

quaint wagon Aug 19, 2024, 12:06 PM

#

novel lintel -a mi toki ike · mi wile toki e toki anpa pi jan ante e toki pilin · e ni ala → ...

mi sona! mi kin li wile e toki anpa kulupu

plucky gazelle Aug 19, 2024, 12:07 PM

#

quaint wagon sina lukin e toki pi kili Juli!

aaaa

#

#toki-ale message kepeken ni pi nimi Juwi li lon ala tan seme

quaint wagon Aug 19, 2024, 12:08 PM

#

plucky gazelle https://discord.com/channels/301377942062366741/301377942062366741/1221683668070...

nimi li wile lon ilo la ona o mute mute
(mute li nanpa)

#

mi weka e nimi ale pi mute ni ala tan ni:
ona li mute wawa li anpa e ken ilo

plucky gazelle Aug 19, 2024, 12:09 PM

#

a

icy turret Aug 19, 2024, 12:11 PM

#

quaint wagon there is not even a joke anymore, it's basically a way to irritate lipamanka at...

wild

drifting breach Aug 19, 2024, 1:30 PM

#

novel lintel wile mi la mi ni ala https://xkcd.com/1138/

what is that "SITEŚ"?

tall rune Aug 19, 2024, 1:35 PM

#

site's

plucky gazelle Aug 19, 2024, 1:44 PM

#

x/sites/

charred patrolBOT Aug 19, 2024, 1:44 PM

#

plucky gazelle x/sites\/

/siteɕ/

quaint wagon Aug 19, 2024, 5:13 PM

#

novel lintel mi ni yt-dlp https://www.youtube.com/playlist?list=PL3meDZ0v1E3e5hwSyfz9Os...

ilo pre_process li wile seme lon ni?

novel lintel Aug 19, 2024, 5:14 PM

#

mi,, sona wawa ala

#

mi kama jo tan lipu pana mi

#

ken la --write-comments --skip-download taso li pona sama

quaint wagon Aug 19, 2024, 5:16 PM

#

sona, lukin la ni

#

sitelen ale la ona li kama jo e toki

#

wawa

#

lukin la pali ale ante li wile poki e toki taso

neat bramble Aug 19, 2024, 7:46 PM

#

wawa

quaint wagon Aug 19, 2024, 10:27 PM

#

novel lintel mi ni yt-dlp https://www.youtube.com/playlist?list=PL3meDZ0v1E3e5hwSyfz9Os...

musi la, toki ~luka luka tu tu mute ale li lon lipu ni taso:
#pana message
taso, mi kepeken ilo alasa la toki pi mute wawa li lon...

#

mi o kepeken nasin nanpa ni anu seme:
toki wan li lon kulupu lili la toki luka tu li lon kulupu suli.
mi open e alasa lon tenpo luka pini.

devout jettyBOT Aug 20, 2024, 6:09 AM

#

poka pi ma [sanpansiko] la ilo tawa li lon • nimi ona li ilo [muni] a

sike Kapo li lukin li ton(Sí?) ↩️

[Reply to:](#1272180068721889290 message) musi a
seme a la ona li ilo tawa

novel lintel Aug 20, 2024, 9:11 AM

#

quaint wagon musi la, toki ~luka luka tu tu mute ale li lon lipu ni taso: https://discord.co...

"mute ale" · seme a · toki,, tu ale mute mute mute mute ale anu seme · ni anu ni ala la wawa

pseudo smelt Aug 20, 2024, 11:54 AM

#

devout jetty poka pi ma [sanpansiko] la ilo tawa li lon • nimi ona li ilo [muni] a

a! musi

quaint wagon Aug 20, 2024, 12:29 PM

#

novel lintel "mute ale" · seme a · toki,, tu ale mute mute mute mute ale anu seme · ni anu ni...

(luka luka) (tu tu mute) (ale)
sina ken sona e poki tan kama lili nimi

novel lintel Aug 20, 2024, 12:30 PM

#

mi awen sona ala :p

quaint wagon Aug 20, 2024, 12:30 PM

#

pakala

plucky gazelle Aug 20, 2024, 5:22 PM

#

ni li ike nanpa

quaint wagon Aug 20, 2024, 5:23 PM

#

tu tu mute li mute mute mute mute
luka luka li lili tawa ni la o suli e ona kepeken mute mute mute mute
ni o suli lon tenpo ale. ale li nanpa.

#

(5+5) * ((2+2)*20) * (100)
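
The same arithmetic as a small Python sketch. The grouping is passed in by hand, since the sketch makes no attempt to infer it from the word order; `SMALL`, `BIG`, `group_value`, and `total` are hypothetical names, and the rule (small numerals add, then `mute`/`ale` multiply the group) is an assumption read off the expression above.

```python
from math import prod

SMALL = {"wan": 1, "tu": 2, "luka": 5}   # numerals that add within a group
BIG = {"mute": 20, "ale": 100}           # numerals that multiply the group

def group_value(words):
    """Sum the small numerals in a group, then scale by any big numerals in it."""
    small = sum(SMALL[w] for w in words if w in SMALL)
    big = prod(BIG[w] for w in words if w in BIG)
    return (small or 1) * big  # a group with no small numerals counts as 1 x big

def total(groups):
    """Multiply the group values together."""
    return prod(group_value(g) for g in groups)

groups = [["luka", "luka"], ["tu", "tu", "mute"], ["ale"]]
value = total(groups)  # (5+5) * ((2+2)*20) * 100
```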

novel lintel Aug 20, 2024, 5:38 PM

#

taso seme li pana e sona ni → tu en luka ala li lon poki mute

quaint wagon Aug 20, 2024, 5:43 PM

#

novel lintel taso seme li pana e sona ni → `tu` en `luka` ala li lon poki `mute`

nanpa lili li lon poka pini pi nanpa suli

novel lintel Aug 20, 2024, 5:44 PM

#

luka tu wan mute li seme

quaint wagon Aug 20, 2024, 5:44 PM

#

a. mi kama sona e pakala.

wanton shoal Aug 22, 2024, 4:45 PM

#

tHONK mi pilin nasa tan ni

#

taso mute pi toki "anu seme" li 1820 taso la...

#

ken

quaint wagon Aug 22, 2024, 4:46 PM

#

ni li nasa seme?

wanton shoal Aug 22, 2024, 4:47 PM

#

mi pilin ni: toki "kxk" li lon mute

quaint wagon Aug 22, 2024, 4:47 PM

#

aaa sona

wanton shoal Aug 22, 2024, 4:47 PM

#

taso toki "ken ala ken" li mute ala kin lon lipu nanpa la mi ken sona

quaint wagon Aug 22, 2024, 4:47 PM

#

toki mute pi nimi tu wan li mute lili taso

wanton shoal Aug 22, 2024, 4:49 PM

#

lon, taso mi pana e ona tawa poki la ona li lukin lon nimi ale toki, lon ala lon?
taso.. ona li suli ala

quaint wagon Aug 22, 2024, 5:03 PM

#

wanton shoal lon, taso mi pana e ona tawa poki la ona li lukin lon nimi ale toki, lon ala lon...

lukin a

#

kin la toki lili "kxk" li lon lipu nimi pi ilo alasa

wanton shoal Aug 22, 2024, 5:07 PM

#

lipu seme?

quaint wagon Aug 22, 2024, 5:31 PM

#

a, ilo alasa li ilo Sona Toki. ona li kama jo e toki li alasa e sona ni: ni li toki ala toki pona?
lipu ni li toki pona nanpa wan tawa ilo
nimi pi nasin kalama pona li nanpa tu
nimi ijo li suli lon sitelen open li nanpa tu wan
nimi pi sitelen ale ken li nanpa tu tu
nimi mute li lon ala ni ale la, toki li pona ala

merry walrus Aug 22, 2024, 10:01 PM

#

Damn a completely fucking blows it out of þe water (I wonder how much oþer languages have affected it)

quaint wagon Aug 22, 2024, 10:34 PM

#

merry walrus Damn a completely fucking blows it out of þe water (I wonder how much oþer langu...

Obviously not none, but given the absolute graph for a follows a similar trend to every other word in the top 20 most frequent, I feel extremely confident its frequency is accurate to its actual occurrence in Toki Pona!

merry walrus Aug 22, 2024, 10:34 PM

#

pog pog

tidal knoll Aug 23, 2024, 12:44 PM

#

#

https://gregdan3.github.io/ilo-muni/?query=o+e+soweli+ala%2C+o+ala+e+soweli&minSentLen=1&scale=rel&start=1470009600&end=1722470400&smoothing=2&smoother=cwin

icy turret Aug 23, 2024, 12:45 PM

#

tidal knoll

i prefer ala la o e soweli

tidal knoll Aug 23, 2024, 12:45 PM

#

icy turret i prefer ala la o e soweli

ala la o e soweli: No results found for this query.

icy turret Aug 23, 2024, 12:46 PM

#

sadness

meager steeple Aug 23, 2024, 2:53 PM

#

ala la o e soweli

#

mi toki e ni la ona o kama lon ilo pi tenpo kama

icy turret Aug 23, 2024, 2:56 PM

#

-# ilo pi penpo kama

meager steeple Aug 23, 2024, 2:57 PM

#

pakala nimi wawa

novel lintel Aug 23, 2024, 3:09 PM

#

icy turret -# ilo pi penpo kama

kon san li kama lon lape mi · kon pi penpo weka · kon pi penpo lon · kon pi penpo kama

fresh sentinel Aug 23, 2024, 3:09 PM

#

khdhkdsgg

icy turret Aug 23, 2024, 3:10 PM

#

ona li kon kekan san

icy turret Aug 25, 2024, 2:37 AM

#

#

....apparently russia(n(s)) died in 2019

#

also oof @ that peak around early 2022

#

apeja consistently peaks every august and i love that

tall rune Aug 25, 2024, 7:24 AM

#

why is that

mild glacier Aug 25, 2024, 7:27 AM

#

thick gust Aug 25, 2024, 7:28 AM

#

sona musi

mild glacier Aug 25, 2024, 7:28 AM

#

lon

#

seme li nena ni?

fresh sentinel Aug 25, 2024, 7:44 AM

#

icy turret

which one is red and which one is blue? for both of these

icy turret Aug 25, 2024, 8:40 AM

#

fresh sentinel which one is red and which one is blue? for both of these

lon is blue, ala is red

icy turret Aug 25, 2024, 8:41 AM

#

tall rune why is that

people singing the lyrics of apeja li mi in chat on suno pi toki pona

tall rune Aug 25, 2024, 8:41 AM

#

a

fresh sentinel Aug 25, 2024, 4:02 PM

#

icy turret ....apparently russia(n(s)) died in 2019

and this one?

icy turret Aug 25, 2024, 4:03 PM

#

this one wasnt actually a comparison, rather the same trend for both words

#

but anyway the first entry is blue, the second is red

fresh sentinel Aug 25, 2024, 4:04 PM

#

ahh

icy turret Aug 26, 2024, 5:00 PM

#

the ✨ hierarchy ✨

fresh sentinel Aug 26, 2024, 5:11 PM

#

it would really help if you include the labels on these, most of us don't have the color order memorized

icy turret Aug 26, 2024, 5:11 PM

#

improved the hierarchy

#

icy turret Aug 26, 2024, 5:12 PM

#

fresh sentinel it would really help if you include the labels on these, most of us don't have t...

ordered from top to bottom

fresh sentinel Aug 26, 2024, 5:12 PM

#

pona

icy turret Aug 26, 2024, 5:16 PM

#

#

i expected namako below core words but nope, right there with noka and nena

fresh sentinel Aug 26, 2024, 5:37 PM

#

noka and nena probably experience the monsi effect

quaint wagon Aug 26, 2024, 5:47 PM

#

they do, as far as the charts show, yes

plucky gazelle Aug 26, 2024, 6:10 PM

#

icy turret

seme li tu e linja meso

icy turret Aug 26, 2024, 6:11 PM

#

plucky gazelle seme li tu e linja meso

sina toki e ni anu seme

plucky gazelle Aug 26, 2024, 6:11 PM

#

ni

icy turret Aug 26, 2024, 6:12 PM

#

jan li kepeken ala ->
log(0) = undefined ->
sitelen li ala
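
A minimal Python sketch of that chain, assuming the chart simply leaves a gap wherever the logarithm is undefined; `log_points` is a hypothetical helper, not part of ilo Muni.

```python
import math

def log_points(values):
    """Take log10 of each value; zero usage has no logarithm, so it becomes a gap (None)."""
    return [math.log10(v) if v > 0 else None for v in values]

# Made-up monthly counts: the month with zero usage has no point to draw.
points = log_points([100, 0, 10])
```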

plucky gazelle Aug 26, 2024, 6:12 PM

#

a

#

tan li nanpa.

icy turret Aug 26, 2024, 6:12 PM

#

tan li nanpa.

merry walrus Aug 26, 2024, 7:23 PM

#

icy turret apeja consistently peaks every august and i love that

Why is þat profound /gen

#

Oh it is explained later down

#

sorry

icy turret Aug 26, 2024, 7:23 PM

#

no worries

#

🐸 🎮

plucky gazelle Aug 26, 2024, 8:04 PM

#

why does lija even say gaming in that song

#

like what does it mean

hidden tide Aug 26, 2024, 8:06 PM

#

she's a gamer

verbal marten Aug 27, 2024, 5:07 AM

#

fresh sentinel noka and nena probably experience the monsi effect

What's the monsi effect?

fresh sentinel Aug 27, 2024, 5:15 AM

#

direction words and body part words are used less frequently in text and VC, and more frequently IRL and in VR

#

so they show up less frequently in ilo Muni

#

which only looks at text

edgy cradle Aug 28, 2024, 6:37 PM

#

fresh sentinel direction words and body part words are used less frequently in text and VC, and...

first i thought "what, that's crazy" and then i thought "oh yeah that makes sense"

thick gust Aug 29, 2024, 9:27 AM

#

wicked stratus Aug 29, 2024, 11:22 AM

#

mijun

verbal marten Aug 29, 2024, 2:28 PM

#

wicked stratus mijun

wicked stratus Aug 30, 2024, 9:21 AM

#

wait thats cool

#

what happened ,,, what happened in august 2023,,,,

hidden tide Aug 30, 2024, 10:21 AM

#

me

quaint wagon Aug 30, 2024, 12:17 PM

#

wicked stratus what happened ,,, what happened in august 2023,,,,

jan Mijun li kama lon

low timber Aug 30, 2024, 12:29 PM

#

verbal marten Aug 30, 2024, 7:09 PM

#

ni li tan seme?

quaint wagon Aug 30, 2024, 7:14 PM

#

verbal marten ni li tan seme?

verbal marten Aug 30, 2024, 7:15 PM

#

a sona

quaint wagon Aug 30, 2024, 7:18 PM

#

taso, ni li ale ala

#

mi alasa

quaint wagon Aug 30, 2024, 7:23 PM

#

tall rune

mi lukin e ni lon tenpo pini li weka e sona

merry walrus Aug 31, 2024, 3:01 AM

#

lmao

short narwhal Aug 31, 2024, 3:08 AM

#

o tonsi tawa ali (:<

pseudo smelt Aug 31, 2024, 3:15 AM

#

Be all moving nonbinary-ers

merry walrus Aug 31, 2024, 5:33 AM

#

📠 but wiþ a t before þe x

#

or

#

in þe middle

#

fa>t<

thick gust Aug 31, 2024, 11:02 AM

#

what's this?

#

ah wait, this is without smoothing

quaint wagon Aug 31, 2024, 11:30 AM

#

thick gust what's this?

uhhhhhhhhh, perhaps that copy/paste blurb where lon is nearly every word in the sentence?

#

will check db and report back when able

#

-remindme 7h30m o alasa e lon lon

latent zodiacBOT Aug 31, 2024, 11:30 AM

#

Set a reminder in 7 hours and 30 minutes from now (Aug 31, 2024, 7:00 PM)
View reminders with the reminders command

thick gust Aug 31, 2024, 11:38 AM

#

ooh, mysterious

icy turret Aug 31, 2024, 11:57 AM

#

thick gust what's this?

try looking up lon lon lon lon etc in ma pona

#

im guessing its someone playing around with "bubble wrap spoiler shields"

mild glacier Aug 31, 2024, 12:00 PM

#

thick gust ooh, mysterious

lipu ni li pu?

plucky gazelle Aug 31, 2024, 12:03 PM

#

icy turret try looking up lon lon lon lon etc in ma pona

#jaki message
jan li musi sitelen ilo!

mild glacier Aug 31, 2024, 12:04 PM

#

aa mi sona

quaint wagon Aug 31, 2024, 12:18 PM

#

plucky gazelle https://discord.com/channels/301377942062366741/316066233755631616/7696363214041...

aaaaaa
taso, ni li lon tomo jaki ala anu seme?
ilo Muni li lukin ala e tomo ni

plucky gazelle Aug 31, 2024, 12:22 PM

#

lon ala

#

lonala

quaint wagon Aug 31, 2024, 12:40 PM

#

pakalaaaa

rancid quarry Aug 31, 2024, 1:12 PM

#

i cant find the link a

quaint wagon Aug 31, 2024, 1:20 PM

#

rancid quarry i cant find the link a

https://gregdan3.github.io/ilo-muni/

rancid quarry Aug 31, 2024, 1:30 PM

#

quaint wagon https://gregdan3.github.io/ilo-muni/

thanks

verbal marten Aug 31, 2024, 6:32 PM

#

quaint wagon uhhhhhhhhh, perhaps that copy/paste blurb where lon is nearly every word in the ...

ni anu seme?

quaint wagon Aug 31, 2024, 6:37 PM

#

niiiiii

pseudo smelt Aug 31, 2024, 6:37 PM

#

nasa

latent zodiacBOT Aug 31, 2024, 7:00 PM

#

Reminder for @quaint wagon

Reminder from YAGPDB

o alasa e lon lon

pseudo smelt Aug 31, 2024, 7:23 PM

#

alasa li pini (anu seme)

quaint wagon Aug 31, 2024, 8:30 PM

#

a, pini

verbal marten Aug 31, 2024, 8:54 PM

#

taso kulupu ni pi nimi "lon" li tan seme a‽

quaint wagon Aug 31, 2024, 8:56 PM

#

verbal marten taso kulupu ni pi nimi "lon" li tan seme a‽

lukin la ona li musi sitelen kepeken ilo

verbal marten Aug 31, 2024, 8:56 PM

#

a sona nasa

pseudo smelt Aug 31, 2024, 9:00 PM

#

a

thick gust Sep 1, 2024, 12:02 AM

#

turns out they're playing some sort of chess

#

sona musi a

merry walrus Sep 1, 2024, 6:07 AM

#

lmao

mild glacier Sep 1, 2024, 6:23 AM

#

i have an idea now

#

imma make chess :D

plucky gazelle Sep 6, 2024, 1:14 PM

#

verbal marten Sep 6, 2024, 2:07 PM

#

toki. mi Sam. mi li pana wile en sona ijo toki pona.

mild glacier Sep 6, 2024, 2:20 PM

#

sina sona ala sona?

icy turret Sep 6, 2024, 4:01 PM

#

#

have we ever talked about the tenpo ni la dropoff

quaint wagon Sep 6, 2024, 4:06 PM

#

icy turret

a lot of words and phrases exhibit a significant change in usage starting in 2020, and while it's pretty easy to see the correlation with the pandemic, there's no telling what exactly that implies about the language
i would say something like, a sudden increase in skillful conversation? tenpo ni is a pretty simple and even first day of learning sort of construction used for basic conversation
try contrasting with "tenpo ni la sina pali"?

icy turret Sep 6, 2024, 4:11 PM

#

quaint wagon a lot of words and phrases exhibit a significant change in usage starting in 202...

it goes too low usage

#

but yeah

#

this is a similar dropoff but slower imo

#

oh fuck nvm, wrong smoothing

#

#

similar yeah

meager steeple Sep 6, 2024, 4:26 PM

#

icy turret Sep 8, 2024, 3:42 PM

#

#

a a mi kala.

wicked stratus Sep 8, 2024, 3:57 PM

#

a a mi kala

wicked stratus Sep 8, 2024, 3:58 PM

#

icy turret

hmm i wonder whats missing

icy turret Sep 8, 2024, 5:02 PM

#

#

placid crow Sep 8, 2024, 5:21 PM

#

icy turret

poki wesi - reddit containment center

verbal marten Sep 8, 2024, 6:21 PM

#

ma siko pi toki siko

low timber Sep 8, 2024, 6:52 PM

#

so it’s lipu Wesi, but ilo Siko

quaint wagon Sep 8, 2024, 10:08 PM

#

@icy turret

#

#

10 smoothing, because it's much more apt here

icy turret Sep 9, 2024, 4:50 AM

#

quaint wagon @icy turret

oh shit, new sources? youtube i assume?

quaint wagon Sep 9, 2024, 4:51 AM

#

icy turret oh shit, new sources? youtube i assume?

YouTube, and the old forum + its archive of the yahoo group
the data is very sparse there, but it will be available soon!

icy turret Sep 9, 2024, 4:52 AM

#

awesome

quaint wagon Sep 9, 2024, 4:52 AM

#

once again asking for any archived irc conversations lmao

#

they certainly don't exist beyond the few well known examples, sadly

icy turret Sep 9, 2024, 4:53 AM

#

quaint wagon once again asking for any archived irc conversations lmao

you know how tokipona.net had a corpus
was it just jan Kipo's corpus or

quaint wagon Sep 9, 2024, 4:54 AM

#

i don't know tbh
i think it was though

#

I saw a 2017 fb post where kipo referred to that corpus with a lot more detailed information than anyone but its author would have

icy turret Sep 9, 2024, 4:54 AM

#

whats responsible for higher traffic in 2007 and 2010 btw

quaint wagon Sep 9, 2024, 4:54 AM

#

no clue!

#

this is another instance where i spent a long time running tests and manual queries to be sure i wasn't making some error

#

for 2010, perhaps the newness of the forum itself?

#

it came out in Oct 2009

fresh sentinel Sep 10, 2024, 6:53 PM

#

quaint wagon # https://gregdan3.github.io/ilo-muni/ sina alasa e sona nimi lon ilo Muni la o ...

wicked stratus Sep 10, 2024, 8:51 PM

#

low timber Sep 10, 2024, 10:37 PM

#

a

meager steeple Sep 10, 2024, 11:13 PM

#

seme la ona li jan ala

quaint wagon Sep 10, 2024, 11:50 PM

#

meager steeple seme la ona li jan ala

tenpo weka la ona li soko

meager steeple Sep 11, 2024, 12:21 AM

#

aaaaa sona

low timber Sep 12, 2024, 11:46 PM

#

kapesi..

#

nasa a

#

kapilu

#

nimi kapesi en nimi kapilu li suli lon tenpo sama

#

nimi kap- li wawa lon sike 2021

#

n, nimi kapa ala

plush gust Sep 13, 2024, 12:30 AM

#

definitely been done before, but i find it very interesting that people are saying "o kama pona" more now vs. just "kama pona"

jan mute pi tenpo pini li toki e "kama pona" tan seme? mi la "o kama pona" makes more sense. maybe it's a difference between announcing somebody has "come well" vs. telling somebody to "well come" but msa

low timber Sep 13, 2024, 12:36 AM

#

my uneducated guess is that people are trying to steer clear of calques, and kama pona looks like an english calque for "well come"
perhaps "o kama pona" is more tokiponalike. though i don't see a problem with "kama pona" being an exclamation- though nanpa tu la it could be considered a lexicalization

i don't know

plush gust Sep 13, 2024, 1:05 AM

#

plush gust definitely been done before, but i find it very interesting that people are sayi...

hmmmmm, interesting

plush gust Sep 13, 2024, 1:05 AM

#

plush gust hmmmmm, interesting

to complete the set

thick gust Sep 13, 2024, 1:13 AM

#

interesting

low timber Sep 13, 2024, 2:21 AM

#

something happened mid 2020

plush gust Sep 13, 2024, 3:41 AM

#

low timber something happened mid 2020

the global pandemic, along with other factors, caused toki pona to have a surge in popularity around then

low timber Sep 13, 2024, 4:16 AM

#

but this is relative data, right?

plush gust Sep 13, 2024, 4:16 AM

#

lon

low timber Sep 13, 2024, 4:16 AM

#

so that shouldn't affect the words relative to each other

#

maybe the absolute data

plush gust Sep 13, 2024, 4:16 AM

#

taso toki pona li ante mute lon tenpo ni

low timber Sep 13, 2024, 4:16 AM

#

ken

plush gust Sep 13, 2024, 4:17 AM

#

with the influx of people, nasins ante'd mute

low timber Sep 13, 2024, 4:17 AM

#

i arrived some time after that, so i don't know how different the community was before then

plush gust Sep 13, 2024, 4:18 AM

#

and specifically phrases like "kama pona" were popular because new people were coming lots

low timber Sep 13, 2024, 4:18 AM

#

hmm but i'm not sure why that would have caused people to prefer one or the other

plush gust Sep 13, 2024, 4:19 AM

#

mi kin li sona ala

icy turret Sep 13, 2024, 6:50 AM

#

overall "monthly volume of toki pona" seen by ilo Muni:

#

there are high plateaus in 2007 and 2010 and we don't know what they correspond to, as of now

#

so 2017 represents the first time toki pona escapes this average level of activity, by virtue of reddit + discord

#

unless, ofc, the communities not yet in muni (like obvs all the irc chat logs which may or may not exist) reshape this graph starkly

#

@quaint wagon any reason it goes back to march 2002 instead of august 2001 btw?

#

in the sparse data pre-2017, mi + sina vs li show some amount of inverse relation, which to me sounds like alternating between mostly chatting (hence 1st and 2nd person) and prose (hence 3rd person)

#

whereas by now the community is large enough that the movement cancels out
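
One rough way to put a number on the inverse relation described above is a Pearson correlation between the monthly relative share of mi + sina and that of li. This is only a sketch: the `pearson` helper is written out here for illustration, and the two series are invented placeholder values, not figures read from ilo Muni.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of the same length (>= 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# HYPOTHETICAL monthly relative shares, invented for illustration only.
mi_plus_sina = [0.09, 0.05, 0.10, 0.04, 0.08]
li = [0.03, 0.06, 0.02, 0.07, 0.04]

r = pearson(mi_plus_sina, li)
print(f"r = {r:.2f}")  # a value near -1 means the series move in opposite directions
```

With only a handful of sparse months, a strongly negative value is suggestive at best; a couple of unusual months can dominate the result.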

plucky gazelle Sep 13, 2024, 11:54 AM

#

🔵 mi + sina 🔴 li

quaint wagon Sep 13, 2024, 1:47 PM

#

icy turret @quaint wagon any reason it goes back to march 2002 instead of august 20...

icy turret Sep 13, 2024, 1:47 PM

#

..right fair

#

do we have any known posts from even earlier?

#

regardless of ilomuniability

quaint wagon Sep 13, 2024, 1:50 PM

#

icy turret do we have any known posts from even earlier?

https://web.archive.org/web/20220808015056/https://wyub.github.io/tokiponaarchive/2001-08-08.html
this, an archive of an archive?

quaint wagon Sep 13, 2024, 1:51 PM

#

icy turret unless, ofc, the communities not yet in muni (like obvs all the irc chat logs wh...

facebook could

#

idk its exact active years but it may make a difference in the 2014-2019 range

river shard Sep 13, 2024, 1:58 PM

#

facebook groups are a pain
the API didn't work back when facebook had it normally available for groups, and now they've restricted it further

icy turret Sep 13, 2024, 1:59 PM

#

i look forward to mKS singlehandedly creating a working facebook scraper

quaint wagon Sep 13, 2024, 1:59 PM

#

icy turret i look forward to mKS singlehandedly creating a working facebook scraper

it would be the second

#

unfortunately, the first is proprietary

#

altho granted my work wasn't singlehanded on that first one but it was A Lot

icy turret Sep 13, 2024, 2:03 PM

#

interesting

#

btw honestly

#

like

#

is the reason for not using the proprietary one legal or academic (repeatability)

#

cause the contemporary fb group probably generates a negligible amount of traffic and having just a dump that never gets updated is sufficient for muni

quaint wagon Sep 13, 2024, 2:07 PM

#

icy turret is the reason for not using the proprietary one legal or academic (repeatability...

Both of these

#

I can't use it for private purposes, and nobody else would have access to it

icy turret Sep 13, 2024, 2:09 PM

#

alright fair

fresh sentinel Sep 14, 2024, 11:27 AM

#

it's repeatable because someone else can just write another facebook scraper /musi

icy turret Oct 28, 2024, 12:43 PM

#

anu-predicate

quaint wagon Oct 28, 2024, 12:49 PM

#

what about when the word "o" is next to it?

icy turret Oct 28, 2024, 12:50 PM

#

ah, my mistake

#

wide pawn Oct 28, 2024, 1:16 PM

#

maybe look at "anu ala e" too?

icy turret Oct 28, 2024, 1:20 PM

#

wide pawn maybe look at "anu ala e" too?

ive done that on a separate occasion but it wouldn't count towards "anu e" so

hidden tide Nov 26, 2024, 2:52 PM

#

here are all the pu words placed into ilo muni! have fun.
(don't click if your device can't handle the huge amount of data)
https://gregdan3.github.io/ilo-muni/?query=a%2C+akesi%2C+ala%2C+alasa%2C+ale%2C+anpa%2C+ante%2C+anu%2C+awen%2C+e%2C+en%2C+esun%2C+ijo%2C+ike%2C+ilo%2C+insa%2C+jaki%2C+jan%2C+jelo%2C+jo%2C+kala%2C+kalama%2C+kama%2C+kasi%2C+ken%2C+kepeken%2C+kili%2C+kin%2C+kipisi%2C+kiwen%2C+ko%2C+kon%2C+kule%2C+kulupu%2C+kute%2C+la%2C+lape%2C+laso%2C+lawa%2C+leko%2C+len%2C+lete%2C+li%2C+lili%2C+linja%2C+lipu%2C+loje%2C+lon%2C+luka%2C+lukin%2C+lupa%2C+ma%2C+mama%2C+mani%2C+meli%2C+mi%2C+mije%2C+moku%2C+moli%2C+monsi%2C+monsuta%2C+mu%2C+mun%2C+musi%2C+mute%2C+namako%2C+nasa%2C+nasin%2C+nena%2C+ni%2C+nimi%2C+noka%2C+o%2C+oko%2C+olin%2C+ona%2C+open%2C+pakala%2C+pali%2C+palisa%2C+pan%2C+pana%2C+pi%2C+pilin%2C+pimeja%2C+pini%2C+pipi%2C+poka%2C+poki%2C+pona%2C+pu%2C+sama%2C+seli%2C+selo%2C+seme%2C+sewi%2C+sijelo%2C+sike%2C+sin%2C+sina%2C+sinpin%2C+sitelen%2C+sona%2C+soweli%2C+suli%2C+suno%2C+supa%2C+suwi%2C+tan%2C+taso%2C+tawa%2C+telo%2C+tenpo%2C+toki%2C+tomo%2C+tu%2C+unpa%2C+uta%2C+utala%2C+walo%2C+wan%2C+waso%2C+wawa%2C+weka%2C+wile%2C+‌nanpa&minSentLen=1&scale=rel&start=1501545600&end=1722470400&smoothing=2&smoother=cwin

ilo Muni

Watch toki pona grow and change- now with graphs!
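
Links like the one above can be assembled by hand. The sketch below reuses only the parameter names visible in links shared in this channel (`query`, `minSentLen`, `scale`, `start`, `end`, `smoothing`, `smoother`); how the site interprets each one is an assumption. `start` and `end` look like Unix timestamps for the first day of a month in UTC (1501545600 is 2017-08-01 00:00 UTC). The helper names `month_start` and `muni_url` are made up for this example.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode

BASE_URL = "https://gregdan3.github.io/ilo-muni/"


def month_start(year, month):
    """Unix timestamp for 00:00 UTC on the first day of the given month."""
    return int(datetime(year, month, 1, tzinfo=timezone.utc).timestamp())


def muni_url(terms, start, end, scale="rel", smoothing=2,
             min_sent_len=1, smoother=None):
    """Build a share link; parameter meanings are assumed from existing links."""
    params = {
        "query": ", ".join(terms),  # terms are comma-separated in shared links
        "minSentLen": min_sent_len,
        "scale": scale,
        "start": start,
        "end": end,
        "smoothing": smoothing,
    }
    if smoother is not None:
        params["smoother"] = smoother  # e.g. "cwin", as seen in the link above
    # urlencode escapes ", " as "%2C+", the same form the shared links use
    return BASE_URL + "?" + urlencode(params)


url = muni_url(
    ["nanpa open", "nanpa pini"],
    start=month_start(2018, 8),
    end=month_start(2024, 8),
    smoothing=1,
)
print(url)
```

Keeping the query short avoids the load problem mentioned above; every extra term is another series the page has to fetch and draw.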

quaint wagon Nov 26, 2024, 2:54 PM

#

but.... why

#

i can SEE this: THE TOOL IS HURTING FROM SO MANY WORDS

hidden tide Nov 26, 2024, 2:58 PM

#

i'll just post screenshots

#

first off, the absolute scale! you can see how toki pona has grown over time

#

(i am interested to know what occurred in the first half of 2023, where there is a notable trough)

icy turret Nov 26, 2024, 3:01 PM

#

hidden tide (i am interested to know what occurred the first half of 2023 where there is a n...

watch the sptp2024 ilo muni presentation!

hidden tide Nov 26, 2024, 3:02 PM

#

it's more visible in the absolute entropy scale

quaint wagon Nov 26, 2024, 3:03 PM

#

first recorded usage of the entropy scale

hidden tide Nov 26, 2024, 3:06 PM

#

the entropy scale is very fun

quaint wagon Nov 26, 2024, 3:06 PM

#

hidden tide (i am interested to know what occurred the first half of 2023 where there is a n...

the moderation team of this server (and i) closed all of these channels:
#toki-moku #toki-nanpa #kalama-tpt #nasin-tpt #tpt-ale-kin

i closed them because i saw a lot of conversations like this:
1: "toki!" (hello!)
2: "toki, sina pona ala pona?" (hello, are you doing well?)
1: "mi pona. sina seme" (i'm fine. how about you)
and at that point the conversation ends. when people talk without energy and with long gaps in time, they don't end up having a substantial conversation.
when i closed the channels, i thought this: conversation time would shrink, because people would see other people's messages promptly!

but! i came to learn this: in reality, people want topics of conversation. those bring about a lot of lively conversation. if i remove the topic channels, where should people talk? nowhere. so they end up not talking.

#

(if you can't see the channels, it's because they're all closed. get the role for viewing archives.)

#

ah. i gave two of them incorrectly. but channels of the same kind existed at the time of the removal and were closed.

hidden tide Nov 26, 2024, 3:07 PM

#

got it!

quaint wagon Nov 26, 2024, 3:08 PM

#

if you want to build a chat room and a community, provide topics of conversation!

hidden tide Nov 26, 2024, 3:09 PM

#

topics of conversation are good

hidden tide Nov 26, 2024, 3:10 PM

#

hidden tide it's more visible in the absolute entropy scale

same graph, but the lines aren't smoothed
(the big red spike is the word "nena", the big white spike is the word "luka", the big blue spike is the word "unpa")

quaint wagon Nov 26, 2024, 3:12 PM

#

hidden tide same graph, but the lines aren't smoothed (the big red spike is the word "nena", the big w...

the number words spiked strangely because of this: over four days, one person worked through a lot of numbers in one channel on one server.

#

wan. tu. tu wan. tu tu. luka. luka wan. luka tu. (one. two. three. four. five. six. seven.)
they went on like this and stopped near "mute ale". (that's a number.)

hidden tide Nov 26, 2024, 3:14 PM

#

lots of numbers

#

in this graph, you can see how much each word has grown over time. the white line at the top is the word "pu." people don't say this word much, so it rises to first place.

#

if i fully smooth the strange graph, you can see the growth of toki pona over time

meager steeple Dec 4, 2024, 1:29 AM

#

ways of using "a"

quaint wagon Dec 4, 2024, 1:51 AM

#

strange, question-final phrases have really grown over time

devout jetty BOT Dec 4, 2024, 6:11 AM

#

one time i was looking at old conversations and learned this: in a past year, the phrase "anu seme" was a little strange to ijo Osuka. that's strange to me! this phrase is really important to my way of speaking

#

by their account, they use the phrase "anu seme" only a small part of the time and feel that "X ala X" is the best option

serene hollow Dec 7, 2024, 5:51 PM

#

oh my god 😭

fresh sentinel Dec 7, 2024, 5:59 PM

#

is this when kijetesantakalu was coined?

serene hollow Dec 7, 2024, 6:04 PM

#

thats the record for the usage of "suwi" from august of 2001 to aug. 2024

fresh sentinel Dec 7, 2024, 6:07 PM

#

the hovered spike I mean

icy turret Dec 7, 2024, 6:08 PM

#

fresh sentinel is this when kijetesantakalu was coined?

feb 2009, too early

devout jetty BOT Dec 7, 2024, 6:19 PM

#

what happened 😭

ilo musi Anjelita ↩️

Reply to: oh my god 😭 📎

#

I don't think there's much data for that time period, so maybe someone used suwi a bunch and it became a significant percentage of all toki pona recorded that month
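
The effect is the same arithmetic as the 20-in-1,000 versus 20-in-100,000 example from earlier in this channel: a fixed number of uses becomes a much larger share when the month's total is small. A minimal sketch, with `relative_share` as a made-up helper name and the month totals taken from that earlier example rather than from real ilo Muni data:

```python
def relative_share(hits, total):
    """Fraction of a month's recorded text that a term accounts for."""
    if total <= 0:
        raise ValueError("total must be positive")
    return hits / total


# Same 20 uses, two very different month sizes (numbers from the earlier example).
sparse_month = relative_share(20, 1_000)    # 2% of a small month
busy_month = relative_share(20, 100_000)    # 0.02% of a large month

print(f"{sparse_month:.2%} vs {busy_month:.2%}")
```

So a single enthusiastic user in a sparse month can produce a spike on the relative scale that would be invisible in a busy one.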

serene hollow Dec 7, 2024, 6:20 PM

#

fresh sentinel the hovered spike I mean

oh

serene hollow Dec 7, 2024, 6:21 PM

#

devout jetty I don't think there's much data for that time period, so maybe someone used suwi...

its gonna be someone who uses kawaii on a minutely basis watch

#ilo.muni.la: graph how toki pona is used!

https://gregdan3.github.io/ilo-muni/