Wait, why is J only worth 8 then while Q and Z are 10?
Jonesm1 on
It would be interesting to compare this (distribution in 460k words) with the distribution in the relatively small set of maybe 20k words people really use. Few people have a vocabulary of over 30k words.
underlander on
label your axes people
aiahiced on
Saw the article, then the Letter E and mouthed ‘Really?’ Realized it has an E, then said ‘Oh Yeah’ lol
Lurking_all_the_time on
Interestingly printing machinery listed etaoin shrdlu as the most common letters.
I wonder why – a change in usage since the early 1900s or different usage patterns?
The S one feels like cheating because you get another count for each countable noun.
ohiocodernumerouno on
now perform this analysis on a about the Java programming language.
Bunkyo-Koishikawa on
Cool. Would also be interested in seeing the most used phonemes.
Gargomon251 on
I remember at one point it was ETAOINSHRDLU but I know language has changed over the years
JeeEyeElElEeTeeTeeEe on
Back in the day when I took a cryptography class, I memorized the alphabet in this order. Need to know the most common letters for most substitution cyphers.
fromYYZtoSEA on
And this is how cryptographers can very easily break basic “substitution ciphers” (where you replace a letter for another, for example replacing E with Z in every occurrence): https://en.wikipedia.org/wiki/Frequency_analysis
AgradableSujeto on
That’s right U, know your fucking place. You are not welcomed amongst the rest of the vowels.
21 Comments
Source: [https://github.com/dwyl/english-words?tab=readme-ov-file](https://github.com/dwyl/english-words?tab=readme-ov-file)
Tool: Custom python script to get data, Excel to visualise
J being that rare is quite surprising.
EIAON, looks like an average word in Irish
Clearly we need to start uuusing ewe more.
Hmm j is really getting under used, or next add to the English language needs to be like jjujj the act of getting scammed by a Ai or something.
Where’s the Batman symbol?
(https://youtu.be/cTBuj9TC-40?si=oj76-LtgeVh3Yy7R)
RSTLN E. Need three more consonants and a vowel.
Wait, why is J only worth 8 then while Q and Z are 10?
It would be interesting to compare this (distribution in 460k words) with the distribution in the relatively small set of maybe 20k words people really use. Few people have a vocabulary of over 30k words.
label your axes people
Saw the article, then the Letter E and mouthed ‘Really?’ Realized it has an E, then said ‘Oh Yeah’ lol
Interestingly printing machinery listed etaoin shrdlu as the most common letters.
I wonder why – a change in usage since the early 1900s or different usage patterns?
[https://en.wikipedia.org/wiki/Etaoin_shrdlu](https://en.wikipedia.org/wiki/Etaoin_shrdlu)
I’m ready for my next game of hangman
is there an r/citations are beautiful? lmao.
The S one feels like cheating because you get another count for each countable noun.
now perform this analysis on a about the Java programming language.
Cool. Would also be interested in seeing the most used phonemes.
I remember at one point it was ETAOINSHRDLU but I know language has changed over the years
Back in the day when I took a cryptography class, I memorized the alphabet in this order. Need to know the most common letters for most substitution cyphers.
And this is how cryptographers can very easily break basic “substitution ciphers” (where you replace a letter for another, for example replacing E with Z in every occurrence): https://en.wikipedia.org/wiki/Frequency_analysis
That’s right U, know your fucking place. You are not welcomed amongst the rest of the vowels.