Deykun@kbin.social to DACH - jetzt auf feddit.org@feddit.de · 8 months ago
German heatmap based on Wikipedia articles 🇩🇪 (image, media.kbin.social)
GravitySpoiled@lemmy.ml · 8 months ago: A million words doesn’t sound like a lot.
Deykun@kbin.social (OP) · 8 months ago: To clarify, what matters is not the total number of words but the number of unique words considered.
sbv@sh.itjust.works · 8 months ago: That million words sounds like a lot.
GBU_28@lemm.ee · 8 months ago: Have you considered a similarity search approach? It would handle your oddly specific synonym issue.
Localhorst86@feddit.de · 8 months ago: every word in one single picture:
Anekdoteles@feddit.de · 8 months ago: It’s incomplete, as you will only find 95% of the words used on ich_iel.
Oh? Name all of them.