Missing letters in Romanian?

Greg Shenaut shared this idea 10 months ago
Completed

I just found this site, and I like it a lot! However, one of the languages I tried is Romanian, and it appears that several letters in their alphabet are completely missing from the word list. Specifically, ăĂ â îÎ șȘ țȚ are nowhere to be found. Here is an illustrative sample text from omniglot.com: « Toate ființele umane se nasc libere și egale în demnitate și în drepturi. Ele sunt înzestrate cu rațiune și conștiință și trebuie să se comporte unele față de altele în spiritul fraternității. » One can work around this gap via a custom list, but unless I'm missing something, I think the Romanian word list should be updated. Thanks! By the way, be careful: ș (comma; Mac unicode option-p) is not the same letter as ş (cedilla; Mac unicode option-c), etc., but some sites use cedillas instead of commas under s and t because the cedilla is generally more available on keyboard layouts. Also note that Romanian ă (Mac unicode option-b) is not the same as ǎ (Mac unicode option-v).

Comments (7)

photo
66

Hello,

You are absolutely right, the top 200 definitely needs an update. I see no reason to remove the diacritics in Romanian and to keep them in all other languages. Moreover, the words used are quite long / difficult and are not the 200 most common. The wordlist should be the real top 200 with diacritics.

However, you will notice that the diacritics are present in the advanced Romanian typing test. Besides, there is the same problem in Albanian: the top 1000 has the diacritics but not the top 200.

If the admin agrees, here is a list that could be used:

sa|nu|de|o|ca|și|a|ce|în|e|am|pe|te|la|mai|cu|un|asta|ai|ma|da|sunt|bine|eu|pentru|se|este|dar|mi|tu|care|aici|va|fi|din|dacă|așa|ne|ești|trebuie|ar|fost|cum|acum|îmi|ceva|tine|mine|el|știu|când|voi|lui|doar|unde|au|ei|pot|poate|noi|cred|face|vrei|mea|făcut|meu|nici|tot|ea|știi|poți|le|mult|acolo|îți|vreau|ta|faci|sau|buna|nimic|era|putea|rău|spus|pare|spune|cine|vom|aceasta|foarte|chiar|după|trebui|tău|as|rog|viaţa|avem|prea|fie|atunci|încă|fac|despre|cat|mulțumesc|are|hai|fata|până|timp|nevoie|vor|al|văzut|într|haide|crezi|niciodată|spun|toate|vei|regula|decât|suntem|cineva|pilot|toți|nava|lasa|cel|putem|naiba|jos|iar|vrea|puțin|ia|deci|stai|vad|acest|vezi|uite|aia|mergem|fără|bun|ați|data|sigur|domnule|gata|înapoi|avea|totul|tata|toată|mama|mare|facem|venit|înainte|ani|zis|avut|fii|fel|dat|spui|merge|întâmpla|unul|prin|place|lucru|pai|ceea|afara|fiu|nimeni|grija|acasă|zi|astea|timpul|atât|loc|treaba|numai|lor|omule|doua|uita|repede|orice|asculta|adevărat|nou

Source: https://en.wiktionary.org/wiki/Wiktionary:Frequency_lists/Romanian

photo
57

The wordlist has been updated, it might take up to 1 hour for the cache to update to the new words.

photo
60

Awesome!

Just a little mistake in the list: the word viaţa (with cedilla) should be replaced by viața (with comma).

Thanks. ;)

photo
57

updated

photo
67

You shouldn't have done this. Many type without the diacritics and their speed decreases with them.

photo
53

but hey real types have to deal with it

photo
2

I think the truth is actually somewhere in-between. Though it's correct to spell with diacritics, most Romanians don't. They never have nor will they ever. You're not required to even when writing an official notice to the authorities. I've been typing for a very long time and work with a PC all day, never have I ever been bothered to even install RO keyboard language for the special characters. So, even if it's the "correct" way to type, it's not how people do it and it's definitely not how most of the people who want to type fast do it.


Ideally this should be optional, for it may depend on how you're used to typing. If, for some reason, you're used to typing with special characters then it will be difficult to readjust without them. The other way around it's just the same. And if you check messages on social media for example, most of the people don't use the special characters unless they're typing from a mobile phone that uses auto correct.


But if implementing an on/off switch is too difficult then I vote against having them as, theoretically, anyone who's typing in Romanian and English is used to NOT using them. The other way around doesn't apply, not everyone is used to using them.


Thanks,

Theo.

photo