randomness: (Default)
[personal profile] randomness
So, if the intent is to be able to talk to the maximum number of people on the planet, here are the languages I should learn. Numbers vary wildly, so this is only a guide. (Highest estimate for total number of users, native and non-native in millions from each language's Wikipedia page, fetched 29 May 12.):

English 1800
Mandarin Chinese 1020
(Castilian) Spanish 500
Hindi-Urdu 490
Arabic (dialect chain) 340

French 275
Russian 258
Portuguese 252
Bengali 230
Malay 180

Swahili 150
Japanese 127
German 120
Persian 110
Punjabi 104

Turkish 91
Italian 85
Javanese 85
Vietnamese 81
(Jiangxinese) Gan-Hakka Chinese 80

Thai/Lao-Isan 80
Korean 78
(Shanghainese) Wu Chinese 77
Telugu 74
Marathi 72

Gujarati 65.5
Tamil 65
(Filipino) Tagalog 64.3
Pashto 60
(Cantonese) Yue Chinese 56

Dutch/Afrikaans 51
(Hokkien) Min Nan 50
Kannada 47
Oriya 45
Ukrainian 45

Polish 44
Burmese 42

Obviously, diminishing returns set in after a while. But I've made a pretty good start on the first two. Perhaps the plan should be to learn a hundred words in each language, and be able to string them together in some way intelligible to someone who actually speaks the language.

Compiling this list has really brought home to me the messiness of language classification. It has also reminded me how true it is that languages are dialects with flags.

(no subject)

Date: 2012-05-31 05:25 pm (UTC)
From: [identity profile] r-ness.livejournal.com
You could do a similar analysis for other languages

Hey, thanks for that! It's a useful approach.

One of the other things that really might drop Arabic in particular in the rankings is that "Arabic" really is no more a single spoken language than "Chinese". (See my earlier post.) I don't have any figures on how many people actually speak Modern Standard Arabic, but it's probably a lot less than 340 million. If you consider the different varieties of Arabic in an analogous way that one generally thinks of the different varieties (dialects, languages, what have you) of Chinese, you start having to look at things like Egyptian Arabic (80 million) on its own, the way my very rough list already treats Gan-Hakka Chinese.

That gets into the question of where to draw lines between languages. I'm being relatively expansive with my definitions (Thai/Lao-Isaan, for example) because my interest is intelligibility, but I'm given to understand that most of the users of Maghrebi and Gulf Arabic (just to take a couple of geographic extremes) would have trouble talking to each other unless they were to shift to Modern Standard Arabic.

the degree of imprecision is huge.

Boy, isn't that true? I am really suspicious of any of the numbers I listed from Wikipedia that have more than two significant digits. Sometimes I feel like even that's relying on their numbers way too much. :)

One of the interesting things that this turns up is some general evidence for the collapse of French as an international language, although that might just be artifacts of data.

French usage in particular shows a couple of countervailing trends. On the one hand, it really is becoming less useful in the world in general. On the other hand, more and more people are learning it to talk to each other across Francophone Africa, which is itself a part of the world showing significant population growth.

Language is messy. But you were a linguistics major, many many moons ago, you understand that. :)

Profile

randomness: (Default)
Randomness

November 2024

S M T W T F S
     12
3456789
10111213141516
171819 20212223
24252627282930

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags