In the comments section, James Schipper responds to the notion that English has more words than, say, Swedish:
Measuring the number of words in a language isn’t very scientific. What is a word? Is it anything that is separated by empty space? If so, then the more compounds a language writes as one word, the more words it has. Bookkeeper and steamship would count as words in their own right, but book publisher and passenger ship would not. I’m currently reading a book by your Swedish colleague Mikael Parkvall about language myths. One myth that he discusses is Engelska har fler ord än svenska = English has more words than Swedish. He says that no evidence is ever provided for the claim, except to say that English has borrowed a lot. He mocks an English chauvinist who states that English has over 1 million words and French about 100,000 and who then says that English borrowed a lot from other languages, especially from French. In other words, English is rich and French poor because English borrowed a lot from French. As Parkvall sarcastically notes, the English must have borrowed a lot of words from the French without ever paying them back. He says that if all the works of Shakespeare are run through a computer program designed to count words, the result is 29,066 words. However, if all the works of August Strindberg are run through the same program, the result is 119,288 words! I can easily see why the Swedish count is so high. In Swedish, all nominal compounds are written as one word and the definite article is a suffix. On top of that, the genitive is used more than in English. We have for instance bil = car, bilen = the car, bilar = cars, bilarna = the cars, bils = of a car, bilens = of the car, bilars = of cars, bilarnas = of the cars, bilolycka = car accident, bilägare = car owner, bilmekaniker = car mechanic, bilparkering = car parking, bilbälte = seat belt, etc. How can a computer or anybody else decide how many of these are separate words? When the French language had a lot of prestige, people were saying that it was exceptionally clear. 
Now that English is very prestigious, we keep hearing that it is exceptionally rich. In any case, languages that have borrowed a lot are not uncommon. Moreover, the more a language borrows, the greater the probability that the borrowed words simply displace native words, in which case no enrichment takes place.
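To see how much a count like this depends on spelling conventions, here is a rough sketch in Python. It is not the program Parkvall describes, only an illustration that assumes a "word" is any unbroken run of letters. The function name count_word_types is made up for the example, and the two strings are simply the Swedish forms from the comment and their English glosses.

```python
import re


def count_word_types(text):
    """Count distinct word forms, treating any unbroken run of letters as one word.

    Assumption for illustration only: no lemmatizing, no compound splitting.
    """
    return len(set(re.findall(r"\w+", text.lower())))


# The Swedish forms listed in the comment above.
swedish = (
    "bil bilen bilar bilarna bils bilens bilars bilarnas "
    "bilolycka bilägare bilmekaniker bilparkering bilbälte"
)

# Their English glosses, as given in the comment.
english = (
    "car, the car, cars, the cars, of a car, of the car, of cars, "
    "of the cars, car accident, car owner, car mechanic, car parking, seat belt"
)

print(count_word_types(swedish))  # 13
print(count_word_types(english))  # 11
```

Every Swedish form counts as a new word, while the English glosses keep reusing car, the and of. Across a whole body of work, where words like accident, owner and mechanic turn up on their own anyway, that gap would presumably keep widening, which fits the Strindberg and Shakespeare figures quoted above.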
I once read a claim that Dutch has 4 million words!
On my other site, we translate a lot of posts into other languages. So far, we have done Spanish, Portuguese, Italian, Norwegian, Swedish, Finnish, Serbo-Croatian, German, French, Bulgarian, Romanian, Polish and Korean. I have had few complaints from translators along the lines of “we don’t have a word or phrase in our language for that English word or phrase.” Cases of having to use an English word or phrase because no translation was available are few. However, Korean did seem to stick out. I am told by Korean speakers that Korean has few to no synonyms. I knew a young Korean-American woman who was stunned by the number of synonyms in English. The Koreans think the plethora of English synonyms is somewhere between ridiculous and idiotic. Why do you need more than one word with the same meaning? Norwegian, a very small language in terms of speakers, struck me as being particularly word-rich for some reason.
An interesting question is how many words a typical primitive language had or has. A study was recently done on Yaghan, a language of Tierra del Fuego in South America. A recent dictionary of Yaghan listed around 30,000 words! The author made the supposition that your typical primitive language pre-contact had around 30,000 words. No one knows for sure.
I worked for 1½ years on a California Indian language called Chukchansi. It’s true that the language lacked words for a lot of modern concepts, many of the more obscure body parts, and many fine gradations of meaning. The speakers were all elderly and spoke English well. The last near-monolingual speaker died around 1965. She spoke English, but it was broken English. Speakers like her are helpful for a language. I heard from people who knew this woman that she had coined many Chukchansi borrowings and calques for words having to do with modern living. 
When the last of the monolingual or near-monolingual speakers die, your small language may end up in bad shape. Calques and proper borrowings adapted to the phonology of the receiving language will simply disappear. We have many speakers of a SE Asian language called Hmong around here. It has millions of native speakers, but I understand that it lacks many words for modern concepts, even though there are large numbers of monolinguals and near-monolinguals around here, mostly older people and especially women. I don’t understand why they don’t borrow English words or engage in calquing or word formation. The Hmong have an interesting cultural concept: if you are over 40, they say that you are too old to learn a foreign language. Hence, a lot of the older Hmong, especially the women, simply do not even try to learn English here in the US.