More

Wikipedia Language Maps Created By Oxford Internet Institute's Mark Graham (PICTURES)

The Huffington Post   Ramona Emerson and Catharine Smith First Posted: 11/13/11 05:01 PM ET Updated: 11/13/11 05:01 PM ET

Wikipedia Language Maps

Wikipedia, the world's largest online encyclopedia, ranks near the top of the Internet's most-visited websites. According to web data-tracking company Alexia, Wikipedia currently sits fifth from the top of the 500 most popular sites in the world; at the time of this writing, only Yahoo, YouTube, Facebook and Google enjoyed more traffic. But despite its global reach, Wikipedia's users tend to favor certain languages above others.

Mark Graham, a research fellow at the Oxford Internet Institute, led a team of researchers who broke down Wikipedia's geotagged articles by language and examined the global scope of the encyclopedia. They plotted these data onto maps of the world to show the spread of languages within the encyclopedia.

Perhaps unsurprisingly, the most geotagged entries had been written for Wikipedia's English-language edition. Geotagging data collected by Graham's team shows that the United States and Europe are the two most popular regions written about in English.

Indeed, an overview of English Wikipedia reveals it to be Wikipedia's largest edition, with over 3,795,081 articles. According to Graham, the English edition also has over 700,000 geotagged articles, the most of any of Wikipedia's 282 language editions.

And where do these contributors primarily reside? Wikipedia's most recent analysis of the origin of page edits found that a whopping 44 percent of English-language edits are made by people inside the United States, 16.3 percent are from the United Kingdom and 6.6 percent are from Canada.

Graham's team also looked at and mapped geotagged articles written in less commonly used languages, such as Arabic, Persian, Hebrew and Swahili.

The Guardian points out that Graham has noted some interesting oddities in his examination of these maps. Says Graham,

If you look closely at the Arabic or Persian maps you might see some interesting patterns (for instance look closely at the patterns in the US). You see a similar sort of unexpected spatial distribution of articles in the map of Swahili Wikipedia [...] (i.e. why are there so many articles in Turkey?). The answer is simply a few dedicated editors creating stub articles about relatively structured topics such as cities in Turkey (in the Swahili Wikipedia) or every county in the US state of Georgia (in the Arabic Wikipedia).

Wikipedia founder Jimmy Wales recently sat down with The Huffington Post's Bianca Bosker and discussed the project's goals and challenges. He also mentioned the disparity that exists between the site English-language version and its non-English counterparts and spoke of narrowing that gap by encouraging Wikipedia's growth around the world.

“Wikipedia’s vision statement is to provide a free encyclopedia for every single person on the planet in their own language, so it’s core to our mission that we focus a lot on the developing world where Wikipedia is behind where it could be,” Wales told The Huffington Post. “We already have a free encyclopedia in English, and we want to improve it, but really, I’m more and more focused on growth in the developing world.”

Take a look at the slideshow (below) to see some of the languages that Graham and his team mapped. You can view more of Mark Graham's work at his website, Zero Geography. You can also follow him on Twitter (@geoplace).

For more fun with Wikipedia's geotagged articles, check out a visual history of the world, as told by Wikipedia.


FOLLOW HUFFPOST TECH

Wikipedia, the world's largest online encyclopedia, ranks near the top of the Internet's most-visited websites. According to web data-tracking company Alexia, Wikipedia currently sits fifth from the t...
Wikipedia, the world's largest online encyclopedia, ranks near the top of the Internet's most-visited websites. According to web data-tracking company Alexia, Wikipedia currently sits fifth from the t...
 
 
  • Comments
  • 25
  • Pending Comments
  • 0
  • View FAQ
Post Comment Preview Comment
To reply to a Comment: Click "Reply" at the bottom of the comment; after being approved your comment will appear directly underneath the comment you replied to.
View All
Favorites
Recency  | 
Popularity
07:20 PM on 11/14/2011
The web data tracking company is Alexa not Alexia!
photo
Mister Grumpy
An Angry American
04:28 PM on 11/14/2011
It's obvious from the latest Republican debates that they don't view facts as being very important....... so we know the data isn't skewed very much by their unwillingness to use the service........
photo
Anonmouse33
'Ice cream for the voiceless' --Anarchy
09:25 AM on 11/14/2011
i blame obama
This user has chosen to opt out of the Badges program
TheCycad
Shape The Future, Don't Be Swept Away By It
02:58 PM on 11/14/2011
LOL
04:51 AM on 11/14/2011
How accurate is this?

How is it that Turkey has the most Swahili edits in the world?!
photo
LMPE
I connect the most dissimilar things
01:23 PM on 11/14/2011
Maybe a bunch of Swahili-speakers moved to Turkey.
photo
BrassOnes
Hasa Diga Eebowai
08:20 PM on 11/14/2011
Some of the hebrew was posted from the middle of oceans and on northern Canadian islands. You can tag at as from anywhere you like. So the trends are real, but not every dot is real.
02:03 PM on 11/15/2011
You missunderstand.

The dots are not edit locations. They are locations of the object or event described in a given article. Examples: Big Ben, Battle of the Bulge, Nile delta....
02:31 AM on 11/14/2011
English is still the most popular language in the world. (The language of technology.) So relax Bachman and Cain.
photo
HUFFPOST SUPER USER
Gottlieb
hated by left since 1973 and right since 1982
09:08 PM on 11/13/2011
Great maps, Wikipedia is not peer reviewed but still a great starting point for finding out about all things.
photo
HUFFPOST SUPER USER
topkatnc
Give a stray cat or dog a chance .
12:11 PM on 11/14/2011
I agree .. I use it everyday .. I like to look up movies that I am watching .. Or people that I am reading about .. I get alot of information from them ,while I am on the Internet .
photo
LMPE
I connect the most dissimilar things
01:20 PM on 11/14/2011
Same here.
This user has chosen to opt out of the Badges program
TheCycad
Shape The Future, Don't Be Swept Away By It
02:59 PM on 11/14/2011
agreed... I have seen many, many blatant errors on wikipedia, and as such should NEVER EVER be trusted as a source... but it is a good way to open the door... often the citations are the best part of the article.
06:52 PM on 11/13/2011
The Arabic map is particularly interesting. You can see entire states have been left out - especially obvious in the south is Georgia, but not South Carolina.

I wonder why?
photo
HUFFPOST SUPER USER
Jonathan Lawson
08:48 AM on 11/14/2011
having lived my whole live in NC and VA this map is particularly interesting to me as well.
photo
HUFFPOST SUPER USER
topkatnc
Give a stray cat or dog a chance .
12:17 PM on 11/14/2011
It looks like in some places we don't even exist .
10:52 AM on 11/14/2011
Yeah it was the first thing I noticed, being an SC resident. I have a hard time believing geo-tagged articles adhere to state boundaries so closely. I wonder what the real reason is.
photo
LMPE
I connect the most dissimilar things
06:23 PM on 11/13/2011
I've always thought that one of the neatest things about a given Wikipedia article is the languages that it links to. As can be expected, geographical divisions, languages, heads of state and events that severely impacted the course of history have articles in almost every language. But other topics have interesting assortments. For example:

Helicoid has pages in Arabic, Dutch, English, French, German, Hungarian, Italian, Polish, Russian and Ukrainian.

SS Edmund Fitzgerald has pages in Danish, English, Finnish, French, German, Indonesian, Italian, Japanese, Simple English, Spanish and Welsh.

St. Brice's Day Massacre has pages in English, Esperanto, French, German, Italian, Norwegian, Persian, Russian, Swedish, Ukrainian and Welsh.

Jalapeño (get ready for this one) has pages in Bulgarian, Chinese, Czech, Danish, Dutch, English, Finnish, French, Galician, Hungarian, Indonesian, Japanese, Korean, Norwegian, Polish, Portuguese, Russian, Simple English, Spanish, Swedish and Turkish.

It's only thanks to Wikipedia that I even know of the existence of languages like Alemannic, Gilaki, Igbo, Min Nan, Piedmontese, Waray-Waray, Yoruba and Zazaki. Sometimes I'll be scrolling down the page, and the language column shows languages of which I've never previously heard. Cool stuff.

So the next time that you want to read about coffee in Kinyarwanda or prose in Chuvash, you know where to go.
photo
LMPE
I connect the most dissimilar things
06:38 PM on 11/13/2011
PS: prose also has pages in Cebuano, Hindi, Kyrgyz, Mongol, Ruthenian, Tibetan and Yiddish, among others.
02:26 AM on 11/14/2011
What, no Papiamento. (The language of Dutch west Indies.)