Gerv, this sounds really great - kudos to all the localizers as well as L10n-drives and even developers who makes that possible!
That said, once the FF 3.5 hype dies off a little bit, I might come back to you for any analysis of what coverage we are able to provide with SeaMonkey 1.1.x, the en-US-only alphas (just as a baseline) and the first in-sync-localized SeaMonkey release ever, the upcoming Beta 1. Even though the numbers will surely not look nearly as good as any of the coverages in your table, it would be nice if we can come up with some numbers that visualize where we are there.
Posted by Robert Kaiser at June 29, 2009 8:30 PMKairo: If you send me a list of the languages you've got, I can get you stats in about 5 minutes :-)
Gerv
Posted by Gerv at June 29, 2009 8:34 PMThat's in accordance with Asa's recent and now famous graph.
Posted by Funtomas at June 29, 2009 8:55 PMGood show!
I do have a couple of minor quibbles...
First, I'm not sure I'd really put Esperanto on that "only game in town" list. Pretty much everyone who speaks it is an enthusiast, kind of like the people who speak Klingon (albeit with different motivations). They're all fluent in other languages, by necessity. (Notwithstanding the widely-repeated claim that there are "a thousand" native speakers, a claim that Lindstedt appears to have pulled out of his backside and for which he provides no citation, I've never seen ANYONE claim that there are real-world towns and villages of native speakers of Esperanto.) Providing an Esperanto localization for them to play with is cool and everything, but it doesn't make the browser "the only game in town" for them in at all the same way that a Persian/Farsi localization would be for many Iranians (most of whom don't have regular internet access so far, but, as you point out, that may change over time).
Also, I strongly suspect that the numbers display more precision than they really have. 95.7% makes it sound like you've actually got a reasonably accurate count of how many internet users there are in, say, India. Since so many users share computers, I very much doubt a count with three significant digits is really possible. On top of that, multiplying the percentage of first-language speakers by the number of internet users implicitely assumes there's no correlation between first language and internet use, which is likely invalid in some cases. (I'd definitely put money on there being a correlation between first language and internet use in Iran, for instance.)
Still, attempting to track this is great, and I'm sure the effort you've put into it is very much appreciated. I sincerely doubt anyone has a way to calculate more reliable numbers than yours, and it's important for the translation community to have, as you put it, goalposts. So despite the inherent difficulties, it's a good work you're doing. Your numbers are probably a pretty rough estimate, but it's the best thing available and a good deal better than not *having* an estimate.
Purely out of curiousity, what languages does IE have that Firefox lacks?
Posted by Jonadab the Unsightly One at June 30, 2009 3:13 PMJonadab: what I said was "If you want a current browser in Esperanto, Firefox is the only game in town." I agree that there would be very few people who need a current browser in Esperanto, but that's not the same thing.
As for precision, I already knocked a digit off my spreadsheet's number. 2 digits of precision would be, I think, a bit thin, three digits is arguably too precise. Do we really serve more first-language speakers than IE? A statistician would plug in all the data and its probabilities, standard deviations and so on and tell you that there's a 78.3% chance that we do, but I'm happy just to use 3 digit precision. After all, if they beat us on this measure (which they could do by e.g. shipping a Persian localization) then I'd take the medicine like a man.
You are right about assuming there's no correlation between language and Internet use, and that being a flaw. But, as you say, there are no better stats until countries start adding "do you have access to the Internet" to the census, and releasing breakdowns by language. Given that many countries only do a census every 10 years, it'll be a little while...
For languages IE 8 has that Firefox lacks, see the spreadsheet :-) Bosnian, Konkani, Kyrgyz, Malay, and Uzbek.
Posted by Gerv at June 30, 2009 3:59 PMWhat about Latin? There are many people studying classic languages. Just for fun. And it would attract some media attention.
Posted by Lucas at July 6, 2009 10:09 PMThere's a translation of Firefox 3 into Latin for Ubuntu, but we'd need a volunteer willing to finish and upstream it.
Posted by Gerv at July 7, 2009 1:18 PM