Indic (Unicode) language statistics on web September 2011

Indic (Unicode) language statistics on web September 2011

Kaśyap కశ్యప్
Hi

I want to share some statistics on Indic languages on the web in Unicode. I ran a simple search on 18 September 2011 at 9:00 pm, using each language's name in Unicode as the query, on three leading search engines: Google, Yahoo, and Bing. I didn't include Sanskrit because it is based on the Devanagari script. These latest estimates can help gauge Internet users by language (Unicode), given the lack of other sources. Because of data mining and technical issues, please consider this analysis to be for information purposes only.

Language                      Google           Yahoo            Bing
অসমীয়া (Assamese)            739,000          91,500         196,000
বাংলা (Bengali)            20,400,000      49,600,000      18,600,000
English                 8,510,000,000   2,180,000,000   4,190,000,000
فارسی (Farsi)             222,000,000     193,000,000      57,000,000
ગુજરાતી (Gujarati)         12,000,000       4,680,000       4,190,000
हिन्दी (Hindi)            251,000,000      70,500,000      17,400,000
ಕನ್ನಡ (Kannada)            12,700,000      19,500,000       6,130,000
کًشُر (Kashmiri)                44,300         424,000           7,160
മലയാളം (Malayalam)         23,800,000      36,400,000      13,800,000
मराठी (Marathi)            17,500,000      10,800,000      10,600,000
ଓଡ଼ିଆ (Oriya)                1,550,000         123,000         141,000
ਪੰਜਾਬੀ (Punjabi)            23,000,000       4,490,000       2,110,000
தமிழ் (Tamil)              59,800,000      66,100,000      16,600,000
తెలుగు (Telugu)            40,100,000      24,900,000       9,230,000
اردو (Urdu)                49,900,000      43,100,000      10,500,000
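
To get a rough feel for how far the engines disagree on these counts, here is a minimal Python sketch. It uses only the Google and Yahoo columns from the table above; the names COUNTS, disagreement and rank_by_disagreement are hypothetical helpers introduced for illustration and are not part of any tool used to collect the numbers.

```python
# Hit counts copied from the Google and Yahoo columns of the table above.
# Each value is (google_count, yahoo_count).
COUNTS = {
    "Assamese": (739_000, 91_500),
    "Bengali": (20_400_000, 49_600_000),
    "English": (8_510_000_000, 2_180_000_000),
    "Farsi": (222_000_000, 193_000_000),
    "Gujarati": (12_000_000, 4_680_000),
    "Hindi": (251_000_000, 70_500_000),
    "Kannada": (12_700_000, 19_500_000),
    "Kashmiri": (44_300, 424_000),
    "Malayalam": (23_800_000, 36_400_000),
    "Marathi": (17_500_000, 10_800_000),
    "Oriya": (1_550_000, 123_000),
    "Punjabi": (23_000_000, 4_490_000),
    "Tamil": (59_800_000, 66_100_000),
    "Telugu": (40_100_000, 24_900_000),
    "Urdu": (49_900_000, 43_100_000),
}


def disagreement(google, yahoo):
    """Return how many times larger the bigger count is than the smaller one."""
    return max(google, yahoo) / min(google, yahoo)


def rank_by_disagreement(counts):
    """Sort language names from largest to smallest Google/Yahoo disagreement."""
    return sorted(counts, key=lambda name: disagreement(*counts[name]), reverse=True)


if __name__ == "__main__":
    for name in rank_by_disagreement(COUNTS):
        g, y = COUNTS[name]
        print(f"{name:<10} Google={g:>13,} Yahoo={y:>13,} ratio={disagreement(g, y):.1f}x")
```

On these figures the two engines differ by roughly 12.6 times for Oriya and by only about 1.1 times for Tamil, which is one reason to read the counts as rough indicators rather than measurements.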

Regards

--
Your well-wisher,
కశ్యప్
kaburlu.wordpress.com
9396533666

_______________________________________________
Wikimediaindia-l mailing list
[hidden email]
https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l

Re: Indic (Unicode) language statistics on web September 2011

Subhashish Panigrahi
Thanks a lot Kaśyap for this great input.

Cheers
Subha

--
 ସୁଭାସିସ  ପାଣିଗାହି 
SubhasisaPanigahi
ଓଡ଼ିଆ ଉଇକିପିଡ଼ିଆ
_______________________________________________
Wikipedia Odia (Oriya) mailing list
[hidden email]
https://lists.wikimedia.org/mailman/listinfo/wikipedia-or

facebook.com/OdiaWiki
Tweet @OdiaWiki


Re: Indic (Unicode) language statistics on web September 2011

Vickram Crishna-2


On Mon, Sep 19, 2011 at 10:10 AM, Subhashish Panigrahi <[hidden email]> wrote:
Thanks a lot Kaśyap for this great input.

It's very enlightening. Is there some way (perhaps a wikiproject) this information can be tracked on an ongoing basis, so we can see at a glance how user preferences change over time (grow/modify)? I think that would be a pretty fantastic measure of the amount of growth of information and its usefulness over time.  


--
Vickram
Fool On The Hill

Re: Indic (Unicode) language statistics on web September 2011

Rajesh Pandey-2
Thanks, Kaśyap, for this. People have started writing in their mother tongues and in their own scripts. That's really good, and it feels even better that Wikipedia is partly responsible for it.

Is there a tool that would give us this data for other wikiprojects as well?

--
Rajesh Pandey
