Dear Ammar,
An automated way of doing this is to crawl the web and identify the language of each page on the fly.
From what I have read, Nutch's crawler may support Urdu language detection, but I'm not sure about this.
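As a rough illustration of identifying a page's language on the fly, a crawler could run a simple character-based check on the text it extracts from each page. The sketch below is only a heuristic under stated assumptions and is not how Nutch or any existing identifier works: the function name looks_like_urdu, the two thresholds, and the choice of marker letters are all hypothetical. It assumes that Urdu text is written mostly in Arabic script and contains letters that are rare or absent in standard Arabic and Persian (these letters also occur in some related languages, such as Punjabi in Shahmukhi script, so false positives are possible).

```python
# Letters characteristic of Urdu: tte, ddal, rre, noon ghunna,
# do-chashmi he, bari ye. This marker set is an assumption for illustration.
URDU_MARKERS = set("\u0679\u0688\u0691\u06BA\u06BE\u06D2")


def looks_like_urdu(text, min_script_ratio=0.5, min_marker_ratio=0.02):
    """Heuristically guess whether text is Urdu (thresholds are assumptions)."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return False
    # Keep only letters from the basic Arabic Unicode block.
    script = [ch for ch in letters if "\u0600" <= ch <= "\u06FF"]
    if len(script) / len(letters) < min_script_ratio:
        return False
    # Require a minimum share of Urdu-specific marker letters.
    markers = sum(1 for ch in script if ch in URDU_MARKERS)
    return markers / len(script) >= min_marker_ratio
```

A crawler would call looks_like_urdu on the visible text of each fetched page and keep the URLs for which it returns True. A trained language identifier would be more reliable than this check, especially for short pages.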
However, Dr. Sarmad Hussain of the Centre for Language Engineering (CLE) at UET Lahore has done extensive work on the Urdu language.
One of his groups also developed a language identifier for Urdu.
You may contact him - he is quite an expert in the area. I'm sure he will point you to several of the resources you're looking for.
--
Kind Regards,
Yasir.
On Mon, Jan 16, 2012 at 6:39 PM, <ammar@brain.net.pk> wrote:
Dear All,
Under an ICT4D project we are trying to map the Urdu content presently
available on the web. Any information on portals, sources, leads, links, or web sites
would help us a lot.
Information on persons and organizations working on preparing, translating
or displaying Urdu content on the web would also help us achieve our
objectives.
Information, suggestions, comments are requested.
Ammar Jaffri
0300-8551479