Frontline
Volume 24 - Issue 9 :: May. 05-18, 2007
INDIA'S NATIONAL MAGAZINE
from the publishers of THE HINDU
Contents

Printer Friendly Page Send this Article to a Friend

SPECIAL FEATURE: C-DAC

Multilingual Computing



Recently released language CDs, a splendid addition to the resource base in Indian language computing. C-DAC is the largest single contributor to this effort.

FROM the heady days in the late 1980s when C-DAC kickstarted Indian language text creation and editing with its pioneering Graphical and Intelligence-based Script Technology (GIST) plug-ins for the personal computer, the institution has played a central role in bringing the benefits of computer technology to the `other India' for which English was not an option. Its home base in Pune resulted in that city becoming the epicentre of vernacular computing developments, with dozens of private players basing their development efforts there.

A decade later, in 1997, C-DAC released the "Leap" multilingual office productivity suite, which allowed users in multiple Indian languages to produce text documents and presentation material and create professional quality brochures and flyers in much the same way that Desk Top Publishing (DTP) tools were being harnessed in English.

Since then, it has steadily released plug-ins, clip art, mailing tools and other essential office aids in all the major Indian languages. But even when judged against this consistent output, the release a few weeks ago, by the Union Ministry of Information Technology, of 10 separate CD-based packages in as many languages, each containing word processing and spreadsheet tools, multiple font libraries, keyboard drivers, spelling checkers, transliteration tools and even a text-to-speech converter, was an awesome addition to the resource base in Indian language computing. What is more, all these were free - one could download the tools from a website (www.ildc.gov.in) or order them on a CD (also free).

By end-2007, C-DAC, which is the largest single contributor to this effort, expects to cover the remaining 12 languages. The centre wisely absorbed some excellent technology created at places such as the Utkal University, Bhubaneswar (for the CD in Oriya); the Indian Institute of Science, Bangalore (in Kannada); and Indian Institute of Technology, Guwahati (in Assamese).

A key contributor was Cyberscape India, makers of the well-known Akruti range of language computing products. Modular Systems, creators of the Ankur language solutions, pitched in with some inputs.

At the base of all these tool sets lay an open-source operating system that C-DAC has built to run "Bharateeya OO", C-DAC's realisation of an open-source office suite. But the CDs are tailored to run equally well under Windows or Linux.

Ever since the language tools were made available, over 7.5 lakh copies have been downloaded and about 3.5 lakh free CDs ordered.

"The Basic Information Processing Toolkit (BIPK) being released by way of language CD for free mass usage will steer the movement for content creation in Indian languages," feels Mahesh D. Kulkarni, Programme Coordinator, GIST. "The contents in Indian languages are vital resources for further development of NLP [Natural Language Processing] tools such as thesaurus, dictionaries, spell checker, grammar checker, machine translation system, speech recognition system, text-to-speech, and so on. The tools that have been released are useful for the common man and will help increase the productivity and penetration of IT through Indian languages," he added.

The Indian language groups at C-DAC are already addressing their next challenge: a truly Indian translation and transliteration browser which will on-the-fly convert English Web pages into any of the Indian languages - in the Indian script. It is not a trivial task - and to get a feel of how it will work and what needs to be done, there is no better illustration than the Indian Railways' site, www.trainenquiry.com, and its linked Hindi page, which harnesses the C-DAC translated-transliteral browser technology.

An Indian language browser that converts English and foreign language text seamlessly is a work in progress at C-DAC today. But if the samples out there, such as the railway application form, are any proof, it is almost there.

Anand Parthasarathy



Printer friendly page  
Send this article to Friends by E-Mail


Subscribe | Contact Us | Archives | Contents
(Letters to the Editor should carry the full postal address)
Home | The Hindu | Business Line | Sportstar | Publications | eBooks | Images
Copyright © 2007, Frontline.

Republication or redissemination of the contents of this screen are expressly prohibited
without the written consent of Frontline