Printer Friendly
The Free Library
14,505,585 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Scientists developing software to scan Arabic documents.


CNN.com reports that computer scientists are developing software to scan Arabic documents, including handwritten hand·write  
tr.v. hand·wrote , hand·writ·ten , hand·writ·ing, hand·writes
To write by hand.



[Back-formation from handwritten.]

Adj. 1.
 ones, for specific words and phrases Words and Phrases®

A multivolume set of law books published by West Group containing thousands of judicial definitions of words and phrases, arranged alphabetically, from 1658 to the present.
. The software should expand access to modern and ancient Arabic manuscripts as well as help with intelligence gathering. It will allow Arabic writings to be digitized and posted on the Web.

"The whole Internet is skewed toward people who speak English," said Venu Govindaraju, director of the Center for Unified Biometrics and Sensors at the University at Buffalo in New York, where the software is being developed.

He explained that if optical character recognition optical character recognition (OCR), method for the machine-reading of typeset, typed, and, in some cases, hand-printed letters, numbers, and symbols using optical sensing and a computer.  software is not developed for a particular language, "then all the classic texts in that language will disappear into oblivion.

Bill Young, an Arab language specialist at the University of Maryland University of Maryland can refer to:
  • University of Maryland, College Park, a research-extensive and flagship university; when the term "University of Maryland" is used without any qualification, it generally refers to this school
, told CNN CNN
 or Cable News Network

Subsidiary company of Turner Broadcasting Systems. It was created by Ted Turner in 1980 to present 24-hour live news broadcasts, using satellites to transmit reports from news bureaus around the world.
 that the software could help scan through masses of typed pages for specific names or words; however, handwritten Arabic presents serious challenges for computers. Some Arabic words can be written in more than one way, so the software would have to be given instructions about possible variations.

According to Govindaraju, the Arabic software would take into account the fact that characters may take different forms depending on where within a word they appear, and that Arabic vowels are pronounced but often not written.
COPYRIGHT 2005 Association of Records Managers & Administrators (ARMA)
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2005, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Title Annotation:News, Trends & Analysis
Author:Swartz, Nikki
Publication:Information Management Journal
Article Type:Brief Article
Geographic Code:1USA
Date:Mar 1, 2005
Words:211
Previous Article:South Dakota bill may restrict access to vital record.(News, Trends & Analysis)
Next Article:FBI dumps information-sharing software.(News, Trends & Analysis)
Topics:



Related Articles
Aesthetics of the new novel: epistemological rupture and anti-lyrical poetics.(Brief Article)
EHP children's health page. (EHP net).
Ultrasound scans and brain changes. (Pregnancy & Birth).(Brief Article)
Using PDF files for case and practice management: you can search, edit, annotate, share, and manipulate documents stored electronically in portable...
Medical webwatch.(websites)
E-audit: tools evolving to help you find your way along the paperless audit trail.
AIDS information overload: what you can do now.
Introduction.
Daily news alerts selected by AIDS Treatment News: www.connotea.org/group/aidsnew.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles