Document Details

Document Type : Article In Conference 
Document Title : Language Identification in Document Analysis (LIDA)
Document Title (Arabic) : التعرف على اللغة في تحليل الوثائق (LIDA)
Subject : Identifying whether a text is written in Arabic or English
Document Language : English 
Abstract : This paper presents a technique for discriminating between texts written in Arabic script and texts written in Latin script. The technique addresses the language identification problem at the word level and at the text-line level, using an algorithm based on horizontal projection profiles. The paper presents this as a new language identification algorithm for determining the languages of a document. The approach may be used to identify the language in many applications, including encoding of document pages, language-specific web crawling, information retrieval, natural language processing, text mining, translation service bureau software, spell-checking software, stemming or morphological analyzers, and knowledge management systems.
Conference Name : International Conference on Circuits, Signals, and Systems
Publishing Year : 1424 AH (2004 AD)
Article Type : Article 
Conference Place : Florida, USA
Organizing Body : IASTED 
Added Date : Thursday, March 3, 2011 

Researchers

Researcher Name (Arabic) : كمال جمبي
Researcher Name (English) : Jambi, Kamal
Researcher Type : Researcher
Dr Grade : Doctorate
Email : kjambi@kau.edu.sa

Files

File Name : 29208.docx
Type : docx
