1 |
Improving Scene Text Recognition for Indian Languages with Transfer Learning and Font Diversity
|
|
|
|
In: Journal of Imaging; Volume 8; Issue 4; Pages: 86 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
TEXT EXTRACTION FROM IMAGES USING NEURAL NETWORKS
|
|
|
|
Abstract:
Most western languages have witnessed the power of Artificial Intelligence (AI) in one other form. Primary fact for this achievement is due to the efforts of several researchers contributing to the field of computational linguistics. However, there are many languages in the World which has a great history and abundant literature but not many research activities due to many factors such as lack of motivation, non- availability of open-source corpora and so on. Telugu is one such language where there is a lack of efforts towards the digitization of language. The focus of this research is to extract text from the images to produce corpora for enabling computational linguistics and also to conserve the literature. Deep Learning with Neural Networks has proven solutions in the same domain.Optical Character Recognition is the solution adopted by western languages for digitization. However the same cannot be applied towards Telugu due to the complexity of scripts and the ambiguity in dialects. To address this issue, in this research we built a neural network system that can be adapted later for any such languages like Telugu. By adapting neural networks in this research we achieved an efficiency of 90 percent. Segmentation of characters is taken care by neural networks while we only specified the segmentation on word level. A comparative study of the system we developed and commercial API's is made and our system is proven to be more accurate.
|
|
Keyword:
Indic Scripts; Neural networks (Computer science); OCR; Programming languages (Electronic computers); Text Extraction
|
|
URL: http://hdl.handle.net/10342/7630
|
|
BASE
|
|
Hide details
|
|
6 |
Code-switching between structural and sociolinguistic perspectives ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Code-switching between structural and sociolinguistic perspectives ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
A descriptive grammar of Raji
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
9 |
Capturing Breathy Voice: Durational Measures of Oral Stops in Marathi
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Capturing Breathy Voice: Durational Measures of Oral Stops in Marathi
|
|
|
|
In: Kansas Working Papers in Linguistics, Vol 33, Iss , Pp 27-46 (2012) (2012)
|
|
BASE
|
|
Show details
|
|
11 |
The Brahmi writing system : cross-fertilizing epigraphy, archaeology and linguistics
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
13 |
An encyclopaedic dictionary of Indian languages, literature
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
14 |
Indien und Zentralasien : Sprach- und Kulturkontakt; Vorträge des Göttinger Symposions vom 7. bis 10. Mai 2001
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
15 |
An ethnolinguistic profile of Eastern India : a case of South Orissa
|
|
Ghosh, Arun. - Burdwan : Dept. of Bengali (D.S.A.), Univ. of Burdwan, 2003
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
16 |
The language of the Jarawa : phonology
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
17 |
Badaga : a Dravidian language
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
18 |
Bilingualism and trilingualism : Table C-8 ; India, States and Union Territories
|
|
|
|
MPI-SHH Linguistik
|
|
Show details
|
|
|
|