Preliminary simulation of robot on script detection from camera images

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Here is a lot of information is available in a photo images such as advertisement, book cover, banners and many more. Images with text are widely available because of efficiency and low cost of digital portable devices, which provide chances to manual transfers of a document images. Analysis techniques of manually transferred images could serves as a reference and starting point of further technique development but the method cannot be used directly on images captured using cameras. Camera pictures can be problematic with blurry, low resolution, distorted, disorientated images, apart from, complex interaction between content and background. Therefore, Optical Character Recognition (OCR) technique was used to change printed text into editable text that is convenient and accessible in various applications. However, OCR accuracy depends on text pre-processing and segmentation algorithm. Hence, this manuscript to introduce OCR Tesseract method and the history of OCR Open Source Tesseract system, its architecture and outcome of trial on various type of images to determine efficiency of OCR Tesseract and accuracy ratio of extracted images from camera.

Original languageEnglish
Title of host publicationAdvances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings
EditorsHalimah Badioze Zaman, Nazlena Mohamad Ali, Mohammad Nazir Ahmad, Alan F. Smeaton, Timothy K. Shih, Sergio Velastin, Tada Terutoshi
PublisherSpringer
Pages327-342
Number of pages16
ISBN (Print)9783030340315
DOIs
Publication statusPublished - 1 Jan 2019
Event6th International Conference on Advances in Visual Informatics, IVIC 2019 - Bangi, Malaysia
Duration: 19 Nov 201921 Nov 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11870 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th International Conference on Advances in Visual Informatics, IVIC 2019
CountryMalaysia
CityBangi
Period19/11/1921/11/19

Fingerprint

Optical character recognition
Robot
Camera
Cameras
Character Recognition
Robots
Tesseract
Simulation
System Architecture
Open Source
Preprocessing
Processing
Segmentation
Cover
Costs
Text
Interaction

Keywords

  • Camera
  • OCR
  • Photos
  • Tesseract

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Wydyanto, Nayan, N. M., & Sulaiman, R. (2019). Preliminary simulation of robot on script detection from camera images. In H. Badioze Zaman, N. Mohamad Ali, M. N. Ahmad, A. F. Smeaton, T. K. Shih, S. Velastin, & T. Terutoshi (Eds.), Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings (pp. 327-342). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11870 LNCS). Springer. https://doi.org/10.1007/978-3-030-34032-2_30

Preliminary simulation of robot on script detection from camera images. / Wydyanto, ; Nayan, Norshita Mat; Sulaiman, Riza.

Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings. ed. / Halimah Badioze Zaman; Nazlena Mohamad Ali; Mohammad Nazir Ahmad; Alan F. Smeaton; Timothy K. Shih; Sergio Velastin; Tada Terutoshi. Springer, 2019. p. 327-342 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11870 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wydyanto, , Nayan, NM & Sulaiman, R 2019, Preliminary simulation of robot on script detection from camera images. in H Badioze Zaman, N Mohamad Ali, MN Ahmad, AF Smeaton, TK Shih, S Velastin & T Terutoshi (eds), Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11870 LNCS, Springer, pp. 327-342, 6th International Conference on Advances in Visual Informatics, IVIC 2019, Bangi, Malaysia, 19/11/19. https://doi.org/10.1007/978-3-030-34032-2_30
Wydyanto , Nayan NM, Sulaiman R. Preliminary simulation of robot on script detection from camera images. In Badioze Zaman H, Mohamad Ali N, Ahmad MN, Smeaton AF, Shih TK, Velastin S, Terutoshi T, editors, Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings. Springer. 2019. p. 327-342. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-030-34032-2_30
Wydyanto, ; Nayan, Norshita Mat ; Sulaiman, Riza. / Preliminary simulation of robot on script detection from camera images. Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings. editor / Halimah Badioze Zaman ; Nazlena Mohamad Ali ; Mohammad Nazir Ahmad ; Alan F. Smeaton ; Timothy K. Shih ; Sergio Velastin ; Tada Terutoshi. Springer, 2019. pp. 327-342 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{c5fcad264ed54571b8c54e3b2d7663d5,
title = "Preliminary simulation of robot on script detection from camera images",
abstract = "Here is a lot of information is available in a photo images such as advertisement, book cover, banners and many more. Images with text are widely available because of efficiency and low cost of digital portable devices, which provide chances to manual transfers of a document images. Analysis techniques of manually transferred images could serves as a reference and starting point of further technique development but the method cannot be used directly on images captured using cameras. Camera pictures can be problematic with blurry, low resolution, distorted, disorientated images, apart from, complex interaction between content and background. Therefore, Optical Character Recognition (OCR) technique was used to change printed text into editable text that is convenient and accessible in various applications. However, OCR accuracy depends on text pre-processing and segmentation algorithm. Hence, this manuscript to introduce OCR Tesseract method and the history of OCR Open Source Tesseract system, its architecture and outcome of trial on various type of images to determine efficiency of OCR Tesseract and accuracy ratio of extracted images from camera.",
keywords = "Camera, OCR, Photos, Tesseract",
author = "Wydyanto and Nayan, {Norshita Mat} and Riza Sulaiman",
year = "2019",
month = "1",
day = "1",
doi = "10.1007/978-3-030-34032-2_30",
language = "English",
isbn = "9783030340315",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer",
pages = "327--342",
editor = "{Badioze Zaman}, Halimah and {Mohamad Ali}, Nazlena and Ahmad, {Mohammad Nazir} and Smeaton, {Alan F.} and Shih, {Timothy K.} and Sergio Velastin and Tada Terutoshi",
booktitle = "Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings",

}

TY - GEN

T1 - Preliminary simulation of robot on script detection from camera images

AU - Wydyanto,

AU - Nayan, Norshita Mat

AU - Sulaiman, Riza

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Here is a lot of information is available in a photo images such as advertisement, book cover, banners and many more. Images with text are widely available because of efficiency and low cost of digital portable devices, which provide chances to manual transfers of a document images. Analysis techniques of manually transferred images could serves as a reference and starting point of further technique development but the method cannot be used directly on images captured using cameras. Camera pictures can be problematic with blurry, low resolution, distorted, disorientated images, apart from, complex interaction between content and background. Therefore, Optical Character Recognition (OCR) technique was used to change printed text into editable text that is convenient and accessible in various applications. However, OCR accuracy depends on text pre-processing and segmentation algorithm. Hence, this manuscript to introduce OCR Tesseract method and the history of OCR Open Source Tesseract system, its architecture and outcome of trial on various type of images to determine efficiency of OCR Tesseract and accuracy ratio of extracted images from camera.

AB - Here is a lot of information is available in a photo images such as advertisement, book cover, banners and many more. Images with text are widely available because of efficiency and low cost of digital portable devices, which provide chances to manual transfers of a document images. Analysis techniques of manually transferred images could serves as a reference and starting point of further technique development but the method cannot be used directly on images captured using cameras. Camera pictures can be problematic with blurry, low resolution, distorted, disorientated images, apart from, complex interaction between content and background. Therefore, Optical Character Recognition (OCR) technique was used to change printed text into editable text that is convenient and accessible in various applications. However, OCR accuracy depends on text pre-processing and segmentation algorithm. Hence, this manuscript to introduce OCR Tesseract method and the history of OCR Open Source Tesseract system, its architecture and outcome of trial on various type of images to determine efficiency of OCR Tesseract and accuracy ratio of extracted images from camera.

KW - Camera

KW - OCR

KW - Photos

KW - Tesseract

UR - http://www.scopus.com/inward/record.url?scp=85077895765&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85077895765&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-34032-2_30

DO - 10.1007/978-3-030-34032-2_30

M3 - Conference contribution

AN - SCOPUS:85077895765

SN - 9783030340315

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 327

EP - 342

BT - Advances in Visual Informatics - 6th International Visual Informatics Conference, IVIC 2019, Proceedings

A2 - Badioze Zaman, Halimah

A2 - Mohamad Ali, Nazlena

A2 - Ahmad, Mohammad Nazir

A2 - Smeaton, Alan F.

A2 - Shih, Timothy K.

A2 - Velastin, Sergio

A2 - Terutoshi, Tada

PB - Springer

ER -