Extensible System for Optical Character Recognition of Maintenance Documents

by John Anthony Labarga, Amardeep Singh, Vera Zaychik Moffitt

Abstract

In the course of maintenance and operations, equipment operators and manufacturers frequently generate large volumes of paper documents. This is particularly common when maintaining legacy systems and when external factors (e.g., security concerns, environment, training procedures) make it infeasible to record data in a computer system in real time. To support analytics or automated monitoring, these documents must later be converted to digital copies that can be ingested into a database. This paper describes a flexible system for converting paper forms into digital documents through Optical Character Recognition (OCR) using open source tools and packages. The system allows for the incorporation of business rules and processes that deliver high-fidelity digital copies.

Archived Files and Locations

application/pdf  817.4 kB
file_mluzc4bznjabpho5swmo45esna
papers.phmsociety.org (publisher)
web.archive.org (webarchive)
Type  article-journal
Stage   published
Date   2018-09-24
Proceedings Metadata
ISSN-L:  2325-0178