SCOPE
HOME
THE THIRD INTERNATIONAL CONFERENCE ON FORENSIC COMPUTER SCIENCE - ICoFCS 2008

Online ISBN: 978-85-65069-02-1 - Print ISSN: 1980-1114, pp 82-88

DOI: 10.5769/C2008008 and http://dx.doi.org/10.5769/C2008008


Tratamento de vestígios digitais impressos através de adaptações da tecnologia de OCR


By Daniel Miranda, and Leandro Pozzebon



To download this paper, click here.

HOME     SCOPE     VENUE     COMMITTEE     GUIDELINES     AWARD     PAPERS      CONFERENCES
To return to the "Published Papers" main page, click here.
ABSTRACT

It is common for forensic analysts to receive in printed form data that is usually produced, stored and used in digital form. In certain occasions, difficulty in obtaining the data in it's original format and the amount of printed material are enough to motivate research of an automated way to translate the information back from the paper to a digital format. This article presents an approach to leveraging OCR technology to automate tasks such as reassembling complex spreadsheets from printed documents. Two experiments carried on in real cases of a Brazilian Federal Police forensics unit are presented which demonstrate the use of free software and commercial-of-the-shelf software, the Python programming language, pattern recognition and image processing algorithms to achieve productivity increase in analyzing financial data.


KEYWORDS

OCR automatizado, Python, script.