THE THIRD INTERNATIONAL CONFERENCE ON FORENSIC COMPUTER SCIENCE - ICoFCS 2008
Online ISBN: 978-85-65069-02-1 - Print ISSN: 1980-1114, pp 82-88
DOI: 10.5769/C2008008 and http://dx.doi.org/10.5769/C2008008
Processing Printed Digital Evidence Through Adaptations of OCR Technology (original title: Tratamento de vestígios digitais impressos através de adaptações da tecnologia de OCR)
By Daniel Miranda and Leandro Pozzebon
ABSTRACT
Forensic analysts often receive, in printed form, data that is normally produced, stored and used digitally. In some cases, the difficulty of obtaining the data in its original format, combined with the volume of printed material, is enough to motivate research into an automated way of converting the information from paper back to a digital format. This article presents an approach that leverages OCR technology to automate tasks such as reassembling complex spreadsheets from printed documents. Two experiments carried out on real cases at a Brazilian Federal Police forensics unit are presented. They demonstrate the use of free software and commercial off-the-shelf software, the Python programming language, and pattern recognition and image processing algorithms to increase productivity in the analysis of financial data.
KEYWORDS
Automated OCR, Python, script.