Optical Character Recognition (OCR)  

Often abbreviated as OCR refers to the branch of computer science that involves reading text from paper and translating the images into a form that the computer can manipulate (for example, into ASCII codes).

An OCR system enables you to take a document, book or a magazine article, feed it directly into an electronic computer file, and then edit the file using a word processing software. All OCR systems include an optical scanner for reading text, and sophisticated software for analyzing images.

Most modern document management software will have OCR capabilities incorporated readily or can be customized to do so. Most OCR systems use a combination of hardware (specialized circuit boards) and software to recognize characters, although some inexpensive systems do it entirely through software. Advanced OCR systems can read text in large variety of fonts, but they still have difficulty with handwritten text even though there efforts being made with new technology to have an improved recognition of penmanship.

The potential of OCR systems is enormous because they enable users to harness the power of computers to access printed documents that can not be produced from the original electronic source. OCR is already being used widely in the legal profession, where searches that once required hours or days can now be accomplished in a few seconds.