TechWhirl (TECHWR-L) is a resource for technical writing and technical communications professionals of all experience levels and in all industries to share their experiences and acquire information.
For two decades, technical communicators have turned to TechWhirl to ask and answer questions about the always-changing world of technical communications, such as tools, skills, career paths, methodologies, and emerging industries. The TechWhirl Archives and magazine, created for, by and about technical writers, offer a wealth of knowledge to everyone with an interest in any aspect of technical communications.
Subject:Re: OCR on PDF From:Scott Turner <sturner -at- airmail -dot- net> To:"TECHWR-L" <techwr-l -at- lists -dot- raycomm -dot- com> Date:Wed, 20 Dec 2000 13:36:55 -0600
I would try Acrobat 4.0.
After that, if the formmtting is not particularly important, extract the
text with GhostView/GhostScript.
Scott
"Elliott C. Evans" wrote:
>
> A potential customer sent us a PDF file of a 1711 page document
> which was not properly scanned. Instead of being searchable text,
> it seems that every page is one big graphic. Is there any way to
> run Acrobat's OCR software on an existing PDF file without
> printing it out and scanning it back in? Choosing "Document >
> Capture Pages" with the file open results in a error for every
> page. (Using Exchange 3.0 on Windows 98)
>
> --
> Elliott C. "Eeyore" Evans eeyore+ -at- cmu -dot- edu
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Develop HTML-based Help with Macromedia Dreamweaver! (STC Discount.)
**NEW DATE/LOCATION!** January 16-17, 2001, New York, NY. http://www.weisner.com/training/dreamweaver_help.htm or 800-646-9989.
Take XML and Tech Writing courses online! Our instructor-led courses
(4-6 hrs/wk) give you "hands on" experience at your convenience. STC members
get 20% off! http://www.online-learning.com/index.html.
---
You are currently subscribed to techwr-l as: archive -at- raycomm -dot- com
To unsubscribe send a blank email to leave-techwr-l-obscured -at- lists -dot- raycomm -dot- com
Send administrative questions to ejray -at- raycomm -dot- com -dot- Visit http://www.raycomm.com/techwhirl/ for more resources and info.