Subject: Re: Text to xml: Encoding issues
From: Sandy Harris <sandyinchina -at- gmail -dot- com>
To: TECHWR-L <techwr-l -at- lists -dot- techwr-l -dot- com>
Date: Wed, 28 Dec 2005 06:43:12 +0800
Paul <paul -dot- inbar -at- intel -dot- com> wrote:
> Can anyone point me to information dealing with character encoding? ...
> For instance, a common occurrence is that the developers copy and
> paste from Word and include smart quotes. ...
> I can't figure out how to get Perl to be able to do this automatically.
It is a Perl script designed to "correct moronic and gratuitously incompatible
HTML generated by Microsoft applications". One thing it fixes is "smart"
quotes.
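
For anyone who wants to see the general idea rather than run an existing script, here is a minimal sketch of one way to handle pasted smart quotes. It is written in Python rather than Perl, and it is not taken from the script mentioned above; the names SMART_TO_ASCII, plain_punctuation and decode_pasted are made up for illustration. It assumes the pasted text arrives either as UTF-8 or as Windows-1252 bytes, which is a guess about the input and worth checking against your own files.

```python
# Hypothetical mapping (an assumption, not from any existing script):
# typographic characters Word commonly inserts -> plain ASCII equivalents.
SMART_TO_ASCII = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark / apostrophe
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "--",   # em dash
    "\u2026": "...",  # horizontal ellipsis
}
_TABLE = str.maketrans(SMART_TO_ASCII)


def plain_punctuation(text: str) -> str:
    """Replace the typographic characters listed above with ASCII."""
    return text.translate(_TABLE)


def decode_pasted(raw: bytes) -> str:
    """Decode bytes as UTF-8, falling back to Windows-1252.

    Assumption: input that is not valid UTF-8 came from a Windows
    application and uses code page 1252. Bytes undefined in that code
    page become U+FFFD instead of raising an error.
    """
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return raw.decode("cp1252", errors="replace")


if __name__ == "__main__":
    pasted = b"\x93It\x92s done\x94"  # Windows-1252 bytes as pasted from Word
    print(plain_punctuation(decode_pasted(pasted)))
```

If the XML is declared as UTF-8 you may not need to flatten the quotes at all; decoding the input correctly and writing it back out as UTF-8 may be enough. Flattening to ASCII is simply the most conservative choice when you do not control how the output is consumed.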
--
Sandy Harris
Zhuhai, Guangdong, China