Subject: RE: Text only documents
From: Kevin McLauchlan <kmclauchlan -at- safenet-inc -dot- com>
To: 'Peter Neilson' <neilson -at- windstream -dot- net>, Khizran Kaleem <khizran -at- gmail -dot- com>, techwr-l -at- lists -dot- techwr-l -dot- com
Date: Fri, 16 Feb 2007 10:02:48 -0500
Peter Neilson wrote:
> You might think about making a filter that will automatically and
> reliably remove the unneeded parts from the HTML that you get
> from Word.
> The difficulty here is that when you change your Word source, you
> cannot be sure that your filter will still work on Word's
> corresponding new HTML. Also, if you upgrade to a new version of
> MS Word, its HTML might be substantially different, requiring a
> fully rewritten filter.
>
> If it were my project, and I had the time and/or budget, I would
> investigate maintaining the source as XML, and designing or
> purchasing software for creating the required MS Word and HTML
> output. It is extremely likely that someone has already solved
> this problem.
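
As a rough illustration of the kind of filter Peter describes, here is a minimal Python sketch. The function name strip_word_markup and the four patterns are assumptions about what Word's "Save as HTML" output typically contains (conditional comments, o:/v:/w:/m: namespace tags, Mso* classes, inline styles). They are not an exhaustive list, and, as Peter warns, they could well stop matching after a change to the source or a Word upgrade.

```python
import re

# Illustrative assumptions about typical Word-generated HTML,
# not a complete or version-proof description of it.
CONDITIONAL_COMMENT = re.compile(
    r"<!--\[if.*?<!\[endif\]-->", re.DOTALL | re.IGNORECASE
)
OFFICE_TAG = re.compile(r"</?[ovwm]:[^>]*>", re.IGNORECASE)
MSO_CLASS_ATTR = re.compile(
    r"""\s+class\s*=\s*(?:"Mso[^"]*"|'Mso[^']*'|Mso\w*)""", re.IGNORECASE
)
STYLE_ATTR = re.compile(
    r"""\s+style\s*=\s*(?:"[^"]*"|'[^']*')""", re.IGNORECASE
)


def strip_word_markup(html: str) -> str:
    """Remove some common Word-specific markup from an HTML string."""
    html = CONDITIONAL_COMMENT.sub("", html)  # <!--[if ...]> ... <![endif]-->
    html = OFFICE_TAG.sub("", html)           # <o:p>, </o:p>, <v:shape ...>, etc.
    html = MSO_CLASS_ATTR.sub("", html)       # class=MsoNormal and similar
    html = STYLE_ATTR.sub("", html)           # quoted inline style attributes
    return html
```

For example, strip_word_markup("<p class=MsoNormal style='margin:0in'>Hello<o:p></o:p></p>") returns "<p>Hello</p>". Note that this sketch drops every quoted inline style, which may be more aggressive than a given project wants.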
At this point, we're no longer really addressing the need of
the original poster, but the question is interesting nonetheless, so...
Would there be any advantage to sucking an existing Word doc into
OpenOffice.org, cleaning it up a bit, then maintaining it in
OOo's XML-based default file format?
If nothing else, that process would free the pictures in the
document from the closed Word format.
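
To make the point about freeing the pictures concrete: an OpenDocument file is an ordinary ZIP package, so embedded images can be pulled out with generic tools. The Python sketch below assumes the images sit under a Pictures/ folder inside the package, which is where OOo normally stores embedded graphics. The function name extract_pictures is made up for illustration, and linked (non-embedded) images would not be found this way.

```python
import zipfile
from pathlib import Path


def extract_pictures(odt_path, out_dir):
    """Copy embedded images out of an OpenDocument package.

    Assumes embedded images are stored under "Pictures/" in the ZIP.
    Returns the sorted list of file names that were written.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    with zipfile.ZipFile(odt_path) as package:
        for info in package.infolist():
            # Skip directory entries and anything outside Pictures/.
            if info.is_dir() or not info.filename.startswith("Pictures/"):
                continue
            target = out / Path(info.filename).name
            target.write_bytes(package.read(info))
            saved.append(target.name)
    return sorted(saved)
```

Calling extract_pictures("manual.odt", "images") (both names hypothetical) would write each embedded image into the images directory under its stored file name.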
Having the document in OOo would also result in clean HTML
if you exported from there.
And it's free-as-in-beer.... :-)
Again, possibly not helpful to Khizran, but just some talking points
of a general nature if anyone has any further thoughts.
Kevin