TechWhirl (TECHWR-L) is a resource for technical writing and technical communications professionals of all experience levels and in all industries to share their experiences and acquire information.
For two decades, technical communicators have turned to TechWhirl to ask and answer questions about the always-changing world of technical communications, such as tools, skills, career paths, methodologies, and emerging industries. The TechWhirl Archives and magazine, created for, by and about technical writers, offer a wealth of knowledge to everyone with an interest in any aspect of technical communications.
I am not sure I fully understand what you want to do. If you want to
just print out the text between tags, have you considered using XSLT? Do
you know what tags you are interested in? You would then need an xslt
processor such as xsltproc. Does you linux system have one installed?
There are also Perl modules that help with retrieving information from
xml files.
For the recursion, you could use a Perl script to get a list of all the
files, and then call the xslt processor from inside the perl script to
process each file.
Paul
-----Original Message-----
From: techwr-l-bounces+paul -dot- inbar=intel -dot- com -at- lists -dot- techwr-l -dot- com
[mailto:techwr-l-bounces+paul -dot- inbar=intel -dot- com -at- lists -dot- techwr-l -dot- com] On
Behalf Of Ed
Sent: Monday, September 22, 2008 1:21 PM
To: techwr-l List
Subject: Searching Text Files
I'm looking for a forum or group where I could work through a
particular problem. This list (techwr-l) may be a good choice. If not,
please recommend a list, group, or resource.
There is a directory of xml text files that I'd like to extract
information from, and output to a textfile. The information appears
between an open/close tag, for instance, <acroterm>SEC</acroterm>. Of
course there are a myriad of possibilities, so there must be some
wild-carding too. And then there is the recursion necessary to process
all files of a particular type, such as *.xml.
I know there are Windows programs I could buy to do this. However, I'd
prefer to stay with a command line solution, like Linux or DOS.
I'll continue searching.
--
Ed
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ComponentOne Doc-To-Help gives you everything you need to author and
publish quality Help, Web, and print content. Perfect for technical
authors, developers, and policy writers. Download a FREE trial. http://www.componentone.com/DocToHelp/
True single source, conditional content, PDF export, modular help.
Help & Manual is the most powerful authoring tool for technical
documentation. Boost your productivity! http://www.helpandmanual.com
---
You are currently subscribed to TECHWR-L as paul -dot- inbar -at- intel -dot- com -dot-
To subscribe, send a blank email to techwr-l-join -at- lists -dot- techwr-l -dot- com
Send administrative questions to admin -at- techwr-l -dot- com -dot- Visit http://www.techwr-l.com/ for more resources and info.
---------------------------------------------------------------------
Intel Israel (74) Limited
This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.
ComponentOne Doc-To-Help gives you everything you need to author and
publish quality Help, Web, and print content. Perfect for technical
authors, developers, and policy writers. Download a FREE trial. http://www.componentone.com/DocToHelp/
True single source, conditional content, PDF export, modular help.
Help & Manual is the most powerful authoring tool for technical
documentation. Boost your productivity! http://www.helpandmanual.com
---
You are currently subscribed to TECHWR-L as archive -at- web -dot- techwr-l -dot- com -dot-