[vox-tech] docbook to text

Peter Jay Salzman vox-tech@lists.lugod.org
Thu, 20 Jun 2002 09:51:28 -0700


i'm trying to convert docbook sgml to text, and am having a terrible
time of it:



i tried using sgml2txt, but it complained that it didn't know how to
handle docbook.  it suggested that i use jade.

ok.

reading through jade's help files, it appears that jade doesn't know how
to convert sgml to text.  this is starting to feel like UC davis's
bureacracy.

ok.

i can convert docbook to html (obviously), so i try html2text.  the
results have wierd underscoring things in it.  reading through the docs,
there's a -nobs option which produces pure text.

ok.

i try html2text with -nobs.  the results stink.  the formating is all
wrong.  there *should* be a blank line between an <H[1-4]> line and the
following text.  verbatim text is all screwed up.

ok.

after some searching, i find 'man html2textrc'.   yuck.  i _really_
don't want to read through this.  the rc file is complicated.  i don't
want to spend this amount of time for something as simple as converting
html to text.  i shouldn't have to spend more than 5 minutes on such a
simple operation.

ok.

i can convert man pages to text, so i use docbook-to-man and convert the
sgml file to a man page.  the results are worse than awful.  it's
completely unreadable.



are you feeling my pain yet?  this should be simple, and it's turning
out to be a nightmare.

does anyone have any ideas on how to convert a docbook file to text?

alternatively, if anyone has an .html2textrc file suitable for FAQ style
documents, i'd love to have a copy of your dotfile.

pete


--
GPG Fingerprint: B9F1 6CF3 47C4 7CD8 D33E  70A9 A3B9 1945 67EA 951D