ms word vs html ?
Karsten M. Self
kmself at ix.netcom.com
Sun Oct 20 20:47:07 PDT 2002
on Mon, Oct 07, 2002, Colin Marquardt (colin at marquardt-home.de) wrote:
> "Karsten M. Self" <kmself at ix.netcom.com> writes:
>
> > Went to work on the manager's HTML export. It blew up both Netscape and
> > Mozilla. Ran the docs through W3C's 'tidy' utility to clean up the
> > HTML. One document finally validated (after a bunch of hand-edits to
> > fix errors). The other blew up _tidy_ itself. The HTML was _so_
> > nonstandard it blew up the HTML validator. I ended up rendering the
> > document via Lynx and re-tagging it by hand. In both cases, the tidied
> > HTML was ~30% the size of the original MS Word generated document.
>
> Wasn't there a cleaning tool especially for Word-generated HTML
> code? Called demoronizer or so... indeed:
> http://www.fourmilab.ch/webtools/demoroniser/
tidy is largely a replacement for the demoroniser. It's more current,
more capable, and generally more standards-conformant.
Peace.
--
Karsten M. Self <kmself at ix.netcom.com> http://kmself.home.netcom.com/
What Part of "Gestalt" don't you understand?
Data corrupts. Absolute data corrupts absolutely.
-- Ed Self's corollary of Atkinson's Law.