Introduction
A tool to strip Microsoft’s proprietary tags and other superfluous noise from Word-generated HTML documents, leaving all the basic goodness intact. File sizes are greatly reduced, and the returned markup is easier to read, revise and employ. This tool is available as an API which you can call from your own web applications and other software.
Features
Actions include:
- Removes Microsoft-only properties
- Removes old-fashioned properties like font, del and ins
- Removes repeated new lines and spaces
- Cleans up tables by removing cellspacing, cellpadding and width parameters
- Adds newlines after headings and paragraphs for improved readability
- Replaces with spaces but only within text
- Leaves all comments intact
API
This tool is also available via a web-based API so that you can use it from your own applications and websites. learn more about the API or sign up now

English
Deutsch
français
svenska

Word HTML cleaner