What this tool does
- Removes all HTML/XML tags from input text
- Optionally preserves block-level line breaks (br, p, div, h1-h6, li, tr)
- Decodes common HTML entities (&, <, , ", numeric entities)
- Collapses multiple spaces/newlines into single spaces
- "Keep tags" lets you whitelist tags to preserve (e.g. b,i,strong for inline formatting)
Use cases
- Extract clean text from scraped web pages
- Clean up rich text editor output for plain-text use
- Prepare content for email or SMS
- Migrate content between CMSs without HTML clutter