Remove HTML tags to extract plain text. Free, private, runs in your browser.
100% private — your files never leave your browser. All processing happens locally on your device.
Whenever you export content from a CMS, scrape a web page, convert an HTML email to plain text, or prepare content for a text-only channel (SMS, push notifications, AI prompts), you need the text without the markup. Manual cleanup is tedious and error-prone — this tool does it in one paste.
Pasting HTML from unknown sources is safe here. The tool never renders the input — it only scans text — so scripts, inline event handlers, and malicious markup are harmless. Tags are stripped by pattern matching, not by creating DOM nodes, which eliminates an entire category of XSS risks.
By default, block elements like <p>, <div>, <br>, <li>, and <h1>-<h6> become line breaks in the output, so paragraphs and lists remain readable. HTML entities like `&`, `<`, and numeric references like `A` are decoded to their character forms. Both behaviors are togglable if you need raw tag-free text or the original entity syntax preserved.
Sometimes you want most tags gone but need to keep specific ones — typically <a> for hyperlinks or <strong> and <em> for emphasis. Enter those tag names in the 'Keep tags' field (comma-separated) and only those will survive. Attributes are preserved on kept tags, so hyperlinks keep their `href`.
Yes. The tool parses the input in a sandboxed regex pass and only extracts text — scripts, styles, and event handlers are discarded, never executed. Nothing is rendered.
Yes, by default. Common entities and numeric references (e.g., A) are decoded to their characters. Toggle off 'Decode entities' if you want the raw entity text preserved.
Yes. Add tag names like `a, strong, em` to the 'Keep tags' field — only those will survive the strip, and their attributes are preserved as written.