Html2text
html2text is a very simple script that uses DOM methods to convert HTML into a format similar to what would be rendered by a browser - perfect for places where you need a quick text representation. For example:
<html>
<title>Ignored Title</title>
<body>
<h1>Hello, World!</h1>
<p>This is some e-mail content.
Even though it has whitespace and newlines, the e-mail converter
will handle it correctly.
<p>Even mismatched tags.</p>
<div>A div</div>
<div>Another div</div>
<div>A div<div>within a div</div></div>
<a href="http://foo.com">A link</a>
</body>
</html>
Will be converted into:
Hello, World!
This is some e-mail content. Even though it has whitespace and newlines, the e-mail converter will handle it correctly.
Even mismatched tags.
A div
Another div
A div
within a div
[A link](http://foo.com)
Usage
$text = \NZTim\Html2text\Html2text::convert($html);
Options
Option | Default | Description |
---|---|---|
ignore_errors | false |
Set to true to ignore any XML parsing errors. |
drop_links | false |
Set to true to not render links as [http://foo.com](My Link) , but rather just My Link . |
Pass along options as a second argument to convert
, for example:
$options = [
'ignore_errors' => true,
// other options go here
];
$text = \NZTim\Html2text\Html2text::convert($html, $options);
Tests
Some very basic tests are provided in the tests/
directory. Run them with composer install && vendor/bin/phpunit
.
Troubleshooting
Class 'DOMDocument' not found
You need to install the PHP XML extension for your PHP version. e.g. apt-get install php7.1-xml
License
html2text
is licensed under MIT, making it suitable for both Eclipse and GPL projects.
This project is an updated version of Jevon Wright's original.