HTML Quote extraction appears to not be working
Mikejonesab12 opened this issue · comments
Michael Jones commented
I performed the demos of both the regular text extraction and the HTML extraction found on the README. The text extraction worked as expected. However, the HTML extraction simply returned the original input.
I am using Python 3.6.1.
Any ideas?
Sergey Obukhov commented
@Mikejonesab12 I couldn't reproduce, could you provide a code snippet?
Michael Jones commented
The code which is almost exactly from the README:
import talon
from talon import quotations
talon.init()
html = """Reply
<blockquote>
<div>
On 11-Apr-2011, at 6:54 PM, Bob <bob@example.com> wrote:
</div>
<div>
Quote
</div>
</blockquote>"""
#reply = quotations.extract_from(html, 'text/html')
reply = quotations.extract_from_html(html)
print(reply)
Printed output:
Reply
<blockquote>
<div>
On 11-Apr-2011, at 6:54 PM, Bob <bob@example.com> wrote:
</div>
<div>
Quote
</div>
</blockquote>
Wladimir Coka commented
I have same problem, not working HTML quote (gmail_quote)