Giters
mozilla
/
readability
A standalone version of the readability lib
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
8185
Watchers:
104
Issues:
540
Forks:
575
mozilla/readability Issues
H1 Headers ignored/skipped on https://www.astralcodexten.com/p/practically-a-book-review-rootclaim
Updated
18 days ago
Comments count
4
Short sentence standalone paragraphs missing from reader view of https://www.royalroad.com/fiction/63759/super-supportive/chapter/1449598/one-hundred-two-what-kind-of-wordchain
Updated
18 days ago
Comments count
2
The Luddite articles aren't readerable
Updated
18 days ago
Comments count
1
The verge: first sentence is skipped
Updated
18 days ago
Comments count
1
Reader mode cuts off last 11 paragraphs of "ChatGPT Is a Blurry JPEG of the Web" on newyorker.com
Updated
22 days ago
Comments count
4
H1 is converted into H2?
Updated
24 days ago
Comments count
1
Feature: Callback like `onRemoveNode` before a node is being removed
Updated
a month ago
Comments count
1
Incomplete rendering of page in reading mode
Updated
a month ago
Discrepancy between firefox reader mode to readability library.
Updated
a month ago
Comments count
2
Problem handling invalid HTML attributes
Updated
2 months ago
Comments count
1
Paragraph is ignored on https://oejaj.cfwb.be accessibility statement page
Updated
2 months ago
The Montreal Gazette's secondary navbar menu shows up as a list in Reader mode
Updated
2 months ago
Some of the lazy loaded images in "The Conversation" are not shown unless the whole article has been scrolled before entering reader view
Updated
2 months ago
Readability doesn't extract the right article for some pages on lgbtqnation.com
Updated
2 months ago
BBC News articles aren't readerable
Updated
2 months ago
too much content cut on europarl.europa.eu
Updated
2 months ago
Add `articleBody` to the metadata when found in the Article Schema Markup
Updated
3 months ago
Comments count
1
About clipping tables in web pages
Updated
3 months ago
Comments count
1
any idea why it can't parse this page's job description?
Updated
3 months ago
Comments count
1
Don’t remove <html> element from DOM tree
Closed
3 months ago
Comments count
1
Over focusing on a pre-formatted code block
Updated
3 months ago
tagName is case-sensitive in XHTML docs
Updated
3 months ago
Parsing Issue with DOMPurify and Readability.js
Closed
4 months ago
Comments count
2
Crashes on all Pinterest and many other websites, minimal reproduction
Closed
4 months ago
Comments count
6
Missing section titles from substack.com
Updated
4 months ago
Comments count
2
Can we get a list of images in the article as part of the json response?
Updated
4 months ago
anyway to include a list of keywords in output?
Closed
4 months ago
Comments count
4
YouTube videos not being extracted.
Updated
4 months ago
Comments count
1
bug: <ul> element removed, yet children <li> are preserved, resulting in broken markup
Updated
4 months ago
Text attached to the input form displayed
Updated
4 months ago
Why doesn't _isProbablyVisible check visibility: hidden?
Closed
5 months ago
Comments count
3
Reader mode cuts off the main content
Updated
5 months ago
Comments count
2
Not all HTML Entities are unescaped from title and other metadata (when double-escaped by websites)
Updated
6 months ago
Comments count
1
bleepingcomputer issue from ad content cnx-player-wrapper
Updated
7 months ago
Comments count
3
anyway to ignore css?
Closed
7 months ago
Comments count
1
fitchratings.com article pages display unformatted lists of currencies instead
Updated
9 months ago
Main article content at guykawasaki.com stripped
Updated
9 months ago
Readability Aggressively Strips Important Content If an HTML Element's ID and/or Class Name Uses Certain Words
Closed
9 months ago
Comments count
1
Articles on nypost.com sometimes display privacy / cookie warnings instead of article content (depends on scroll position)
Updated
9 months ago
Comments count
1
Articles on Macrumors.com display privacy / cookie warnings instead of article content
Updated
9 months ago
Comments count
2
European commission news do not display the reader view
Updated
9 months ago
H2 with strong inside is being eaten...
Updated
9 months ago
Comments count
1
Images missing from medium.com articles
Closed
a year ago
Comments count
1
I
Closed
a year ago
libc++abi: terminating due to uncaught exception of type std::out_of_range: basic_string Abort trap: 6
Closed
a year ago
Comments count
3
i'm getting errors logged by the library even though I use try/catch
Closed
a year ago
Comments count
4
Error during parse | TypeError: Cannot read properties of null
Updated
a year ago
Error during parsing | ReferenceError: li_count is not defined
Closed
a year ago
Comments count
3
Read
Closed
a year ago
Video regex should be configurable to allow videos from other sites to be retained in readability output
Closed
a year ago
Comments count
8
Previous
Next