Giters
scrapy
/
scrapely
A pure-python HTML screen-scraping library
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
1859
Watchers:
123
Issues:
53
Forks:
315
scrapy/scrapely Issues
Install not working with Python 3.8.5
Updated
4 years ago
Is project alive?
Updated
4 years ago
Installing via pip on Python 3.7 fails
Updated
5 years ago
Comments count
3
ZeroDivisionError when training with zero-length data
Updated
5 years ago
Comments count
4
ValueError: Buffer dtype mismatch, expected 'int64_t' but got 'long' i on Windows 10
Updated
5 years ago
Comments count
1
Use in production
Updated
5 years ago
ValueError: Buffer dtype mismatch, expected 'int64_t' but got 'long'
Updated
5 years ago
Comments count
17
Installing pip on Python 3.7 still fails
Updated
5 years ago
Comments count
2
Interest in other wrapper induction techniques?
Updated
6 years ago
Wrong tag getting annotated
Updated
6 years ago
Comments count
1
scrapely.template.FragmentNotFound: Fragment not found annotating 'price' using: <function func at 0x...>
Updated
6 years ago
Question: automate training
Updated
6 years ago
Comments count
2
Unable to pull in https
Updated
6 years ago
error [SSL: CERTIFICATE_VERIFY_FAILED] on travel sites
Updated
6 years ago
Comments count
3
Extract from javascript?
Updated
7 years ago
safehtml should ensure tabular content safety
Closed
7 years ago
Comments count
3
How to scrape within Python using generated JSON from command line?
Updated
7 years ago
Comments count
1
Please, release a version with a better python3 support
Closed
8 years ago
Comments count
2
Incorrect cleaning of <img> tag
Closed
8 years ago
Comments count
4
Import Error: Cannot import name 'Scraper'
Closed
9 years ago
Comments count
3
benchmarks?
Closed
8 years ago
Comments count
1
Duplicate Values (but valid) in the same html
Closed
8 years ago
Comments count
2
Output processor for @href and @src (Image Field) : to remove whitespace characters if present
Closed
8 years ago
Comments count
1
Drop Python 2.6 support
Closed
8 years ago
Comments count
1
How to extract a list of items
Updated
8 years ago
Is really Python 3 supported?
Closed
8 years ago
Comments count
3
Can I train the scraper on multiple pages so given a certain page it chooses automatically the template?
Updated
9 years ago
Comments count
1
I want to scrap all the contact list of different food industry website in a specific city?
Updated
9 years ago
Comments count
1
Multiple matches?
Updated
9 years ago
safehtml omit some important (all) attributes of tags
Updated
9 years ago
Comments count
2
add a tag for 0.10 release
Closed
9 years ago
Comments count
2
remove most Scrapy mentions from the README
Closed
9 years ago
Comments count
2
Python 3 support
Closed
9 years ago
Comments count
1
Obtaining sectioned article text
Updated
10 years ago
Comments count
1
Does the order of annotations matter - Weird output
Updated
10 years ago
l
Closed
11 years ago
Comments count
9
Random failing doctests
Closed
10 years ago
problem with bad encoding and BOM?
Updated
10 years ago
Comments count
2
Is this still an active project?
Closed
10 years ago
Html page containing more than one single entity. How to annotate?
Updated
10 years ago
What you mean with "The training implementation is currently very simple and is only provided for references purposes, to make it easier to test Scrapely and play with it. "
Updated
10 years ago
How to use use html data instead of direct URLs
Closed
11 years ago
Comments count
3
Provide method for parsing HTML that has already been downloaded by external libraries.
Closed
11 years ago
Comments count
1
tool.parse_criteria normalizes whitespace
Closed
11 years ago
Comments count
1
README Usage (command line tool) correction
Closed
11 years ago
possible to pass scrapy response object to scrapely?
Closed
11 years ago
Comments count
1
Correct example at README.rst
Closed
12 years ago
Specifying integer values in the data dict
Closed
12 years ago
Support for passing HTML, not just URLs
Closed
12 years ago
Comments count
2
Slow Extraction Times
Closed
12 years ago
Comments count
4
Previous
Next