Giters
cfpb
/
crawl-cfgov
Archive the HTML of consumerfinance.gov daily
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
9
Watchers:
25
Issues:
9
Forks:
7
cfpb/crawl-cfgov Issues
Long URLs get truncated by wget
Updated
2 years ago
Comments count
1
Question marks in filenames prevent cloning repository on Windows
Updated
3 years ago
Comments count
1
Make it easier to map search results to URLs
Updated
4 years ago
Comments count
1
Varying CSS filenames cause superfluous diffs
Closed
4 years ago
Commit messages lack new files and summary numbers
Closed
4 years ago
Pages with email signups report meaningless diffs
Closed
4 years ago
Comments count
2
Only pages from target domain should be committed
Closed
4 years ago
Comments count
1
New Relic script causes diffs on every page
Closed
4 years ago
Comments count
9
Crawl seems to be missing some pages
Closed
4 years ago
Comments count
3