DreamingTech's repositories
spider_code_lib
spider code snippets, projects, knowleages, etc.
bloom_filter
auto-increment memory bloom filter and redis bloom filter for scrapy 布隆过滤器 内存型布隆过滤器 redis布隆过滤器 自增加布隆过滤器
CurrentTimedRotatingFileHandler
rewrited TimedRotatingFileHandler, CurrentTimedRotatingFileHandler
imoocgocrawler
distributed crawler project from imooc go course by ccmouse. 分布式爬虫项目-Google资深工程师深度讲解Go语言
jobbole_article
伯乐在线scrapy-redis+docker分布式爬虫
py_spider_projects
小爬虫集合
.github.io
DreaMingTech
chatgpt-mirai-qq-bot
🚀 一键部署!真正的 AI 聊天机器人!支持ChatGPT、文心一言、讯飞星火、Bing、Bard、ChatGLM、POE,多账号,人设调教,虚拟女仆、图片渲染、语音发送 | 支持 QQ、Telegram、Discord、微信 等平台
CookiePool
一个强大的Cookie池项目,融合scrapy/requests/chrome储存cookie/cookie字符串/selenium等cookie形式
CookiesPool
Cookies Pool
dhash
Python library to calculate the difference hash (perceptual hash) for a given image, useful for detecting duplicates
html_normalize
normalize html page to a standrad page
imooc_194_fisher
imooc fisher 鱼书
imooc_399_movie_cat
imooc flask movie_cat
Jikipedia_Spider
python selenium+mitmproxy实现 小鸡词典爬虫 详见博客:https://www.cnblogs.com/FHC1994/p/12171265.html
job_spider_project
招聘网站爬虫
lunar
农历与公历相互转换的模块,支持农历之间的加减运算,并提供生肖、干支等,支持1900-2100年。
NodeSandbox
Node补环境框架
notes
工作学习笔记
phantomflix
Python Netflix API Metadata & Downloader for Windows and Linux
practical-pytorch
DEPRECATED and not maintained - see official repo at https://github.com/pytorch/tutorials
puppeteer-extra
💯 Teach puppeteer new tricks through plugins.
pyreBloom
Fast Redis Bloom Filters in Python
python-bloomfilter
Scalable Bloom Filter implemented in Python
readability
📚 Turn any web page into a clean view
Scrapy_Redis_Bloomfilter
基于Redis的Bloomfilter去重,并将其扩展到Scrapy框架。
spider_plus_fw
scrapy_plus, spider_plus, 仿 scrapy 爬虫框架