renpingsheng's repositories
haodf-offline-crawler-scripts
好大夫网站离线爬虫程序集
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
akshare
AkShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
baidumap
百度迁徙指数以及流出去向,全国所有地级市精度
ChatYuan
ChatYuan: Large Language Model for Dialogue in Chinese and English
CMB
CMB, A Comprehensive Medical Benchmark in Chinese
COVIDExposureIndices
Exposure indices derived from PlaceIQ movement data by Couture, Dingel, Green, Handbury, and Williams
crawler-www.haodf.com
给公司写的爬虫程序,实现功能为爬取好大夫网医院列表
EagleEye
Stalk your Friends. Find their Instagram, FB and Twitter Profiles using Image Recognition and Reverse Image Search.
fastai-petfinder
Merging image, tabular and text data in a neural network with fastai
Firefly
Firefly(流萤): 中文对话式大语言模型(全量微调+QLoRA),支持微调Baichuan2、CodeLlama、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya、Bloom等大模型
haipproxy
:sparkling_heart: High available distributed ip proxy pool, powerd by Scrapy and Redis
haodf-1
好大夫在线问答爬取
house
有完整版的PDF下载。
ica
Python implementation of the Iterative Classification Algorithm
Iterative-Classification
Implemention of an iterative model for making appropriate predictions using the contents of the links and relations that exist between the links in order to classify unlabeled data more accurately as compared to Content-based Bayesian classification.
Medical_NLP
Medical NLP Competition, dataset, large models, paper 医疗NLP领域 比赛,数据集,大模型,论文,工具包
mostly-harmless-replication
Replication of tables and figures from "Mostly Harmless Econometrics" in Stata, R, Python and Julia.
nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
Pre-modern_Chinese_corpus_dataset
一个近代汉语语料库数据集 This is a pre-modern Chinese ( From Song dynasty in 10th century AD to Republic of China in the early 20th Century ) language corpus.These language resources are all txt format,arranged by Dynasty(Song,Yuan,Ming,Early-Qing,Late-Qing and Republic of China).The relevant authors' information and types of literature also have been labelled.
ProxyPool
An Efficient ProxyPool with Getter, Tester and Server
sampler
Tool for shell commands execution, visualization and alerting. Configured with a simple YAML file.
scylla
Intelligent proxy pool for Humans™ (Maintainer needed)
SEC-risks
This project provides code that can extract Item 1A. Risk factors from annual reports form 10-k filed with the Securities and Exchange Commission (SEC) of USA.
Ultimate-Facebook-Scraper
🤖 A bot which scrapes almost everything about a Facebook user's profile including all public posts/statuses available on the user's timeline, uploaded photos, tagged photos, videos, friends list and their profile photos (including Followers, Following, Work Friends, College Friends etc).
weixin_crawler
高效微信公众号全部历史文章和阅读数据爬虫powered by scrapy 微信公众号爬虫 微信采集 公众号采集 微信爬虫
ZhuguanDetection
Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。