Deng Xudong's starred repositories
TikTokDownloader
完全免费开源,基于 AIOHTTP 模块实现:TikTok 主页/视频/图集/原声;抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具
EconML
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
awesome-ggplot2
A curated list of awesome ggplot2 tutorials, packages etc.
performance
:muscle: Models' quality and performance metrics (R2, ICC, LOO, AIC, BF, ...)
social-media-profiles-regexs
:card_index: Extract social media profiles and more with regular expressions
Causality4NLP_Papers
A reading list for papers on causality for natural language processing (NLP)
patchworklib
Patchwork for matplotlib: A subplot manager for intuitive layouts in matplotlib, seaborn, and plotnine.
AllNewsSpider
澎湃新闻,新浪新闻,腾讯新闻,搜狐新闻,新闻联播,泰晤士报,纽约时报,BBCNews,旨在爬取所有新闻门户网站的新闻,禁止将所得数据商用!
instagram_influencer_dataset
Influencer dataset collected from Instagram
histcite-python
HistCite 工具的 Python 实现
DiachronicEmb-BigHistData
Tools to train and explore diachronic word embeddings from Big Historical Data
news_spider
项目基于Scrapy实现,爬取新闻网站主要新闻,通过gen库提取内容,存储到mysql中。实现定时爬取和增量爬取。已爬取:、湖南在线、四月、四川新闻、广州日报大洋网、光明网、四川在线、东南网、中青在线、中评网、北晚在线、**消费网、**科技网、**经济网、**日报、**交通新闻网、**经济新闻网、中华网、文明网、南方网、**新闻网