Deng Xudong's starred repositories

pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Language:PythonLicense:BSD-3-ClauseStargazers:42386Issues:1113Issues:26511

WeChatMsg

提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手

Language:PythonLicense:GPL-3.0Stargazers:30375Issues:161Issues:370

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonLicense:GPL-3.0Stargazers:12516Issues:52Issues:136

tsfresh

Automatic extraction of relevant features from time series:

Language:Jupyter NotebookLicense:MITStargazers:8171Issues:168Issues:527

TikTokDownloader

完全免费开源,基于 AIOHTTP 模块实现:TikTok 主页/视频/图集/原声;抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具

Language:PythonLicense:GPL-3.0Stargazers:6317Issues:39Issues:210

flashtext

Extract Keywords from sentence or Replace keywords in sentences.

Language:PythonLicense:MITStargazers:5570Issues:141Issues:113

openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Language:PythonLicense:Apache-2.0Stargazers:5086Issues:51Issues:184

causalml

Uplift modeling and causal inference with machine learning algorithms

Language:PythonLicense:NOASSERTIONStargazers:4842Issues:84Issues:389

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4266Issues:79Issues:167

PyWxDump

获取微信账号信息(昵称/账号/手机/邮箱/数据库密钥/wxid);PC微信数据库读取、解密脚本;聊天记录查看工具;聊天记录导出为html(包含语音图片)。支持多账户信息获取,支持所有微信版本。

Language:PythonLicense:NOASSERTIONStargazers:4175Issues:33Issues:90

EconML

ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:3614Issues:77Issues:549

caj2pdf

Convert CAJ (China Academic Journals) files to PDF. 转换**知网 CAJ 格式文献为 PDF。佛系转换,成功与否,皆是玄学。

Language:PythonLicense:NOASSERTIONStargazers:2845Issues:45Issues:80

patchwork

The Composer of ggplots

Language:RLicense:NOASSERTIONStargazers:2405Issues:48Issues:315

pyts

A Python package for time series classification

Language:PythonLicense:BSD-3-ClauseStargazers:1724Issues:25Issues:78

awesome-ggplot2

A curated list of awesome ggplot2 tutorials, packages etc.

ppscore

Predictive Power Score (PPS) in Python

Language:PythonLicense:MITStargazers:1086Issues:27Issues:61

performance

:muscle: Models' quality and performance metrics (R2, ICC, LOO, AIC, BF, ...)

Language:RLicense:GPL-3.0Stargazers:958Issues:25Issues:456

social-media-profiles-regexs

:card_index: Extract social media profiles and more with regular expressions

dm

Working with relational data models in R

Language:RLicense:NOASSERTIONStargazers:492Issues:10Issues:626

Causality4NLP_Papers

A reading list for papers on causality for natural language processing (NLP)

patchworklib

Patchwork for matplotlib: A subplot manager for intuitive layouts in matplotlib, seaborn, and plotnine.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:351Issues:7Issues:44

AllNewsSpider

澎湃新闻,新浪新闻,腾讯新闻,搜狐新闻,新闻联播,泰晤士报,纽约时报,BBCNews,旨在爬取所有新闻门户网站的新闻,禁止将所得数据商用!

Language:PythonLicense:Apache-2.0Stargazers:313Issues:7Issues:15

likert

Package to analyze likert based items.

ggprism

ggplot2 extension inspired by GraphPad Prism

rmrb

人民日报(1946-2003)

instagram_influencer_dataset

Influencer dataset collected from Instagram

histcite-python

HistCite 工具的 Python 实现

Language:PythonLicense:MITStargazers:19Issues:4Issues:11

DiachronicEmb-BigHistData

Tools to train and explore diachronic word embeddings from Big Historical Data

Language:Jupyter NotebookLicense:MITStargazers:18Issues:1Issues:5

xwlb

新闻联播开放数据

Language:JavaScriptLicense:MITStargazers:16Issues:0Issues:0

news_spider

项目基于Scrapy实现,爬取新闻网站主要新闻,通过gen库提取内容,存储到mysql中。实现定时爬取和增量爬取。已爬取:、湖南在线、四月、四川新闻、广州日报大洋网、光明网、四川在线、东南网、中青在线、中评网、北晚在线、**消费网、**科技网、**经济网、**日报、**交通新闻网、**经济新闻网、中华网、文明网、南方网、**新闻网

Language:PythonStargazers:4Issues:0Issues:0