Yaotian Zhang's repositories
yaotianzhang.github.io
yaotianzhang.github.io
Literatures
Literatures for writting papers
ftools
Fast Stata commands for large datasets
Peeking-Strategy
data&code of this research
weibo-crawler
新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频
textnets
Text analysis with networks.
reghdfe
Linear, IV and GMM Regressions With Any Number of Fixed Effects
ddf--gapminder--systema_globalis
Gapminder's fact-base with local & global statistics
twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
learn-regex
Learn regex the easy way
vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualize and explore big tabular data at a billion rows per second 🚀
pyLDAvis
Python library for interactive topic model visualization. Port of the R LDAvis package.
mybook
Elements of Computational Communicaiton
twarc
A command line tool (and Python library) for archiving Twitter JSON
hate-speech-and-offensive-language
Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017
open-covid-19-data
Open source aggregation pipeline for public COVID-19 data, including hospitalization/ICU/ventilator numbers for many countries.
MM-COVID
Cross Linugual COVID-19 Fake News Dataset
weiboSpider
新浪微博爬虫,用python爬取新浪微博数据
American-political-data-and-R
A guide to accessing & analyzing open source American political data using R. Fall 2020 Version.
bigdata
NJU Master Course **Big Data Mining and Analysis**
nltk_data
NLTK Data
opensources
Curated lists of credible and non-credible online sources, available for public use
Auto_CLIWC
Code for Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention (AAAI18)