Jack Shan's repositories
awesome-chatgpt-prompts
A curated collection of ChatGPT prompts for using ChatGPT more effectively.
news-article-crawling
Crawls news article text, images, and other information from their URLs.
Crowdfunding-Predicting-Kickstarter-Project-Sucess
We predict whether a Kickstarter project proposal will meet its fundraising goal using only information available at project launch, based on 220,000 project proposals scraped from Kickstarter. We evaluate the performance of different machine learning models on these predictions based on the project category...
BuzzFace
A data set regarding news veracity on social media. Published at ICWSM-18.
financial-machine-learning
A curated list of practical financial machine learning tools and applications.
Chinese_wordseg_sentiment
Segment Chinese sentences and calculate their sentiment.
COVID-19-InstaPostIDs
The repository includes an ongoing collection of Instagram post IDs related to the novel coronavirus (COVID-19).
awesome-causality-algorithms
An index of algorithms for learning causality with data
harry_potter_nlp
Harry Potter and the Allocation of Dirichlet
awesome-spider
A collection of web crawlers
FakeNewsNet
This is a dataset for fake news detection research
CKA-Centered-Kernel-Alignment
Reproduce CKA: Similarity of Neural Network Representations Revisited
data-validation
Library for exploring and validating machine learning data
HiddenMarkovModel
Code for the Hidden Markov Model Tutorial Series
fake-video-corpus
A dataset of debunked and verified user-generated videos.
Review_Inconsistency
Inconsistency between online review ratings and their sentiment
ganhacks
Starter from "How to Train a GAN?" at NIPS 2016
Crowdfunding
Scraper for obtaining structured crowdfunding data, implemented in Python with Scrapy.
ai-deadlines
:alarm_clock: AI conference deadline countdowns
stopwords
Commonly used Chinese stopword lists (Harbin Institute of Technology stopword list, Baidu stopword list, etc.)
DPlayer
:lollipop: Wow, such a lovely HTML5 danmaku video player
text-classification-cnn-rnn
Chinese text classification with CNN and RNN, based on TensorFlow
textgenrnn
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.