Wei Shi's starred repositories
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
gpt_academic
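Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.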
EasySpider
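A visual no-code web crawler/spider. EasySpider: a visual browser automation testing, data collection, and crawler tool that lets you design and run crawling tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.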
ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Contributions and pull requests are welcome.
CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included.
awesome-twitter-data
A list of Twitter datasets and related resources.
Sentiment-Analysis-Twitter
🎓 RESEARCH [NLP 💭] We use different feature sets and machine learning classifiers to determine the best combination for sentiment analysis of Twitter.
nlp-text-emotion
Multi-class sentiment analysis with an LSTM and a fine-tuned BERT
learning-stm
Learning structural topic modeling using the stm R package.
Datasets-for-Hate-Speech-Detection
Datasets for Hate Speech Detection
bitermplus
Biterm Topic Model (BTM): modeling topics in short texts
ClickBait-Detector
This repository presents an AI method to classify an article as clickbait or non-clickbait
twitterspyder
Twitter crawler
EchoChambers
Mapping Echo Chambers In Large Networks
Echo-chamber_COVID-19_edition
Network analysis experiment on echo chambers in COVID-19 tweets.