Wei Shi (Mosw5871)

Company: Shanghai Jiao Tong University

Location: Shanghai

Home Page: https://smd.sjtu.edu.cn

Wei Shi's starred repositories

transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | Stargazers: 129498 | Issues: 1120 | Issues: 15249

gpt_academic

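Provides a practical interactive interface for GPT/GLM and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports project analysis and self-translation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen (通义千问), deepseekcoder, iFlytek Spark (讯飞星火), ERNIE Bot (文心一言), llama2, rwkv, claude2, moss, and others.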

Language: Python | License: GPL-3.0 | Stargazers: 62274 | Issues: 261 | Issues: 1526

bert

TensorFlow code and pre-trained models for BERT

Language: Python | License: Apache-2.0 | Stargazers: 37514 | Issues: 997 | Issues: 1142

jieba

Jieba ("结巴") Chinese word segmentation

Language: Python | License: MIT | Stargazers: 32832 | Issues: 1283 | Issues: 847

EasySpider

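A visual no-code/code-free web crawler/spider. EasySpider (易采集): a visual browser automation testing, data collection, and web crawling tool that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.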

Language: JavaScript | License: NOASSERTION | Stargazers: 30356 | Issues: 201 | Issues: 410

pycaret

An open-source, low-code machine learning library in Python

Language: Jupyter Notebook | License: MIT | Stargazers: 8679 | Issues: 133 | Issues: 2295

ChineseNLPCorpus

Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.

CLUEDatasetSearch

Search all Chinese NLP datasets, with commonly used English NLP datasets included.

textstat

Python package to calculate readability statistics of a text object: paragraphs, sentences, articles.

Language: Python | License: MIT | Stargazers: 1110 | Issues: 19 | Issues: 112

awesome-twitter-data

A list of Twitter datasets and related resources.

Sentiment-Analysis-Twitter

RESEARCH [NLP] We use different feature sets and machine learning classifiers to determine the best combination for sentiment analysis of Twitter.

BERTweet

BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)

Language: Python | License: MIT | Stargazers: 569 | Issues: 12 | Issues: 46

BTM

Code for Biterm Topic Model (published in WWW 2013)

Language: C++ | License: Apache-2.0 | Stargazers: 402 | Issues: 22 | Issues: 26

nlp-text-emotion

Multi-class sentiment analysis with LSTM and fine-tuned BERT

Language: Jupyter Notebook | Stargazers: 191 | Issues: 5 | Issues: 7

learning-stm

Learning structural topic modeling using the stm R package.

TweeterPy

TweeterPy is a Python library to extract data from Twitter. The TweeterPy API lets you scrape data from a user's profile, such as username, user ID, bio, followers/followings list, profile media, and tweets.

Language: Python | License: MIT | Stargazers: 121 | Issues: 4 | Issues: 57

ergm

Fit, Simulate and Diagnose Exponential-Family Models for Networks

Language: R | License: NOASSERTION | Stargazers: 94 | Issues: 16 | Issues: 439

BTM

Biterm Topic Modelling for Short Text with R

Language: C++ | License: Apache-2.0 | Stargazers: 93 | Issues: 8 | Issues: 16

mybook

Lectures on Computational Communication

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 81 | Issues: 4 | Issues: 7

hurtlex

A multilingual lexicon of words to hurt.

Language: Python | Stargazers: 74 | Issues: 3 | Issues: 0

bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

Language: Cython | License: MIT | Stargazers: 73 | Issues: 2 | Issues: 29
Language: Jupyter Notebook | Stargazers: 36 | Issues: 0 | Issues: 0

athec

Computational aesthetic analysis of visual media

ClickBait-Detector

This repository presents an AI method to classify an article as clickbait or non-clickbait.

Language: Python | Stargazers: 11 | Issues: 5 | Issues: 0

twitterspyder

Twitter crawler

Language: Jupyter Notebook | Stargazers: 10 | Issues: 0 | Issues: 0

EchoChambers

Mapping Echo Chambers In Large Networks

Language: Python | License: MIT | Stargazers: 7 | Issues: 0 | Issues: 0

ctap-web

CTAP's Web frontend

Language: Java | License: NOASSERTION | Stargazers: 3 | Issues: 0 | Issues: 0

Echo-chamber_COVID-19_edition

Network analysis experiment on echo chambers in COVID-19 tweets.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 1 | Issues: 0

clickbait

This is a notebook that covers EDA and model building for a use case: identifying whether a statement is clickbait or not.

Language: Jupyter Notebook | Stargazers: 1 | Issues: 0 | Issues: 0