Gujie Li's starred repositories
TikTokDownloader
TikTok 主页/合辑/直播/视频/图集/原声;抖音主页/视频/图集/收藏/直播/原声/合集/评论/账号/搜索/热榜数据采集工具
XHS-Downloader
小红书链接提取/作品采集工具:提取账号发布、收藏、点赞作品链接;提取搜索结果作品、用户链接;采集小红书作品信息;提取小红书作品下载地址;下载小红书无水印作品文件!
pysentimiento
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks
emotion2vec
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
pytextclassifier
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。
w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
fast_vector_similarity
The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.
my-voice-analysis
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
stata-schemepack
Here you will find various ready-to-use Stata schemes.
AmazonReviews2023
Scripts for processing the Amazon Reviews 2023 dataset; implementations and checkpoints of BLaIR: "Bridging Language and Items for Retrieval and Recommendation".
Speaker_diarization
Speech Diarization for scrum automation
text2gender
Predict the author's gender from their text.
tsa-complaint-counts
Monthly counts of TSA traveler complaints by airport, category, and subcategory.
liwc-22-cli-python
Examples of how to call the LIWC-22 CLI from a Python script