Leo Y. YANG's starred repositories
databonsai
clean & curate your data with LLMs.
Langchain-Chatchat
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
magical_spider
神奇的蜘蛛🕷,一个几乎适用于所有web端站点的采集方案
Bert-Chinese-Text-Classification-Pytorch
使用Bert,ERNIE,进行中文文本分类
Keras-TextClassification
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
BERT-for-Sequence-Labeling-and-Text-Classification
This is the template code to use BERT for sequence lableing and text classification, in order to facilitate BERT for more tasks. Currently, the template code has included conll-2003 named entity identification, Snips Slot Filling and Intent Prediction.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
100-Days-Of-ML-Code
100-Days-Of-ML-Code中文版
National-Data
国家统计局的国家数据网站数据抓取器,可以直接使用1978-2016所有年鉴指标的csv数据
text-classification-cnn-rnn
CNN-RNN中文文本分类,基于TensorFlow
captcha_recognize
Image Recognition captcha without image segmentation 无需图片分割的验证码识别
hq-proxies
A daemon to maintain a high-quality HTTP proxy pool
alfred-airpods-selector
Use Alfred to Switch Between AirPods and Default Audio Sources on macOS
IPProxyPool
IPProxyPool代理池项目,提供代理ip
pyPushBullet
Python library to interface with PushBullet
pushbullet.py
A python client for http://pushbullet.com
fitbitScraper
R package to scrape fitbit data
cleanthesis
Clean Thesis is a clean, simple, and elegant LaTeX style (or template) for thesis documents.
the-swift-programming-language-in-chinese
中文版 Apple 官方 Swift 教程《The Swift Programming Language》