Leven / Xinze Lyu's repositories
RL_exercise
My exercise for Reinforcement Learning
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU部署 (Chinese LLaMA & Alpaca LLMs)
data-selection-survey
A Survey on Data Selection for Language Models
dolma
Data and tools for generating and inspecting OLMo pre-training data.
DownloadConceptualCaptions
Reliably download millions of images efficiently
FindTheChatGPTer
ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利
Flair_BUG_REPORTER
Sending requests to Flair NER model with high concurrency will lead the GPU stuck in 100% usage.
ModelByPytorch
Implementing some models with pytorch, it is just for fun.
Firefly
Firefly: 大模型训练工具,支持训练Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
flip
Official Open Source code for "Scaling Language-Image Pre-training via Masking"
funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料、中文谣言数据、百度中文问答数据集、句子相似度匹配算法集合、bert资源、文本生成&摘要相关工具、cocoNLP信息抽取工具、国内电话号码正则匹配、清华大学XLORE:中英文跨语言百科知识图谱、清华大学人工智能技术系列报
human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Megatron-LM
Ongoing research training transformer models at scale
shadowsocks
shadowsocks.wiki
simple-simcse
A simple implementation of SimCSE
simpleT5
simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.
transformers-bloom-inference
Fast Inference Solutions for BLOOM
ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
vlog4j
Java library based on the VLog rule engine
zero_nlp
中文nlp应用(大模型、数据、模型、训练、推理)