Yiming Cui (ymcui)


User data from GitHub: https://github.com/ymcui

Company: Joint Laboratory of HIT and iFLYTEK Research (HFL)

Location: Beijing, China

Home Page: http://ymcui.github.io

GitHub: @ymcui

Twitter: @KCrosner

Yiming Cui's repositories

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment

Language: Python | License: Apache-2.0 | Stargazers: 18921 | Issues: 183 | Issues: 732

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

Language: Python | License: Apache-2.0 | Stargazers: 10071 | Issues: 141 | Issues: 242

Chinese-LLaMA-Alpaca-2

Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, with 64K long-context models

Language: Python | License: Apache-2.0 | Stargazers: 7176 | Issues: 78 | Issues: 390

Chinese-LLaMA-Alpaca-3

Phase three of the Chinese Alpaca LLM project (Chinese Llama-3 LLMs), developed from Meta Llama 3

Language: Python | License: Apache-2.0 | Stargazers: 1942 | Issues: 21 | Issues: 81

Chinese-XLNet

Pre-trained Chinese XLNet models

Language: Python | License: Apache-2.0 | Stargazers: 1649 | Issues: 31 | Issues: 69

Chinese-ELECTRA

Pre-trained Chinese ELECTRA models

Language: Python | License: Apache-2.0 | Stargazers: 1431 | Issues: 25 | Issues: 86

MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

Chinese-Mixtral

Chinese Mixtral mixture-of-experts (MoE) LLMs

Language: Python | License: Apache-2.0 | Stargazers: 608 | Issues: 15 | Issues: 10

cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 427 | Issues: 11 | Issues: 24

PERT

PERT: Pre-training BERT with Permuted Language Model

Chinese-RC-Datasets

Collections of Chinese reading comprehension datasets

License: CC-BY-SA-4.0 | Stargazers: 217 | Issues: 12 | Issues: 0

LERT

LERT: A Linguistically-motivated Pre-trained Language Model

Language: Python | License: Apache-2.0 | Stargazers: 205 | Issues: 3 | Issues: 6

Chinese-Cloze-RC

A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)

License: CC-BY-SA-4.0 | Stargazers: 170 | Issues: 13 | Issues: 0

cmrc2019

A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 126 | Issues: 9 | Issues: 2

LAMB_Optimizer_TF

LAMB Optimizer for Large Batch Training (TensorFlow version)

Language: Python | License: Apache-2.0 | Stargazers: 120 | Issues: 7 | Issues: 2

cmrc2017

The First Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2017)

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 91 | Issues: 12 | Issues: 0

Chinese-MobileBERT

Chinese MobileBERT models

Language: Python | License: Apache-2.0 | Stargazers: 89 | Issues: 3 | Issues: 3

Eval-on-NN-of-RC

Empirical Evaluation on Current Neural Networks on Cloze-style Reading Comprehension

License: CC-BY-SA-4.0 | Stargazers: 87 | Issues: 11 | Issues: 4

ChatGPT-in-Academia

Policies of scientific publishers and conferences on large language models (LLMs) such as ChatGPT

License: CC-BY-SA-4.0 | Stargazers: 74 | Issues: 2 | Issues: 0

Cross-Lingual-MRC

Cross-Lingual Machine Reading Comprehension (EMNLP 2019)

Language: Python | License: Apache-2.0 | Stargazers: 68 | Issues: 6 | Issues: 4

expmrc

ExpMRC: Explainability Evaluation for Machine Reading Comprehension

Language: Python | License: CC-BY-SA-4.0 | Stargazers: 62 | Issues: 2 | Issues: 0

NLP-Review-Scorer

Score your NLP paper review

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 24 | Issues: 2 | Issues: 0

ACL2020-PC-Blogs-Chinese

Chinese version of the ACL 2020 Program Committee (PC) blog posts

License: CC-BY-SA-4.0 | Stargazers: 14 | Issues: 3 | Issues: 0

mrc-model-analysis

Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models (iScience)

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 2 | Issues: 1

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Language: Python | License: Apache-2.0 | Stargazers: 3 | Issues: 3 | Issues: 0

llama.cpp

Port of Facebook's LLaMA model in C/C++

Language: C++ | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

VLE

VLE: Vision-Language Encoder (a vision-language multimodal pre-trained model)

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0