Yiming Cui (ymcui)

ymcui

Geek Repo

Company:Joint Laboratory of HIT and iFLYTEK Research (HFL)

Location:Beijing, China

Home Page:http://ymcui.github.io

Twitter:@KCrosner

Github PK Tool:Github PK Tool

Yiming Cui's repositories

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:17860Issues:185Issues:728

Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Language:PythonLicense:Apache-2.0Stargazers:9430Issues:142Issues:239

Chinese-LLaMA-Alpaca-2

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Language:PythonLicense:Apache-2.0Stargazers:6992Issues:75Issues:384

Chinese-XLNet

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

Language:PythonLicense:Apache-2.0Stargazers:1642Issues:33Issues:69

Chinese-ELECTRA

Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)

Language:PythonLicense:Apache-2.0Stargazers:1384Issues:26Issues:85

Chinese-LLaMA-Alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Language:PythonLicense:Apache-2.0Stargazers:1218Issues:17Issues:54

MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

Chinese-Mixtral

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

Language:PythonLicense:Apache-2.0Stargazers:548Issues:15Issues:10

cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:408Issues:12Issues:23

PERT

PERT: Pre-training BERT with Permuted Language Model

Chinese-RC-Datasets

Collections of Chinese reading comprehension datasets

License:CC-BY-SA-4.0Stargazers:211Issues:13Issues:0

LERT

LERT: A Linguistically-motivated Pre-trained Language Model(语言学信息增强的预训练模型LERT)

Language:PythonLicense:Apache-2.0Stargazers:188Issues:3Issues:6

Chinese-Cloze-RC

A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)

License:CC-BY-SA-4.0Stargazers:166Issues:14Issues:0

cmrc2019

A Sentence Cloze Dataset for Chinese Machine Reading Comprehension (CMRC 2019)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:125Issues:10Issues:2

LAMB_Optimizer_TF

LAMB Optimizer for Large Batch Training (TensorFlow version)

Language:PythonLicense:Apache-2.0Stargazers:120Issues:8Issues:2

cmrc2017

The First Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2017)

Language:PythonLicense:CC-BY-SA-4.0Stargazers:92Issues:13Issues:0

Eval-on-NN-of-RC

Empirical Evaluation on Current Neural Networks on Cloze-style Reading Comprehension

License:CC-BY-SA-4.0Stargazers:86Issues:12Issues:4

Chinese-MobileBERT

Chinese MobileBERT(中文MobileBERT模型)

Language:PythonLicense:Apache-2.0Stargazers:78Issues:3Issues:3

ChatGPT-in-Academia

Policies of scientific publisher and conferences towards large language model (LLM), such as ChatGPT

License:CC-BY-SA-4.0Stargazers:72Issues:3Issues:0

Cross-Lingual-MRC

Cross-Lingual Machine Reading Comprehension (EMNLP 2019)

Language:PythonLicense:Apache-2.0Stargazers:67Issues:7Issues:4

expmrc

ExpMRC: Explainability Evaluation for Machine Reading Comprehension

Language:PythonLicense:CC-BY-SA-4.0Stargazers:59Issues:3Issues:0

NLP-Review-Scorer

Score your NLP paper review

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:24Issues:3Issues:0

ACL2020-PC-Blogs-Chinese

Chinese Version of ACL 2020 PC Blogs (ACL 2020程序委员会博文中文版)

License:CC-BY-SA-4.0Stargazers:14Issues:4Issues:0

mrc-model-analysis

Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models (iScience)

Language:PythonLicense:Apache-2.0Stargazers:7Issues:2Issues:1

xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Language:PythonLicense:Apache-2.0Stargazers:3Issues:4Issues:0

llama.cpp

Port of Facebook's LLaMA model in C/C++

Language:C++License:MITStargazers:1Issues:1Issues:0

VLE

VLE: Vision-Language Encoder (VLE: 视觉-语言多模态预训练模型)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0