zengpeiyang's repositories
nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
bert
TensorFlow code and pre-trained models for BERT
bert-multitask-learning
BERT for Multitask Learning
causalml
Uplift modeling and causal inference with machine learning algorithms
CLUECorpus2020
Large-scale pre-training corpus for Chinese (100 GB)
ctrl-sum
Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper
dict_build
Automatically build a Chinese lexicon: http://www.matrix67.com/blog/archives/5044
focal_loss_pytorch
A PyTorch Implementation of Focal Loss.
LAMA
LAnguage Model Analysis
learn-regex
Learn regex the easy way
LeetCodeAnimation
Demonstrates the approach to solving LeetCode problems in the form of animation.
mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
models
Models and examples built with TensorFlow
nmt
TensorFlow Neural Machine Translation Tutorial
OpenCC
A project for conversion between Traditional and Simplified Chinese
OpenSeq2Seq
Toolkit for efficient experimentation with various sequence-to-sequence models
pycorrector
pycorrector is a toolkit for text error correction. It implements models including Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, and Transformer, and works out of the box.
QuickStart
Companion code for the Baidu AI platform QuickStart documentation
roberta_zh
RoBERTa for Chinese: Chinese pre-trained RoBERTa models
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
SimCSE
SimCSE: Simple Contrastive Learning of Sentence Embeddings
spark
Apache Spark
TensorRT
TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
wenku_spider
Download Baidu Wenku documents without vouchers; supports doc, txt, ppt, pdf