wyfdgg's repositories
bicleaner
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
langchain-tutorials
Overview and tutorial of the LangChain Library
PromptCBLUE
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese
BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
alpaca-lora
Instruct-tune LLaMA on consumer hardware
langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
Linly
Chinese-LLaMA基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
LLMPruner
模型词表裁剪
conference_call_for_paper
2019-2020 International Conferences in Artificial Intelligence, Machine Learning, Computer Vision, Data Mining, Natural Language Processing and Robotics
seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
NCRFpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
GloVe
GloVe model for distributed word representation
PLMpapers
Must-read Papers on pre-trained language models.
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
mosesdecoder
Moses, the machine translation system
PyTorch-Course
JULYEDU PyTorch Course
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
marian
Fast Neural Machine Translation in C++
BERT-pytorch
Google AI 2018 BERT pytorch implementation
pytorch_NER_BiLSTM_CNN_CRF
End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF implement in pyotrch
Bert-BiLSTM-CRF-pytorch
使用谷歌预训练bert做字嵌入的BiLSTM-CRF序列标注模型
nmt
TensorFlow Neural Machine Translation Tutorial
bert
TensorFlow code and pre-trained models for BERT
DeepLearningProject
An in-depth machine learning tutorial introducing readers to a whole machine learning pipeline from scratch.
stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)