There are 21 repositories under vietnamese-nlp topic.
Underthesea - Vietnamese NLP Toolkit
PhoGPT: Generative Pre-training for Vietnamese (2023)
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)
Vietnamese question answering system with BERT
VietASR - Vietnamese Automatic Speech Recognition
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
A Fast and Accurate Vietnamese Word Segmenter (LREC 2018)
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
Electra pre-trained model using Vietnamese corpus
Vietnamese Automatic Speech Recognition
Vietnamese sensitive words (including teencode) was created by ML algorithm
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
Vietnamese Word Tokenize
PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)
A simple/fast/accurate accent prediction for non-accented Vietnamese text
We use LSTM, BiLSTM, BERT and SVM with TF-IDF, Word2vec and Bag-of-words to classify this documents to positive (labeled as 1), neutral (labeled as 0) and negative (labeled as 2)
A Robustly Optimized BERT Pretraining Approach for Vietnamese
[VietSentiWordNet] A quick and simple method to find Opinion for Vietnamese text.
Speech and Language Processing 3rd edition Vietnamese Translation
VietTTS: An Open-Source Vietnamese Text to Speech
VnDT: A Vietnamese Dependency Treebank
Vietnamese Wikipedia Corpus
Vietnamese long form question answering system with documents retrieval.