LeHuyHongNhat / Sentiment_Analysis_Using_PhoBert_Trained

This project implements sentiment analysis using PhoBERT, a pre-trained Vietnamese BERT model, fine-tuned on the UIT-VSFC dataset to classify the sentiment of Vietnamese text.

Repository from GitHub: https://github.com/LeHuyHongNhat/Sentiment_Analysis_Using_PhoBert_Trained

Sentiment Analysis Using PhoBERT

Harnessing PhoBERT and the UIT-VSFC dataset for advanced Vietnamese sentiment analysis.


📑 Overview

This project fine-tunes PhoBERT, a transformer-based model pre-trained on Vietnamese text, on the UIT-VSFC dataset to classify sentiment in Vietnamese text. The resulting classifier is intended for natural language understanding applications in Vietnamese.

✨ Key Features

  • State-of-the-Art NLP: Uses PhoBERT, a transformer model pre-trained on Vietnamese text.
  • UIT-VSFC Dataset: Trained on UIT-VSFC, a corpus of Vietnamese student feedback annotated with sentiment labels.
  • Robust Sentiment Analysis: Classifies text sentiment as positive, negative, or neutral.
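To illustrate the three-class output, here is a minimal sketch of how a classifier's raw scores (logits) could be mapped to a sentiment label. The label order and the helper names `softmax` and `predict_label` are assumptions made for illustration, not taken from the project's notebooks.

```python
import math

# Assumed label order (0 = negative, 1 = neutral, 2 = positive); check it
# against the label ids used when the model was trained.
LABEL_NAMES = ["negative", "neutral", "positive"]


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict_label(logits):
    """Return the most likely label name and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return LABEL_NAMES[best], probs[best]


# Hypothetical logits for one sentence; the highest score is at index 2.
label, confidence = predict_label([-1.2, 0.3, 2.5])
print(label, round(confidence, 3))
```

With a fine-tuned model, the logits would come from the model's output for a tokenized sentence rather than from a hand-written list.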

🔧 Installation

  1. Clone the repository:
    git clone https://github.com/LeHuyHongNhat/Sentiment-Analysis-Using-PhoBERT.git
  2. Navigate to the project directory:
    cd Sentiment-Analysis-Using-PhoBERT

📝 Usage

Data Preparation

Download the UIT-VSFC dataset in .
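Once the dataset is on disk, each split can be read into (sentence, label) pairs. The sketch below assumes a layout in which every split directory holds a `sents.txt` file and a `sentiments.txt` file with one entry per line, and that labels are the integers 0 (negative), 1 (neutral), and 2 (positive). Both the file names and the label ids are assumptions to verify against the copy you download. `load_split` is a hypothetical helper name.

```python
from pathlib import Path

# Assumed label ids; verify against the dataset's own documentation.
LABEL_NAMES = {0: "negative", 1: "neutral", 2: "positive"}


def load_split(split_dir):
    """Read one split directory into a list of (sentence, label_id) pairs."""
    split_dir = Path(split_dir)
    # Assumed file names: one sentence per line, one label per line.
    sentences = (split_dir / "sents.txt").read_text(encoding="utf-8").splitlines()
    labels = (split_dir / "sentiments.txt").read_text(encoding="utf-8").splitlines()
    if len(sentences) != len(labels):
        raise ValueError(
            f"{len(sentences)} sentences but {len(labels)} labels in {split_dir}"
        )
    pairs = []
    for sentence, label in zip(sentences, labels):
        sentence = sentence.strip()
        if not sentence:
            continue  # skip blank lines together with their labels
        label_id = int(label)
        if label_id not in LABEL_NAMES:
            raise ValueError(f"unexpected label {label_id!r}")
        pairs.append((sentence, label_id))
    return pairs
```

For example, `train_pairs = load_split("data/train")` would return the training pairs, where `data/train` is a placeholder path.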

Model Training

Fine-tune PhoBERT on the UIT-VSFC dataset.
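The exact training code lives in the notebooks. As a rough sketch of what fine-tuning could look like with PyTorch and Hugging Face Transformers, the example below trains a three-class classification head on a list of (sentence, label_id) pairs. The checkpoint name `vinai/phobert-base`, the hyperparameters, and the names `TrainConfig`, `batches`, and `fine_tune` are assumptions for illustration. It also assumes `torch` and `transformers` are installed and that sentences have already been word-segmented, which PhoBERT expects.

```python
import random
from dataclasses import dataclass


@dataclass
class TrainConfig:
    """Illustrative hyperparameters, not the values used in the notebooks."""
    model_name: str = "vinai/phobert-base"  # assumed checkpoint
    num_labels: int = 3
    max_length: int = 256
    batch_size: int = 16
    epochs: int = 3
    learning_rate: float = 2e-5


def batches(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def fine_tune(pairs, config=None):
    """Fine-tune a sequence classifier on (sentence, label_id) pairs."""
    # Heavy dependencies are imported here so the helpers above stay usable
    # without them.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    config = config or TrainConfig()
    tokenizer = AutoTokenizer.from_pretrained(config.model_name)
    model = AutoModelForSequenceClassification.from_pretrained(
        config.model_name, num_labels=config.num_labels
    )
    optimizer = torch.optim.AdamW(model.parameters(), lr=config.learning_rate)

    model.train()
    for _ in range(config.epochs):
        shuffled = list(pairs)
        random.shuffle(shuffled)  # new example order every epoch
        for batch in batches(shuffled, config.batch_size):
            texts = [text for text, _ in batch]
            labels = torch.tensor([label for _, label in batch])
            encoded = tokenizer(
                texts,
                padding=True,
                truncation=True,
                max_length=config.max_length,
                return_tensors="pt",
            )
            # Passing labels makes the model return a cross-entropy loss.
            loss = model(**encoded, labels=labels).loss
            loss.backward()
            optimizer.step()
            optimizer.zero_grad()
    return tokenizer, model
```

Calling `tokenizer, model = fine_tune(train_pairs)` would run the loop on CPU. Moving the model and tensors to a GPU, adding a learning-rate schedule, and validating after each epoch are left out to keep the sketch short.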

Evaluation

Evaluate model performance on the test dataset.
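Which metrics the notebooks report is not stated here. Accuracy and macro-averaged F1 are common choices for three-class sentiment classification, so the sketch below computes both from lists of true and predicted label ids. The function name `evaluate` and the choice of metrics are assumptions.

```python
def evaluate(y_true, y_pred, num_classes=3):
    """Compute accuracy, per-class F1, and macro-F1 from label id lists."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot evaluate an empty prediction list")

    pairs = list(zip(y_true, y_pred))
    correct = sum(t == p for t, p in pairs)

    f1_per_class = []
    for c in range(num_classes):
        tp = sum(t == c and p == c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        f1_per_class.append(2 * tp / denom if denom else 0.0)

    return {
        "accuracy": correct / len(pairs),
        "f1_per_class": f1_per_class,
        "macro_f1": sum(f1_per_class) / num_classes,
    }


# Hypothetical labels for six test sentences.
report = evaluate([0, 0, 1, 2, 2, 2], [0, 1, 1, 2, 2, 0])
print(report)
```

Macro-F1 weights each class equally, which makes it a useful complement to accuracy when one sentiment class is much rarer than the others.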

📊 Results

The fine-tuned model achieves high accuracy on the UIT-VSFC dataset, which indicates that PhoBERT is effective for Vietnamese sentiment analysis. Detailed metrics are in the notebooks.

🚀 Future Work

  • Dataset Expansion: Incorporate additional Vietnamese sentiment datasets.
  • Multi-Label Sentiment: Extend the model to capture more nuanced sentiment variations.

📄 License

This project is licensed under the MIT License.

🤝 Acknowledgments

Made with ❤️ by Lê Huy Hồng Nhật
