yehias21 / Extensive_analysis_on_NLP_models_for_sentiment_analysis_in_business_applications

A full profile for various approaches in order to select the best opinion extraction method for business applications

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Extensive analysis on NLP models for sentiment analysis in business applications

picture

Models Ideas:

  1. Use of custom models for different categories (tech, food, books,...) to be automatically (using context classfication) or manually selected(by the client). (different datasets applied)
  2. Run multiple models per dataset and derive weighted average results.
  3. Developing a layered classification use fast/slow classification (divide the dataset using confidence index to strong and weak groups; the weak group will be analysed further using Roberta model).
  4. Aspect based analysis (attach sentiment to specific aspects rather than sentence/opinion) and word cloud (for word frequencies) to show insights of the reviews. (Amazon comprehend model)
  5. Use of lemmatization, opinion unit extractor, subjectivity index and multiclass classification(love, sad, angry,...) for better accuracy and data enrichment.
  6. Test of a sent-ngrams lexion sentiment analysis (SO-CAL).
  7. Use of client dataset to fine-tune the model. (Ideation phase)

Datasets used:

  1. Twitter airline
  2. IMDB
  3. Yelp (preprocessing phase)
  4. Amazon
  5. 140sentiment twitter

Models used:

  1. Textblob
  2. Flair (RNN)
  3. Vader
  4. Roberta-large
  5. Bert_multilingual

Results:

crop

About

A full profile for various approaches in order to select the best opinion extraction method for business applications


Languages

Language:Jupyter Notebook 100.0%