This is the code related to the article.
Note that this is a proof-of-concept, not a production-ready pipeline: it is meant to demonstrate the potential of the approach, not to be deployed as-is. To set up the environment:
$ python3 -m venv .venv
$ source .venv/bin/activate
$ pip install -r requirements.txt
To run the Scrapy spider on a specific Amazon URL:
$ scrapy runspider \
absa/scraping/amazon.py \
-O reviews.csv \
-a start_url='https://www.amazon.com/FIODIO-Comfortable-Anti-Ghosting-Resistant-Multimedia/product-reviews/B086168Y25/ref=cm_cr_dp_d_show_all_btm?ie=UTF8&reviewerType=all_reviews'
The spider automatically follows the "next page" links until the required number of items has been scraped.
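The pagination-following idea can be sketched with the standard library alone (the real spider uses Scrapy's selectors and `response.follow`; the `a-last` class used to locate Amazon's "Next page" link here is an assumption for illustration):

```python
from html.parser import HTMLParser


class NextLinkParser(HTMLParser):
    """Extract the href of the 'next page' pagination link.

    Assumes the link sits inside an <li class="a-last"> element,
    as on Amazon review pages at the time of writing.
    """

    def __init__(self):
        super().__init__()
        self.in_next_li = False
        self.next_url = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "li" and "a-last" in attrs.get("class", ""):
            self.in_next_li = True
        elif tag == "a" and self.in_next_li and self.next_url is None:
            self.next_url = attrs.get("href")

    def handle_endtag(self, tag):
        if tag == "li":
            self.in_next_li = False


def find_next_page(html):
    """Return the relative URL of the next review page, or None."""
    parser = NextLinkParser()
    parser.feed(html)
    return parser.next_url
```

In the spider, the equivalent of `find_next_page` feeds each discovered URL back into the crawl until the item limit is reached; when no next link is found, the crawl stops on its own.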
You can also play with the notebook notebooks/scraping.ipynb to see how the scraping works.
The documented analysis pipeline is in notebooks/analysis.ipynb.