CS330 Homedepot AI Final Project
I worked as an associate at Home Depot over the summer, and I found it challenging when customers wanted to know the difference between two products. While we have signage, it isn't specific enough. I found that I needed to read the descriptions of both products, then compare and summarize them for the customer. This program is designed to help with that task, making my job more efficient and the experience better for the customer.
This program uses a variety of AI models to compare two products on homedepot.com and then gives a summary of the products. It has options to choose between several models and to change the settings of each one.
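As a rough illustration of the comparison step, here is a minimal sketch of how a prompt comparing two product descriptions might be assembled before being sent to a model. This is not the project's actual code; the class and method names are hypothetical.

```java
// Hypothetical sketch: build a comparison prompt from two product
// descriptions scraped from homedepot.com. Names are illustrative only.
public class ComparisonPrompt {
    public static String build(String productA, String productB) {
        return "Compare the following two products. "
                + "List the key differences as bullet points, then give a short summary.\n\n"
                + "Product 1:\n" + productA + "\n\n"
                + "Product 2:\n" + productB;
    }

    public static void main(String[] args) {
        // Example descriptions, shortened for illustration
        String prompt = build(
                "20 in. 40V brushless push lawn mower",
                "21 in. 40V brushless self-propelled lawn mower");
        System.out.println(prompt);
    }
}
```

The same prompt text can then be sent either to the OpenAI API or to a local llama.cpp model, which is what makes swapping models in the settings panel straightforward.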
If I were to continue this project, I would add caching, so two products that have already been compared would not need to be compared again, and a way to compare more than two products at once. I would also create a frontend phone app and turn the backend into a server, so it could be used by other employees.
Team Members
- Eric Golde
Example Videos
This project takes a lot of disk space and can be a pain to get to compile properly. For convenience, these videos showcase how the project works with each of the given models.
| Model Name | Example Video |
|---|---|
| GPT 3.5 Turbo | |
| GPT 4 | |
| LLAMA 7B | |
| LLAMA 13B | |
| Wizard 13B | |
| Orca 2 7B | |
| Orca 2 13B | |
| Openhermes 2.5 Mistral 7B | |
Installing & Running
Installing
- Clone this repo
- In the llama.cpp folder:
  - Clone the repo
  - Build it following the build instructions on the repo (don't forget to enable GPU!)
- In the models folder:
  - Clone all the models you want to use
  - Convert your models to fp16 using the steps below
  - Convert your fp16 models to q4_0 using the steps below
- If you are using GPT 3.5 or 4, put your API key in the `api_key.txt` file in the `models/GPT` folder
- Import the project into IntelliJ
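To show the idea behind the `api_key.txt` step, here is a minimal sketch of loading the key from that file. The real project may read it differently; the class name and error handling here are illustrative assumptions.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Minimal sketch: load the OpenAI API key from models/GPT/api_key.txt.
// Class name and behavior are illustrative, not the project's actual code.
public class ApiKeyLoader {
    public static String load(Path keyFile) throws IOException {
        // trim() drops any trailing newline left behind by a text editor
        return Files.readString(keyFile).trim();
    }

    public static void main(String[] args) throws IOException {
        Path file = Path.of("models", "GPT", "api_key.txt");
        if (Files.exists(file)) {
            // Avoid printing the key itself anywhere it could leak
            System.out.println("Loaded key of length " + load(file).length());
        } else {
            System.err.println("Missing " + file + " - create it and paste your key in.");
        }
    }
}
```

Keeping the key in a separate file (rather than in source) is what makes the `--assume-unchanged` trick in the Contributing section work.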
Running
- Run the project
- Find two products on homedepot.com, or use the ones pre-populated in the search bar
- Click the search button
- Wait for the results to come in
- Select the AI you want to use in the settings panel
- Change any other settings you want per model (Default settings work well for most models)
- Click the start button
- Wait for the magic to happen
Downloading the models
Models to clone
Downloading
In a Git terminal:

```
git lfs install
git clone <link to repo>
```
Converting to FP16
In Windows Subsystem for Linux:

```
python3 -m pip install -r requirements.txt
python3 convert.py models/<model name>/
```
Converting to q4_0
In the Windows terminal:

```
./quantize ./models/<model name>/ggml-model-f16.gguf ./models/<model name>/ggml-model-q4_0.gguf q4_0
```

Now you can delete all the files in the models folder except the q4_0 model.
Results & Summary
For my testing, I compared products 317061072 and 314600837. I chose these two because they are similar lawn mowers from the same manufacturer, which is a very real-world example of what customers asked me to compare as an associate.
I evaluated the speed and quality of the results for each model. Quality is a subjective judgment of how correct the output was and how useful I felt the results were.
Each model's settings are the defaults in the program.
GPT 3.5 Turbo
GPT 3.5 Turbo worked very well at this task. It was able to give very detailed answers, and was very good at staying on topic.
This was the fastest model to run.
GPT 4
GPT 4 worked very well at this task. It was able to give very detailed answers, and was very good at staying on topic, but it was sometimes overly verbose in its descriptions.
It was also slow, but not as slow as the local models.
LLAMA 7B
LLAMA 7B did not work well at all. It would either make things up or just talk about random facts and ignore any user input.
LLAMA 13B
LLAMA 13B did not work well at all. It would either make things up or just talk about random facts and ignore any user input. I did not see a difference between LLAMA 13B and LLAMA 7B.
Wizard 13B
Wizard 13B worked very well at this task. It was able to give very detailed answers, and was very good at staying on topic.
Orca 2 7B
Orca 2 7B worked very well at this task. It was able to give very detailed answers, and was very good at staying on topic.
Orca 2 was the fastest local model to run.
Orca 2 13B
Orca 2 13B worked very well at this task. It was able to give very detailed answers, and was very good at staying on topic. I can't tell the difference between Orca 2 7B and Orca 2 13B besides 13B being slower.
Openhermes 2.5 mistral 7B
This model worked, but it did not give bullet points and a summary; it gave only a summary of both products, which is not exactly what it was asked to do. Other than that, it worked well.
What I learned
- GPT 3.5 Turbo is the best and fastest model for this task
- Orca 2 is the best local model for this task
- Every LLAMA model is bad at this task
- This is not a very simple task for an AI to accomplish, despite being a simple task on paper.
In class presentation
You can find the presentation I gave in class below:
Basic Troubleshooting
- If the program crashes or hangs, you may need to kill the `main.exe` process in Task Manager.
- If llama.cpp fails to run, try running it outside of IntelliJ with the command `main.exe` in the `llama.cpp` folder.
- If the model fails to convert to fp16 due to a token error, try editing the model settings in the model's JSON file to specify the correct tokens.
Contributing
Don't accidentally commit your GPT API key!
I already did that once and had to reset it.
To keep git from picking up changes to the `models/GPT/api_key.txt` file, run `git update-index --assume-unchanged models/GPT/api_key.txt` in Git Bash.
If you do commit it, you can reset it at https://platform.openai.com/api-keys.