davidfrisch / vv_cw_2

This repository contains the coursework for CW2 and CW3 of the COMP0103 Validation and Verification module. The coursework investigates how well large language models (LLMs) of different sizes can solve LeetCode problems in Java.

Coursework 3 - Validation and Verification

In this project, we investigate how well large language models (LLMs) of different sizes can solve LeetCode problems in Java. In particular, we are interested in whether smaller LLMs, which can run smoothly on a regular computer, can compete with larger ones. This GitHub repository provides the code for a system that handles all three parts of the problem: setting up the prompt for the LLM, getting the code snippet from its response, and checking that the returned code actually works.
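The three parts described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the function names (`build_prompt`, `extract_java`, `solve`), the prompt wording, and the `generate` and `check` callables are all hypothetical placeholders for whatever model client and Java test harness the pipeline really uses.

```python
import re
from typing import Callable, Optional

FENCE = "`" * 3  # Markdown code-fence delimiter, built here to keep the example readable


def build_prompt(title: str, description: str) -> str:
    """Part 1: turn a problem statement into a prompt (wording is hypothetical)."""
    return (
        f"Solve the LeetCode problem '{title}' in Java.\n"
        f"{description}\n"
        "Reply with a single fenced Java code block."
    )


def extract_java(response: str) -> Optional[str]:
    """Part 2: pull the first fenced code block out of the model's reply."""
    pattern = re.compile(
        FENCE + r"(?:java)?\s*\n(.*?)" + FENCE,
        re.DOTALL | re.IGNORECASE,
    )
    match = pattern.search(response)
    return match.group(1).strip() if match else None


def solve(
    title: str,
    description: str,
    generate: Callable[[str], str],  # assumed: sends a prompt to an LLM, returns its text
    check: Callable[[str], bool],    # assumed: compiles and tests the Java snippet
) -> bool:
    """Part 3: run the whole loop and report whether the snippet passed the check."""
    code = extract_java(generate(build_prompt(title, description)))
    return code is not None and check(code)
```

Passing `generate` and `check` in as callables keeps the sketch independent of any particular model or test runner, so models of different sizes could be swapped in without touching the rest of the loop.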

Setup

See the repository's setup guide for step-by-step instructions on setting up the project.

Run

source venv/bin/activate
cd pipeline
python3 main.py --model <model_name> --number <number_of_problems_to_solve> --verbose <True/False>
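For readers unfamiliar with how flags like these are usually handled, the following is a minimal sketch of parsing the three options with `argparse`. It is an assumption about how such a command line could be read, not the contents of the repository's `main.py`; the helper `str_to_bool`, the defaults, and the help texts are hypothetical.

```python
import argparse


def str_to_bool(value: str) -> bool:
    """Convert the literal strings True/False (case-insensitive) to a bool."""
    lowered = value.strip().lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected True or False, got {value!r}")


def parse_args(argv=None) -> argparse.Namespace:
    """Parse --model, --number and --verbose (defaults are assumptions)."""
    parser = argparse.ArgumentParser(
        description="Run an LLM on LeetCode problems (illustrative sketch)."
    )
    parser.add_argument("--model", required=True, help="name of the model to query")
    parser.add_argument("--number", type=int, default=1, help="number of problems to solve")
    parser.add_argument("--verbose", type=str_to_bool, default=False, help="True or False")
    return parser.parse_args(argv)
```

An explicit converter is used for `--verbose` because `type=bool` would treat any non-empty string, including `False`, as true.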


Languages

Java 74.2%, Jupyter Notebook 24.6%, Python 1.2%, Dockerfile 0.0%