will-thompson-k / neurips_llm_efficiency_challenge

NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Neurips 1 LLM 1 GPU Challenge

This repository contains a toy submission for the NeurIPS 1 LLM 1 GPU Competition. It provides a simple implementation that serves as a starting point for participants to understand the problem and build their own solutions.

Contents

Submission

The submission in this repository is a basic implementation of the setting up a HTTP server in accordance to the open_api spec. It includes a sample solution built off of Lit-GPT and open-llama weights that participants can reference or modify as they see fit.

Usage

You can use the provided code as a reference or starting point for your own implementation. The main.py file contains the simple FastAPI server, and you can modify it to suit your needs.

OpenAPI Specification

The openapi.json file in this repository contains the OpenAPI specification for the Competition API. Competitors can use this specification to understand the API endpoints, request and response structures, and overall requirements for interacting with the competition platform.

The OpenAPI specification provides a standardized way to describe the API, making it easier for competitors to develop their own solutions and integrate them seamlessly with the competition infrastructure.

About

NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day


Languages

Language:Python 84.4%Language:Dockerfile 15.6%