deluair / OpenCodeInterpreter

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

OpenCodeInterpreter

[🏠Homepage] | [🛠️Code]


🌟 Upcoming Features

  • 💡 Open Sourcing OpenCodeInterpreter-GM-7b Model with gemma-7b Base;

  • 🚀 Deploying Demo on HuggingFace Spaces;

  • 🛠️ Open Sourcing Demo Local Deployment Code with a Setup Guide

🔔News

✨[2024-02-26]: We have open-sourced the OpenCodeInterpreter-DS-1.3b Model.

📘[2024-02-26]: We have open-sourced the CodeFeedback-Filtered-Instruction Dataset.

🚀[2024-02-23]: We have open-sourced the datasets used in our project named Code-Feedback.

🔥[2024-02-19]: We have open-sourced all models in the OpenCodeInterpreter series! We welcome everyone to try out our models and look forward to your participation! 😆

Introduction

OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
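The generate, execute, and refine cycle described above can be illustrated with a minimal sketch. This is not the project's implementation: `run_code`, `refine_loop`, and `toy_generate` are hypothetical names, and `toy_generate` is a hard-coded stand-in for a model call. The sketch uses plain `exec` for brevity; a real system would run generated code in a sandbox.

```python
import contextlib
import io
import traceback
from typing import Callable, List, Optional, Tuple

# (code, succeeded, stdout or error message) for one turn
Turn = Tuple[str, bool, str]


def run_code(source: str) -> Tuple[bool, str]:
    """Execute source in a fresh namespace and capture stdout or the error."""
    buffer = io.StringIO()
    try:
        with contextlib.redirect_stdout(buffer):
            exec(source, {"__name__": "__sketch__"})
    except Exception:
        # Keep only the final traceback line as compact feedback.
        return False, traceback.format_exc().strip().splitlines()[-1]
    return True, buffer.getvalue()


def refine_loop(
    generate: Callable[[str, Optional[str]], str],
    task: str,
    max_turns: int = 3,
) -> List[Turn]:
    """Ask for code, run it, and feed any error back until it succeeds."""
    feedback: Optional[str] = None
    history: List[Turn] = []
    for _ in range(max_turns):
        code = generate(task, feedback)
        ok, output = run_code(code)
        history.append((code, ok, output))
        if ok:
            break
        feedback = output  # the execution error drives the next attempt
    return history


def toy_generate(task: str, feedback: Optional[str]) -> str:
    """Hypothetical stand-in for a model: buggy first, fixed after feedback."""
    if feedback is None:
        return "print(total([1, 2, 3]))"  # fails: 'total' is undefined
    return "print(sum([1, 2, 3]))"


if __name__ == "__main__":
    for code, ok, output in refine_loop(toy_generate, "sum a list"):
        print(ok, code, output.strip())
```

The first turn fails with a `NameError`, which is passed back as feedback, and the second turn succeeds. In a real setting, human feedback could be appended to the same feedback channel.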

Models

All models within the OpenCodeInterpreter series have been open-sourced on Hugging Face. You can access our models via the following link: OpenCodeInterpreter Models.

Data Collection

Supported by Code-Feedback, a dataset featuring 68K multi-turn interactions, OpenCodeInterpreter incorporates execution and human feedback for dynamic code refinement. For details on the data collection procedures, see the README provided under Data Collection.

Evaluation

Our evaluation framework primarily utilizes HumanEval and MBPP, alongside their extended versions, HumanEval+ and MBPP+, leveraging the EvalPlus framework for a more comprehensive assessment. For specific evaluation methodologies, please refer to the Evaluation README.
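Benchmarks of this kind score a solution by functional correctness: the generated code is run against unit tests and counts as solved only if every test passes. The sketch below illustrates that idea only. It is not the EvalPlus API; the `candidate` and `tests` fields and the toy problems are hypothetical.

```python
from typing import Dict, List


def passes_tests(candidate: str, test_code: str) -> bool:
    """Return True if the candidate and its assert-based tests run cleanly."""
    namespace: Dict[str, object] = {}
    try:
        exec(candidate, namespace)  # define the candidate function(s)
        exec(test_code, namespace)  # a failing assert raises AssertionError
    except Exception:
        return False
    return True


def pass_rate(problems: List[Dict[str, str]]) -> float:
    """Fraction of problems whose single candidate passes all of its tests."""
    if not problems:
        return 0.0
    passed = sum(passes_tests(p["candidate"], p["tests"]) for p in problems)
    return passed / len(problems)


# Hypothetical toy problems: the first candidate is correct, the second is not.
TOY_PROBLEMS = [
    {
        "candidate": "def add(a, b):\n    return a + b\n",
        "tests": "assert add(2, 3) == 5\nassert add(-1, 1) == 0\n",
    },
    {
        "candidate": "def is_even(n):\n    return n % 2 == 1\n",
        "tests": "assert is_even(4)\nassert not is_even(3)\n",
    },
]

if __name__ == "__main__":
    print(pass_rate(TOY_PROBLEMS))
```

With one candidate per problem, this fraction corresponds to a pass@1-style score. Extended suites such as HumanEval+ and MBPP+ tighten the same check by supplying more tests per problem.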

Contact

If you have any inquiries, please feel free to raise an issue or reach out to us via email at: xiangyue.work@gmail.com, zhengtianyu0428@gmail.com. We're here to assist you!

About

License: Apache License 2.0


Languages

Python 95.9%, Shell 4.0%, Dockerfile 0.1%