Jia LI's starred repositories

video2dataset

Easily create large video datasets from video URLs

Language: Python | License: MIT | Stargazers: 502 | Issues: 0

Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Language: Python | Stargazers: 159 | Issues: 0

MetaMath-AIMO

https://www.kaggle.com/competitions/ai-mathematical-olympiad-prize/leaderboard

Stargazers: 3 | Issues: 0

client-python

Python client library for the Mistral AI platform

Language: Python | License: Apache-2.0 | Stargazers: 415 | Issues: 0

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Language: Python | Stargazers: 1008 | Issues: 0

bigcodebench

BigCodeBench: The Next Generation of HumanEval

Language: Python | License: Apache-2.0 | Stargazers: 129 | Issues: 0

DeepSeek-Coder-V2

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

License: MIT | Stargazers: 1362 | Issues: 0
Language: Python | Stargazers: 420 | Issues: 0

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 4424 | Issues: 0

BunkaTopics

🗺️ Data Cleaning and Textual Data Visualization 🗺️

Language: Python | License: MIT | Stargazers: 122 | Issues: 0

continue

⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains

Language: TypeScript | License: Apache-2.0 | Stargazers: 13483 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 2479 | Issues: 0

llm-decontaminator

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Language: Python | License: Apache-2.0 | Stargazers: 179 | Issues: 0

code-interpreter

Python & JS/TS SDK for adding code interpreting to your AI app

Language: Python | License: Apache-2.0 | Stargazers: 882 | Issues: 0

mistral

Workflow Service for OpenStack. Mirror of code maintained at opendev.org.

Language: Python | License: Apache-2.0 | Stargazers: 279 | Issues: 0

Convolutional-KANs

This project extends the architecture of Kolmogorov-Arnold Networks (KAN) to convolutional layers, replacing the classic linear transformation of the convolution with learnable nonlinear activations at each pixel.

Language: Jupyter Notebook | License: MIT | Stargazers: 635 | Issues: 0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License: MIT | Stargazers: 2981 | Issues: 0
Language: Python | License: MIT | Stargazers: 67 | Issues: 0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 28697 | Issues: 0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Language: Python | License: MIT | Stargazers: 479 | Issues: 0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4231 | Issues: 0

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: Apache-2.0 | Stargazers: 10922 | Issues: 0

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python | License: MIT | Stargazers: 11971 | Issues: 0

ai-codereviewer

AI Code Reviewer: Enhance your GitHub workflow with AI-powered code review! Get intelligent feedback and suggestions on pull requests using OpenAI's GPT-4 API, improving code quality and saving developers time.

Language: TypeScript | License: MIT | Stargazers: 477 | Issues: 0

llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Language: Jupyter Notebook | License: MIT | Stargazers: 1034 | Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20699 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 450 | Issues: 0
Language: Python | Stargazers: 10 | Issues: 0

LeanCopilot

LLMs as Copilots for Theorem Proving in Lean

Language: C++ | License: MIT | Stargazers: 857 | Issues: 0

verbose-lean4

Natural language tactics to teach mathematics using Lean 4

Language: Lean | License: Apache-2.0 | Stargazers: 45 | Issues: 0