Xu Luo (Frankluox)

Company: UESTC

Location: Chengdu, China

Home Page: https://frankluox.github.io/

Twitter: @lux77215832

Xu Luo's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python | License: MIT | Stargazers: 29604 | Issues: 189 | Issues: 974

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 28827 | Issues: 276 | Issues: 1187

ChatTTS

A generative speech model for daily dialogue.

Language: Python | License: AGPL-3.0 | Stargazers: 28099 | Issues: 167 | Issues: 405

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Language: Python | License: MIT | Stargazers: 12090 | Issues: 90 | Issues: 340

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 8044 | Issues: 75 | Issues: 308

Omost

Your image is almost there!

Language: Python | License: Apache-2.0 | Stargazers: 6942 | Issues: 44 | Issues: 68

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 6856 | Issues: 50 | Issues: 597

lerobot

🤗 LeRobot: End-to-end Learning for Real-World Robotics in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 4527 | Issues: 51 | Issues: 68

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language: Python | License: Apache-2.0 | Stargazers: 4454 | Issues: 52 | Issues: 138

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4152 | Issues: 46 | Issues: 252

schedule_free

Schedule-Free Optimization in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 1712 | Issues: 17 | Issues: 25

penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Language: Python | License: Apache-2.0 | Stargazers: 1566 | Issues: 18 | Issues: 12

dora

DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low-latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.

Language: Rust | License: Apache-2.0 | Stargazers: 1376 | Issues: 27 | Issues: 119

mujoco_menagerie

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1164 | Issues: 26 | Issues: 57

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | Stargazers: 882 | Issues: 19 | Issues: 68

prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Language: Python | License: Apache-2.0 | Stargazers: 697 | Issues: 3 | Issues: 21

PufferLib

Simplifying reinforcement learning for complex game environments

Language: Python | License: MIT | Stargazers: 663 | Issues: 3 | Issues: 9

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python | License: Apache-2.0 | Stargazers: 638 | Issues: 11 | Issues: 29

ManiSkill

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

Language: Python | License: Apache-2.0 | Stargazers: 594 | Issues: 17 | Issues: 191

loco-mujoco

Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.

Language: Python | License: MIT | Stargazers: 487 | Issues: 8 | Issues: 29

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

TeleVision

Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Language: Python | License: NOASSERTION | Stargazers: 428 | Issues: 4 | Issues: 3

Glyph-ByT5

[ECCV2024] This is the official inference code for the papers "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering"

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 413 | Issues: 15 | Issues: 15

robocasa

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Language: Python | License: NOASSERTION | Stargazers: 400 | Issues: 9 | Issues: 33

starcoder2-self-align

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Language: Python | License: Apache-2.0 | Stargazers: 212 | Issues: 5 | Issues: 5

CV-VAE

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language: Jupyter Notebook | Stargazers: 182 | Issues: 14 | Issues: 9

awesome-robot-learning-envs

A list of awesome and popular robot learning environments

android_world

AndroidWorld is an environment and benchmark for autonomous agents

Language: Python | License: Apache-2.0 | Stargazers: 60 | Issues: 3 | Issues: 0

MATH-V

MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.

Language: Python | License: MIT | Stargazers: 34 | Issues: 1 | Issues: 1