艾梦's starred repositories

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 26671 | Issues: 0

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language: Cuda | License: MIT | Stargazers: 299 | Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python | License: Apache-2.0 | Stargazers: 4234 | Issues: 0

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language: Python | License: Apache-2.0 | Stargazers: 2312 | Issues: 0

onnx-web

web UI for GPU-accelerated ONNX pipelines like Stable Diffusion, even on Windows and AMD

Language: Python | License: MIT | Stargazers: 179 | Issues: 0

VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 6930 | Issues: 0

onnx-modifier

A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.

Language: JavaScript | License: MIT | Stargazers: 1150 | Issues: 0

unsloth

Finetune Llama 3, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Language: Python | License: Apache-2.0 | Stargazers: 10707 | Issues: 0
Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 166 | Issues: 0

BitMat

An efficient implementation of the method proposed in "The Era of 1-bit LLMs"

Language: Python | License: Apache-2.0 | Stargazers: 140 | Issues: 0

search4all

Personal AI search copilot, open-source Perplexity

Language: Python | License: Apache-2.0 | Stargazers: 631 | Issues: 0

JetMoE

Reaching LLaMA2 Performance with 0.1M Dollars

Language: Python | License: Apache-2.0 | Stargazers: 936 | Issues: 0
Language: Python | Stargazers: 31 | Issues: 0

openui

OpenUI lets you describe UI using your imagination, then see it rendered live.

Language: HTML | License: Apache-2.0 | Stargazers: 15485 | Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3591 | Issues: 0

pyvideotrans

Translate the video from one language to another and add dubbing.

Language: Python | License: GPL-3.0 | Stargazers: 5979 | Issues: 0

StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Language: Python | License: MIT | Stargazers: 257 | Issues: 0
Stargazers: 218 | Issues: 0

SingDiffusion

[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models

Language: Python | Stargazers: 52 | Issues: 0

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 1040 | Issues: 0

LLM4Decompile

Reverse Engineering: Decompiling Binary Code with Large Language Models

Language: Python | License: MIT | Stargazers: 2584 | Issues: 0

Portrait-4D

Portrait4D: Learning One-Shot 4D Head Avatar Synthesis using Synthetic Data (CVPR 24)

Language: Python | License: MIT | Stargazers: 139 | Issues: 0

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language: Python | License: Apache-2.0 | Stargazers: 3294 | Issues: 0

GaussianPro

[ICML2024] Official code for GaussianPro: 3D Gaussian Splatting with Progressive Propagation

Language: Python | License: MIT | Stargazers: 517 | Issues: 0

PyTorch-SVGRender

SVG Differentiable Rendering: Generating vector graphics using neural networks.

Language: Python | License: MPL-2.0 | Stargazers: 80 | Issues: 0

ScoreHMR

ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)

Language: Python | License: MIT | Stargazers: 346 | Issues: 0

full-stack-fastapi-template

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

Language: TypeScript | License: MIT | Stargazers: 23693 | Issues: 0

DCNv4

[CVPR 2024] Deformable Convolution v4

Language: Python | License: MIT | Stargazers: 359 | Issues: 0

FaceTalk

[CVPR 2024] FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models

Language: Shell | License: NOASSERTION | Stargazers: 142 | Issues: 0

qa-lora

Official PyTorch implementation of QA-LoRA

Language: Python | License: MIT | Stargazers: 93 | Issues: 0