YuxiangJohn

YuxiangJohn

Geek Repo

Company:JD.com

Location:Beijing

Github PK Tool:Github PK Tool

YuxiangJohn's starred repositories

metahuman-stream

Real time interactive streaming digital human

Language:PythonLicense:Apache-2.0Stargazers:2451Issues:0Issues:0

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:2931Issues:0Issues:0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8136Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:3692Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5267Issues:0Issues:0

X-Pose

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Language:PythonLicense:NOASSERTIONStargazers:332Issues:0Issues:0
Stargazers:30Issues:0Issues:0

BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

Language:PythonLicense:MITStargazers:356Issues:0Issues:0

MInference

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Language:PythonLicense:MITStargazers:609Issues:0Issues:0
Language:MATLABLicense:GPL-3.0Stargazers:10312Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:6049Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:57Issues:0Issues:0

LivePortrait

Bring portraits to life!

Language:PythonLicense:NOASSERTIONStargazers:9309Issues:0Issues:0

GlyphDraw2

GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models

Language:PythonLicense:MITStargazers:19Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:26486Issues:0Issues:0

cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Language:PythonLicense:Apache-2.0Stargazers:1623Issues:0Issues:0

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7762Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:4592Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:28383Issues:0Issues:0

swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:2706Issues:0Issues:0

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language:PythonLicense:NOASSERTIONStargazers:415Issues:0Issues:0

WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

Language:Jupyter NotebookLicense:MITStargazers:258Issues:0Issues:0

InstantText

Give the gift of rendering text to SDXL

Stargazers:9Issues:0Issues:0

InstantID-Rome

Improved InstantID 🔥

Stargazers:153Issues:0Issues:0

GlyphControl-release

[NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"

Language:PythonLicense:MITStargazers:196Issues:0Issues:0

ShareGPT4Video

An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Language:PythonStargazers:1189Issues:0Issues:0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1517Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:102Issues:0Issues:0
Language:PythonLicense:MITStargazers:21Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:140Issues:0Issues:0