Jizhizi_Li's starred repositories

VisDiff

Official implementation of "Describing Differences in Image Sets with Natural Language" (CVPR 2024 Oral)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 63 | Issues: 0

Realtime_Multi-Person_Pose_Estimation

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 5072 | Issues: 0

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Language: Python | License: Apache-2.0 | Stargazers: 5162 | Issues: 0

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language: Shell | License: Apache-2.0 | Stargazers: 23943 | Issues: 0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 22136 | Issues: 0

StoryDiffusion

Create Magic Story!

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 5213 | Issues: 0

Omost

Your image is almost there!

Language: Python | License: Apache-2.0 | Stargazers: 6018 | Issues: 0

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language: Python | License: AGPL-3.0 | Stargazers: 24743 | Issues: 0

sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Language: Python | License: GPL-3.0 | Stargazers: 3777 | Issues: 0

deep_sort

Simple Online Realtime Tracking with a Deep Association Metric

Language: Python | License: GPL-3.0 | Stargazers: 5115 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 30584 | Issues: 0

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)

Language: Jupyter Notebook | Stargazers: 1562 | Issues: 0

streamlit

Streamlit — A faster way to build and share data apps.

Language: Python | License: Apache-2.0 | Stargazers: 32655 | Issues: 0

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 8146 | Issues: 0

IC-Light

More relighting!

Language: Python | License: Apache-2.0 | Stargazers: 3691 | Issues: 0

CLIP_prefix_caption

Simple image captioning model

Language: Jupyter Notebook | License: MIT | Stargazers: 1241 | Issues: 0

fastdup

fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. It helps you improve the quality of your dataset's images and labels and reduce your data operations costs at an unparalleled scale.

Language: Python | License: NOASSERTION | Stargazers: 1442 | Issues: 0

invisible-watermark

Python library for invisible image watermark (blind image watermark)

Language: Python | License: MIT | Stargazers: 1496 | Issues: 0

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language: Python | License: Apache-2.0 | Stargazers: 3580 | Issues: 0

openai-python

The official Python library for the OpenAI API

Language: Python | License: Apache-2.0 | Stargazers: 20705 | Issues: 0

fastapi

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Language: Python | License: MIT | Stargazers: 72347 | Issues: 0

LocalAI

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. It can generate text, audio, video, and images, and also supports voice cloning.

Language: C++ | License: MIT | Stargazers: 21010 | Issues: 0

Language: TypeScript | License: NOASSERTION | Stargazers: 717 | Issues: 0

photoshot

An open-source AI avatar generator web app - https://photoshot.app

Language: TypeScript | License: MIT | Stargazers: 3369 | Issues: 0

AnimatableGaussians

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

Language: Python | License: NOASSERTION | Stargazers: 786 | Issues: 0

awesome-digital-human

A collection of resources on digital human including clothed people digitalization, virtual try-on, and other related directions.

License: MIT | Stargazers: 1263 | Issues: 0

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Language: Python | License: MIT | Stargazers: 2892 | Issues: 0

github-profile-views-counter

It counts how many times your GitHub profile has been viewed. Free cloud micro-service.

Language: PHP | License: MIT | Stargazers: 3715 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 8602 | Issues: 0

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | Stargazers: 37117 | Issues: 0