Beast code in Giters

cseti007's starred repositories

ComfyUI-FluxTrainer

Language:PythonApache-2.011900

ComfyUI_ProPainter_Nodes

🖌️ ComfyUI implementation of ProPainter framework for video inpainting.

Language:PythonGPL-3.020300

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonMIT6635600

SenseVoice

Multilingual Voice Understanding Model

Language:PythonNOASSERTION221300

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonNOASSERTION560000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03288400

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonApache-2.0408400

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。

Language:PythonAGPL-3.0934800

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonMIT837800

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookApache-2.0568800

ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)

Language:PythonMIT203200

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language:PythonApache-2.087400

krita-ai-diffusion

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Language:PythonGPL-3.0613900

stable-diffusion-webui-extensions

Extension index for stable-diffusion-webui

47200

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language:PythonMIT427500

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonApache-2.0279400

sd-webui-lcm

Latent Consistency Model for AUTOMATIC1111 Stable Diffusion WebUI

Language:PythonMIT61200

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.01885500

Stable-Diffusion-WebUI-TensorRT

TensorRT Extension for Stable Diffusion Web UI

Language:PythonMIT188100

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language:C++Apache-2.01046900

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonApache-2.01011900

fast-stable-diffusion

fast-stable-diffusion + DreamBooth

Language:PythonMIT745700

sd-webui-deforum

Deforum extension for AUTOMATIC1111's Stable Diffusion webui

Language:PythonNOASSERTION265000