Beast code in Giters

lrain-CN's starred repositories

marqo

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Language:PythonApache-2.0440500

4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Language:Jupyter NotebookNOASSERTION201400

AnimateZero

Official PyTorch implementation for the paper "AnimateZero: Video Diffusion Models are Zero-Shot Image Animators"

34500

MAG-Edit

MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance

Language:Python8200

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:Python283800

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonMIT388100

gaussian-splatting

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Language:PythonNOASSERTION1323400

ai副业赚钱大集合，教你如何利用ai做一些副业项目，赚取更多额外收益。The Ultimate Guide to Making Money with AI Side Hustles: Learn how to leverage AI for some cool side gigs and rake in some extra cash. Check out the English version for more insights.

1276700

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookNOASSERTION918500

VividTalk

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

Apache-2.075700

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonApache-2.01072200

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT573700

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookApache-2.0203400

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonBSD-2-Clause1055000

faster-whisper

Faster Whisper transcription with CTranslate2

Language:PythonMIT1093700

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonApache-2.0356300

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonApache-2.0278700

awesome-video-text-datasets

A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.

MIT2600

leptonai

A Pythonic framework to simplify AI service building

Language:PythonApache-2.0261200

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonAGPL-3.02512800

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Language:PythonApache-2.0707800

awesome-video-text-retrieval

A curated list of deep learning resources for video-text retrieval.

57100

pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.03116100

lrain-CN