Beast code in Giters

songcheng's starred repositories

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonMIT28353 281 1100

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonApache-2.010930 66 678

gallery-dl

Command-line program to download image galleries and collections from several image hosting sites

Language:PythonGPL-2.010731 140 4685

pdfminer.six

Community maintained fork of pdfminer - we fathom PDF

Language:PythonMIT5626 120 646

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.04245 60 167

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT3788 110 69

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language:PythonNOASSERTION2082 34 95

T-Rex

[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy

Language:PythonNOASSERTION1980 36 72

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonNOASSERTION1836 36 123

OpenSeeFace

Robust realtime face and facial landmark tracking on CPU with Unity integration

Language:PythonBSD-2-Clause1365 22 53

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Language:PythonApache-2.0979 42 38

GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Language:PythonMIT965 45 33

pyreft

ReFT: Representation Finetuning for Language Models

Language:PythonApache-2.0942 15 74

MakeItTalk

Language:Jupyter NotebookNOASSERTION940 27 91

search2ai

Help your LLMs online

Language:JavaScriptMIT938 11 25

thepipe

Extract markdown and images from URLs, PDFs, docs, slides, and more, ready for multimodal LLMs. ⚡

Language:PythonMIT814 8 17

Arc2Face

Arc2Face: A Foundation Model of Human Faces

Language:PythonMIT485 15 19

IDE-3D

[SIGGRAPH Asia 2022] IDE-3D: Interactive Disentangled Editing For High-Resolution 3D-aware Portrait Synthesis

Language:Jupyter Notebook472 19 22

clifs

Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP

Language:JavaScriptApache-2.0432 4 11

Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Language:Python430 11 35

GRiT

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Language:PythonMIT284 2 18

ST-LLM

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Language:PythonApache-2.076 7 16

LipFD

This repository contains the codes of "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes".

Language:Python61 3 7

FreeTalker

Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 2024)

Language:Python49 5 2

CharacterGen

[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

Language:JavaScriptAGPL-3.033 4 4

ubisoft-laforge-FFHQ-UV-Intrinsics

FFHQ-UV-Intrinstics: A dataset containing intrinsic face decomposition for 10k subjects of FFHQ-UV

NOASSERTION26 7 2

AnyPathLib

Language:PythonApache-2.01700

lip-synthesis

Audio-Visual Lip Synthesis via Intermediate Landmark Representation

Language:Python12 20

Video2ARKitBlendshapes

Video to ARKit BlendShapes

Language:Python500

perm

Official implementation of "Perm: A Parametric Hair Model for Multi-Style 3D Hair Generation"

200