PeterPham's repositories

applied-llm

Everything about LLMs in production.

License:MITStargazers:1Issues:0Issues:0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

License:MITStargazers:1Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

License:Apache-2.0Stargazers:1Issues:0Issues:0

garfield

[CVPR'24] Group Anything with Radiance Fields

License:MITStargazers:1Issues:0Issues:0

HSIConvKAN

How to Learn More? Exploring the Possibility of Kolmogorov-Arnold Networks for Hyperspectral Image Classification

License:MITStargazers:1Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:1Issues:0Issues:0

LongVA

Long Context Transfer from Language to Vision

License:Apache-2.0Stargazers:1Issues:0Issues:0

MimicBrush

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

License:Apache-2.0Stargazers:1Issues:0Issues:0

textgrad

Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

License:MITStargazers:1Issues:0Issues:0

transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

License:Apache-2.0Stargazers:1Issues:0Issues:0

videollm-online

VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

BentoBLIP

how to build an image captioning application on top of a BLIP model with BentoML

Stargazers:0Issues:0Issues:0

CosmicMan

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

Stargazers:0Issues:0Issues:0

DDMI

Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Representations", ICLR 2024

License:MITStargazers:0Issues:0Issues:0

Depth-Anything-V2

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

grokfast-pytorch

Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"

License:MITStargazers:0Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

License:MITStargazers:0Issues:0Issues:0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

License:Apache-2.0Stargazers:0Issues:0Issues:0

MultiPly

MultiPly: Reconstruction of Multiple People from Monocular Video in the Wild (CVPR2024 Oral)

Stargazers:0Issues:0Issues:0

MV-VTON

MV-VTON: Multi-View Virtual Try-On with Diffusion Models

Stargazers:0Issues:0Issues:0

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

Stargazers:0Issues:0Issues:0

OpenYOLO3D

Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method in literature.

Stargazers:0Issues:0Issues:0

RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

Stargazers:0Issues:0Issues:0

SMILE-Dataset

[NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models"

Stargazers:0Issues:0Issues:0

top-cvpr-2024-papers

This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]

License:CC0-1.0Stargazers:0Issues:0Issues:0

typer

Typer, build great CLIs. Easy to code. Based on Python type hints.

License:MITStargazers:0Issues:0Issues:0

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Stargazers:0Issues:0Issues:0