Camilo Fosco's starred repositories

guidance

A guidance language for controlling large language models.

Language:Jupyter NotebookLicense:MITStargazers:17858Issues:117Issues:474

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:9420Issues:101Issues:320

cortex

Production infrastructure for machine learning at scale

Language:GoLicense:Apache-2.0Stargazers:7998Issues:145Issues:1098

tenacity

Retrying library for Python

Language:PythonLicense:Apache-2.0Stargazers:6129Issues:48Issues:247

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5168Issues:78Issues:103

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonLicense:MITStargazers:4287Issues:120Issues:52

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:3258Issues:36Issues:209

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:2973Issues:30Issues:375

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2570Issues:27Issues:155

cohere-toolkit

Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.

Language:TypeScriptLicense:MITStargazers:2290Issues:25Issues:23

graph-of-thoughts

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Language:PythonLicense:NOASSERTIONStargazers:1915Issues:20Issues:19

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1766Issues:21Issues:80

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language:PythonLicense:AGPL-3.0Stargazers:1613Issues:24Issues:133

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1521Issues:10Issues:124

EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Language:Jupyter NotebookLicense:MITStargazers:1503Issues:19Issues:238

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

text-to-video-synthesis-colab

Text To Video Synthesis Colab

Language:Jupyter NotebookLicense:UnlicenseStargazers:1408Issues:23Issues:24

style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Language:PythonLicense:Apache-2.0Stargazers:1102Issues:23Issues:23

Show-1

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Language:PythonLicense:NOASSERTIONStargazers:1071Issues:39Issues:19

Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Language:PythonLicense:Apache-2.0Stargazers:967Issues:13Issues:42

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonLicense:MITStargazers:444Issues:10Issues:13

MindVideo

Official code base for MinD-Video

bom

A utility to generate SPDX-compliant Bill of Materials manifests

Language:GoLicense:Apache-2.0Stargazers:305Issues:11Issues:72

prisma-lambda-cdk

Build and deploy a Lambda function with Prisma ORM by AWS Cloud Development Kit.

Language:TypeScriptLicense:MIT-0Stargazers:87Issues:10Issues:4

awesome-text-based-image-manipulation

A curated list of text-based image manipulation methods.

Language:PythonLicense:CC0-1.0Stargazers:70Issues:2Issues:0

awesome-vision-and-language-pretraining

A curated list of vision-and-language pre-training (VLP). :-)