Gueter Josmy Faure's starred repositories
anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Awesome-Transformer-Attention
A comprehensive paper list on Vision Transformers and attention, including papers, code, and related websites
Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Awesome-Text-to-Image
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
StreamMultiDiffusion
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning. CVPR 2020
AlphAction
Spatio-Temporal Action Localization System
TransformerCompression
For releasing code related to compression methods for transformers, accompanying our publications
unmasked_teacher
[ICCV 2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
videoCC-data
VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automatic pipeline starting from the Conceptual Captions Image-Captioning Dataset.
Tensorflow-JS-Projects
Web projects using TensorFlow.js, Plotly, D3, ECharts, NumJS, and NumericJS