Loken14's repositories
Chain-of-Spot
Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models
awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
AffineQuant
Official implementation of the ICLR 2024 paper AffineQuant
GLiNER
Generalist model for NER (Extract any entity types from texts)
FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
rag-pipelines
Advanced RAG Pipelines
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Awesome-LLM-with-RAG
[In Progress]
Show-1
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
LLaVA
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
Action_Video_Generation
Generating the different actions from an input video
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
text-to-video-synthesis-colab
Text To Video Synthesis Colab
Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
DevOpsGPT
Multi agent system for AI-driven software development. Combine LLM with DevOps tools to convert natural language requirements into working software. Supports any development language and extends the existing code.
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
pyVHR
Python framework for Virtual Heart Rate
PhysRecorder
A tool for recording high availability rPPG datasets
ELICIT
One-shot Implicit Animatable Avatars with Model-based Priors [ICCV 2023]
MMPD_rPPG_dataset
Here is Mobile Muti-domain Physiological Dataset collected by Tsinghua University.
X-Avatar
X-Avatar: Expressive Human Avatars (CVPR2023)
Diffusion-Models-in-Vision-A-Survey
This repository categorizes the papers about diffusion models applied in computer vision according to their target task. The classifcation is based on our survey: https://arxiv.org/abs/2209.04747v1
denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch