taemin6697

김태민's starred repositories

google-research

Google Research

Language:Jupyter NotebookApache-2.033595 750 1226

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.024564 191 3896

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.021187 179 435

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION8147 100 85

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT5761 178 15

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language:PythonMIT5546 62 434

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language:Jupyter NotebookMIT4053 128 28

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause3757 43 412

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonApache-2.03551 31 476

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonApache-2.01480 22 65

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonBSD-3-Clause1395 22 39

style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Language:PythonApache-2.01177 23 25

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT1126 20 48

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

1030 52 14

Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

MIT802 46 13

PandaGPT

[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All

Language:PythonApache-2.0743 11 26

Neural-IMage-Assessment

A PyTorch Implementation of Neural IMage Assessment

Language:PythonNOASSERTION517 5 37

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language:Jupyter NotebookApache-2.0470 12 37

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookApache-2.0410 16 41

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Language:PythonMIT283 5 9

search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

Language:PythonMIT111 3 3

attention-map

🚀 Cross attention map tools for huggingface/diffusers

Language:PythonMIT93 3 5

Building-llama3-from-scratch

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

Language:Jupyter Notebook72 1 2

text2image-benchmark

Benchmark for generative image models

Language:Jupyter NotebookMIT45 10

EmoBench

This is the official repository for the paper "EmoBench: Evaluating the Emotional Intelligence of Large Language Models"

Language:PythonMIT31 40

awesome-vision-language-modeling

Recent Advances in Vision-Language Pre-training!

26 10

level2-3-cv-finalproject-cv-08

level2-3-cv-finalproject-cv-08 created by GitHub Classroom

Language:Python7 1 1

level3_nlp_finalproject-nlp-02

level3_nlp_finalproject-nlp-02 created by GitHub Classroom

Language:Python5 1 24

level3_cv_finalproject-cv-12

level3_cv_finalproject-cv-12 created by GitHub Classroom

Language:Jupyter Notebook4020

HansungGPT

Language:Jupyter Notebook1 10