김태민's starred repositories
google-research
Google Research
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
Time-Series-Library
A Library for Advanced Deep Time Series Models.
Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
MetaTransformer
Meta-Transformer for Unified Multimodal Learning
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
Neural-IMage-Assessment
A PyTorch Implementation of Neural IMage Assessment
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
attention-map
🚀 Cross attention map tools for huggingface/diffusers
Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
text2image-benchmark
Benchmark for generative image models
awesome-vision-language-modeling
Recent Advances in Vision-Language Pre-training!
level2-3-cv-finalproject-cv-08
level2-3-cv-finalproject-cv-08 created by GitHub Classroom
level3_nlp_finalproject-nlp-02
level3_nlp_finalproject-nlp-02 created by GitHub Classroom
level3_cv_finalproject-cv-12
level3_cv_finalproject-cv-12 created by GitHub Classroom