김태민's starred repositories
Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, with links to code and related websites
mental-health-datasets
An evolving list of electronic media data sets used to model mental-health status.
Neural-IMage-Assessment
A PyTorch Implementation of Neural IMage Assessment
search-agents
Code for the paper 🌳 Tree Search for Language Model Agents
audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
text2image-benchmark
Benchmark for generative image models
Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
Time-Series-Library
A Library for Advanced Deep Time Series Models.
level2-3-cv-finalproject-cv-08
level2-3-cv-finalproject-cv-08 created by GitHub Classroom
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
attention-map
🚀 Cross attention map tools for huggingface/diffusers
level3_cv_finalproject-cv-12
level3_cv_finalproject-cv-12 created by GitHub Classroom
awesome-vision-language-modeling
Recent Advances in Vision-Language Pre-training!
level3_nlp_finalproject-nlp-02
level3_nlp_finalproject-nlp-02 created by GitHub Classroom
awesome-diffusion-categorized
A collection of diffusion model papers categorized by their subareas
google-research
Google Research
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
MetaTransformer
Meta-Transformer for Unified Multimodal Learning