taemin6697

김태민's starred repositories

Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

BSD-3-Clause262700

mental-health-datasets

An evolving list of electronic media data sets used to model mental-health status.

Language:Python36300

EmoBench

This is the official repository for the paper "EmoBench: Evaluating the Emotional Intelligence of Large Language Models"

Language:PythonMIT3100

Neural-IMage-Assessment

A PyTorch Implementation of Neural IMage Assessment

Language:PythonNOASSERTION51700

search-agents

Code for the paper 🌳 Tree Search for Language Model Agents

Language:PythonMIT11100

audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Language:PythonMIT28300

text2image-benchmark

Benchmark for generative image models

Language:Jupyter NotebookMIT4500

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language:Jupyter NotebookMIT405300

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonMIT112700

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonApache-2.0355100

Building-llama3-from-scratch

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

Language:Jupyter Notebook7500

HansungGPT

Language:Jupyter Notebook100

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language:PythonMIT554700

torchtune

A Native-PyTorch Library for LLM Fine-tuning

Language:PythonBSD-3-Clause375800

level2-3-cv-finalproject-cv-08

level2-3-cv-finalproject-cv-08 created by GitHub Classroom

Language:Python700

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.02118700

PandaGPT

[TLLM'23] PandaGPT: One Model To Instruction-Follow Them All

Language:PythonApache-2.074500

fromage

🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".

Language:Jupyter NotebookApache-2.047000

gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Language:Jupyter NotebookApache-2.041000

style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Language:PythonApache-2.0117700

attention-map

🚀 Cross attention map tools for huggingface/diffusers

Language:PythonMIT9300

level3_cv_finalproject-cv-12

level3_cv_finalproject-cv-12 created by GitHub Classroom

Language:Jupyter Notebook400

awesome-vision-language-modeling

Recent Advances in Vision-Language Pre-training!

2600

level3_nlp_finalproject-nlp-02

level3_nlp_finalproject-nlp-02 created by GitHub Classroom

Language:Python500

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

103100

google-research

Google Research

Language:Jupyter NotebookApache-2.03359700

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Language:PythonBSD-3-Clause139500

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT576100

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language:PythonApache-2.0148000

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION814700