ycyoon's repositories
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
ycyoon.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
AffectGPT
A Real-world Audio-Video-Text Aligned Dataset for Explainable Multimodal Emotion Reasoning
leaf-pytorch
PyTorch implementation of the LEAF audio frontend
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
jnuventure.github.io
제주대학교 벤처스타트업 아카데미
trl
Train transformer language models with reinforcement learning.
RL4LMs
A modular RL library to fine-tune language models to human preferences
instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
LMkor
Pretrained Language Models for Korean
SLF-RPM
Official PyTorch implementation of AAAI-22: Self-supervised Representation Learning Framework for Remote PhysiologicalMeasurement using Spatiotemporal Augmentation Loss (SLF-RPM)
pytorch_face_landmark
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.
scenic
Scenic: A Jax Library for Computer Vision Research and Beyond
Face-Transformer
Face Transformer for Recognition
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
mmdetection
OpenMMLab Detection Toolbox and Benchmark
TAdaConv
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
EfficientPhys
EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
3DDFA
The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
youtube-dl
Command-line program to download videos from YouTube.com and other video sites
xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
darknet
Convolutional Neural Networks
Faster-RCNN_TF
Faster-RCNN in Tensorflow
deep-learning-models
Keras code and weights files for popular deep learning models.
models
Models and examples built with TensorFlow
udapi-python
Python framework for processing Universal Dependencies data