ycyoon's repositories
Face-Transformer
Face Transformer for Recognition
3DDFA
The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
AffectGPT
A Real-world Audio-Video-Text Aligned Dataset for Explainable Multimodal Emotion Reasoning
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
darknet
Convolutional Neural Networks
deep-learning-models
Keras code and weights files for popular deep learning models.
EfficientPhys
EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Vitals Measurement
Faster-RCNN_TF
Faster-RCNN in TensorFlow
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
instructGOOSE
Implementation of Reinforcement Learning from Human Feedback (RLHF)
jnuventure.github.io
Jeju National University Venture Startup Academy
leaf-pytorch
PyTorch implementation of the LEAF audio frontend
LMkor
Pretrained Language Models for Korean
mmdetection
OpenMMLab Detection Toolbox and Benchmark
models
Models and examples built with TensorFlow
pytorch_face_landmark
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.
RL4LMs
A modular RL library to fine-tune language models to human preferences
scenic
Scenic: A JAX Library for Computer Vision Research and Beyond
SLF-RPM
Official PyTorch implementation of AAAI-22: Self-supervised Representation Learning Framework for Remote Physiological Measurement using Spatiotemporal Augmentation Loss (SLF-RPM)
TAdaConv
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, video representation learning and temporal detection.
trl
Train transformer language models with reinforcement learning.
udapi-python
Python framework for processing Universal Dependencies data
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Video-Swin-Transformer
This is an official implementation of "Video Swin Transformer".
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
ycyoon.github.io
A beautiful, simple, clean, and responsive Jekyll theme for academics
youtube-dl
Command-line program to download videos from YouTube.com and other video sites