Jongho Lee's repositories
darknet_idsl
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
yolov3_idsl
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
GANLatentDiscovery
The authors official implementation of Unsupervised Discovery of Interpretable Directions in the GAN Latent Space
google-research
Google Research
image-super-resolution
🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.
pixel2style2pixel
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation"
ppt_goja_ai
Software Maestro 11th
tensorflow-deeplab-v3-plus
DeepLabv3+ built in TensorFlow
ByteTrack
ByteTrack: Multi-Object Tracking by Associating Every Detection Box
DFKI_3dPoseToSensor
3D pose estimation for generating sensor data
dust3r
DUSt3R: Geometric 3D Vision Made Easy
examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
GFNet
[NeurIPS 2021] Global Filter Networks for Image Classification
how-do-vits-work
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
mast3r
Grounding Image Matching in 3D with MASt3R
mlp-mixer
Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
tpu
Reference models and tools for Cloud TPUs.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs