SoNguyen's repositories
3DNBF
Official code base for the ICCV 2023 paper "3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation"
accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
crnn-ctc-loss-digit
OCR Handwriting Number
awesome-SOTA-FER
A curated list of facial expression recognition in both 7-emotion classification and affect estimation.
CDFSOD-benchmark
A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector)
DiffSplat
[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".
DocAligner
Predictions of the four corners of documents.
EscherNet
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
Face-Analysis
Face-Analysis: Age, Race, Masked, Skintone, Emotion, Gender
flux
Official inference repo for FLUX.1 models
GeoCalib
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
handwriting-synthesis
Handwriting Synthesis with RNNs ✏️
LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
omniglue
Code release for CVPR'24 submission 'OmniGlue'
One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
Recommendations-Document-Image-Processing
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing"
RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
tutorials_triton
This repository contains tutorials and examples for Triton Inference Server
uni_fas
5th Chalearn Face Anti-spoofing Workshop and Challenge@CVPR2024
VLM-R1
Solve Visual Understanding with Reinforced VLMs