SoNguyen's repositories

3DNBF

Official code base for the ICCV 2023 paper "3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

accelerated_features

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

crnn-ctc-loss-digit

OCR Handwriting Number

Language:PythonStargazers:0Issues:0Issues:0

awesome-SOTA-FER

A curated list of facial expression recognition in both 7-emotion classification and affect estimation.

Stargazers:0Issues:0Issues:0

CDFSOD-benchmark

A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector)

License:Apache-2.0Stargazers:0Issues:0Issues:0

DiffSplat

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

License:MITStargazers:0Issues:0Issues:0

DocAligner

Predictions of the four corners of documents.

License:Apache-2.0Stargazers:0Issues:0Issues:0

EscherNet

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis

License:NOASSERTIONStargazers:0Issues:0Issues:0

Face-Analysis

Face-Analysis: Age, Race, Masked, Skintone, Emotion, Gender

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:0Issues:0

flux

Official inference repo for FLUX.1 models

License:Apache-2.0Stargazers:0Issues:0Issues:0

GeoCalib

GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)

License:Apache-2.0Stargazers:0Issues:0Issues:0

GRM

Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

Language:JavaScriptStargazers:0Issues:0Issues:0

handwriting-synthesis

Handwriting Synthesis with RNNs ✏️

Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language:PythonStargazers:0Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

LLMs-from-scratch

Implementing a ChatGPT-like LLM from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

omniglue

Code release for CVPR'24 submission 'OmniGlue'

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

One-DM

Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation

License:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Recommendations-Document-Image-Processing

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

Stargazers:0Issues:0Issues:0

RemoteCLIP

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing"

Stargazers:0Issues:0Issues:0

RT-DETR

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

tutorials_triton

This repository contains tutorials and examples for Triton Inference Server

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

uni_fas

5th Chalearn Face Anti-spoofing Workshop and Challenge@CVPR2024

License:Apache-2.0Stargazers:0Issues:0Issues:0

VLM-R1

Solve Visual Understanding with Reinforced VLMs

License:Apache-2.0Stargazers:0Issues:0Issues:0