Khanh Vo Duc's repositories
VILA
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops)
Megatron-LM
Ongoing research training transformer models at scale
NeMo-Aligner
Scalable toolkit for efficient model alignment
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
isaac-sim-jetson-hil-course-doc
Doc site for Isaac Sim + Jetson HIL hands-on course
Lidar_AI_Solution
A project demonstrating Lidar-related AI solutions, including three GPU-accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution, YUV2RGB, cuOSD).
LLM-Finetuning-Hub
Repository that contains LLM fine-tuning and deployment scripts along with our research findings.
nlp-in-3-weeks
Repository of the NLP in 3 weeks series starting 2023-12-05
mlx
MLX: An array framework for Apple silicon
VTimeLLM
Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
neuralangelo
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)
opencv
Open Source Computer Vision Library
cuDLA-samples
YOLOv5 on Orin DLA
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Deep-Learning-Accelerator-SW
NVIDIA DLA-SW, the recipes and tools for running deep learning workloads on NVIDIA DLA cores for inference applications.
arrayfire
ArrayFire: a general purpose GPU library.
LLaVA
[NeurIPS 2023 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
NeVA
The open source implementation of "NeVA: NeMo Vision and Language Assistant"
VideoLLM
VideoLLM: Modeling Video Sequence with Large Language Models
ODISE
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
pybind11
Seamless operability between C++11 and Python
slt-techwrite
O'Reilly Technical Writing Course
neural-graphical-models
Neural Graphical Models are neural-network-based graphical models that offer richer representation and faster inference and sampling