JAGVARAL BATSELEM's starred repositories
supervision
We write your reusable computer vision tools. 💜
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
pytorch-cifar
95.47% on CIFAR10 with PyTorch
labelCloud
A lightweight tool for labeling 3D bounding boxes in point clouds.
InstructDiffusion
PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.
MV-Tractus
A simple tool to extract motion vectors from H.264-encoded videos.
BoTSORT-cpp
C++ implementation of BoT-SORT MOT algorithm with Re-ID and Camera Motion Compensation
kaggle-feedback-english-language-learning-1st-place-solution
Feedback Prize - English Language Learning: 1st place solution code
pointcloud-utils
A toolbox for point cloud processing, including filtering, bounding box extraction, ground segmentation, and clustering, each implemented with different algorithms (some via a PCL wrapper). C++17 is supported.
sam-cpp-vs
sam.cpp Visual Studio integration
Point-Clouds-3D-Perception
Using the KITTI dataset, we employed Open3D to visualize, downsample, segment with RANSAC, cluster via DBSCAN, create 3D bounding boxes, and perform surface reconstruction on point clouds.