Akshay L Chandra's repositories
Selfie_Filters_OpenCV
This deep learning application can detect Facial Keypoints (15 unique points). They mark important areas of the face - the eyes, corners of the mouth, the nose, etc.
deep-active-learning-pytorch
A PyTorch toolkit with 8 popular deep active learning query methods implemented.
Alphabet_Recognition_Gestures
This python application recognizes alphabet from real time webcam data. The user is allowed to write the alphabet on the screen using an object-of-interest (a water bottle cap in this case).
image_bbox_slicer
This easy-to-use library is a data transformer sometimes useful in Object Detection and Segmentation tasks. With only a few lines of code, one can slice images and their bounding box annotations into smaller tiles, both into specific sizes and into any arbitrary number of equal parts. The tool also supports resizing of images and their bounding box annotations, both by specific sizes and by a resizing/scaling factor.
Webcam_Paint_OpenCV
This Python application uses OpenCV library to track an object-of-interest (water bottle cap in my case) and uses the detected object to draw colored lines (Blue, Green, Red and Yellow).
Mouse_Cursor_Control_Handsfree
This HCI (Human-Computer Interaction) application in Python(3.6) will allow you to control your mouse cursor with your facial movements, works with just your regular webcam. Its hands-free, no wearable hardware or sensors needed.
Digits_Recognition_RealTime
This python application recognizes digits from real time webcam data. The user is allowed to write the digits on the screen using an object-of-interest (a water bottle cap in this case).
Mimic_Me_CV_Game
This repo hold the code of a simple, fun game built using Affectiva's Emotion-as-a-Service API. An Emoji is shown on the screen and one has to mimic the emoji to score points.
MorseCode_Converter_DeepLearning
This 4-in-1 deep learning application can convert Morse Code signalled in 4 different ways in real time. Namely, flashlight toggles, eye winking, hand gestures and mouse clicks.
init-pools-dal
This is the official code implementation of our paper, On Initial Pools for Deep Active Learning, accepted at the Pre-registration Workshop at NeurIPS 2020.
3D-Diffusion-Policy
[arXiv 2024] 3D Diffusion Policy
dreamerv3-calvin
Mastering Diverse Domains through World Models
github-readme-stats
:zap: Dynamically generated stats for your github readmes
lang-segment-anything
SAM with text prompt
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
t3vip
T3VIP: Transformation-based 3D Video Prediction