Shivam Sharma's starred repositories
face_recognition
The world's simplest facial recognition api for Python and the command line
sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
awesome-image-captioning
A curated list of image captioning and related area resources. :-)
Urban-Sound-Classification
Urban sound classification using Deep Learning
LRV-Instruction
[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
harmful-memes-detection-resources
Resources (conference/journal publications, references to dataset) for harmful memes detection.
ML-Reading-Group
Collection of talks given in the ML reading group@IIITD
MEMEX_Meme_Evidence
Official repo for ACL'23 (main) paper - MEMEX: Detecting Explanatory Evidence for Memes via Knowledge-Enriched Contextualization