Sreyan Ghosh (Sreyan88)


User data from GitHub: https://github.com/Sreyan88

Company: University of Maryland, Ex-Nvidia & Cisco, MIDAS@IIIT-D, Speech Lab@IIT-M

Location: College Park, Maryland

Home Page: https://sreyan88.github.io/

GitHub: @Sreyan88

Twitter: @SreyanG


Organizations
Speech-Lab-IITM

Sreyan Ghosh's repositories

GAMA

Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Language: Python | License: Apache-2.0 | Stargazers: 142 | Issues: 9 | Issues: 26

MMER

Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition

Fatigue-Detection-using-Deep-Learning

A project, based on a research paper, that detects a person's fatigue level from a photograph.

Language: Python | Stargazers: 33 | Issues: 1 | Issues: 0

LAPE

A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)

ACLM

Code for ACL 2023 Paper: ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER

VDGD

Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs

Language: Python | License: Apache-2.0 | Stargazers: 20 | Issues: 1 | Issues: 3

CompA

Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

Language: Python | Stargazers: 19 | Issues: 2 | Issues: 0

LipGER

Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition

Language: Python | License: Apache-2.0 | Stargazers: 17 | Issues: 1 | Issues: 3

RECAP

Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning

Disfluency-Detection-with-Span-Classification

This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spoken Utterances"

DALE

Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP

Language: Python | License: MIT | Stargazers: 10 | Issues: 1 | Issues: 0

Synthio

Code for ICLR 2025 Paper: Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic Data

Language: Python | License: MIT | Stargazers: 9 | Issues: 1 | Issues: 1

BioAug

Code for SIGIR 2023 paper: BioAug: Conditional Generation based Data Augmentation for Low-Resource Biomedical NER

CoDa

Code for NAACL 2024 (Findings) Paper: CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP

CoSyn

Implementation of CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network

Language: Python | Stargazers: 3 | Issues: 0 | Issues: 0

ABEX

Code for ACL 2024 paper -- ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

audio-flamingo

PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.

Stargazers: 1 | Issues: 0 | Issues: 0

sreyan88.github.io

My personal website and blog (http://sreyan88.github.io)

Language: HTML | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0