ASHNOORSINGH's starred repositories

opensmile

The Munich Open-Source Large-Scale Multimedia Feature Extractor

Language: C++ | License: NOASSERTION | Stargazers: 548 | Issues: 0

Speech-Emotion-Recognition

Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)

Language: Python | License: MIT | Stargazers: 942 | Issues: 0

opensmile-python

Python package for openSMILE

Language: Python | License: NOASSERTION | Stargazers: 232 | Issues: 0

portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

Language: C | License: NOASSERTION | Stargazers: 1381 | Issues: 0

Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

Stargazers: 869 | Issues: 0

OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Language: Python | License: MIT | Stargazers: 626 | Issues: 0

gptq

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Language: Python | License: Apache-2.0 | Stargazers: 1797 | Issues: 0

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

Language: Python | License: Apache-2.0 | Stargazers: 740 | Issues: 0

EmoBox

[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

Language: Python | Stargazers: 94 | Issues: 0

emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Language: Python | Stargazers: 518 | Issues: 0

SenseVoice

Multilingual Voice Understanding Model

Language: Python | License: NOASSERTION | Stargazers: 1490 | Issues: 0

awesome-foundation-model-leaderboards

A curated list of awesome leaderboards for foundation models

Stargazers: 176 | Issues: 0

INTERSPEECH-2023-Papers

INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License: MIT | Stargazers: 617 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8142 | Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language: Python | License: NOASSERTION | Stargazers: 4873 | Issues: 0

DeepFilterNet

Noise suppression using deep filtering

Language: Python | License: NOASSERTION | Stargazers: 2204 | Issues: 0

Language: Jupyter Notebook | Stargazers: 79 | Issues: 0

DeepAnT-A-Deep-Learning-Approach-for-Unsupervised-Anomaly-Detection-in-Time-Series

Code for DeepAnT: A Deep Learning Approach for Unsupervised Anomaly Detection in Time Series

Language: Jupyter Notebook | Stargazers: 6 | Issues: 0

UnSAM

Code release for "Segment Anything without Supervision"

Language: Jupyter Notebook | Stargazers: 243 | Issues: 0

summary-of-a-haystack

Codebase accompanying the Summary of a Haystack paper.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 61 | Issues: 0

Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

License: MIT | Stargazers: 392 | Issues: 0

aphrodite-engine

PygmalionAI's large-scale inference engine

Language: Python | License: AGPL-3.0 | Stargazers: 803 | Issues: 0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Language: Python | License: MIT | Stargazers: 12264 | Issues: 0

MMEvalPro

Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs

Language: Python | Stargazers: 19 | Issues: 0

textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Language: Python | License: MIT | Stargazers: 1211 | Issues: 0

Edge-Pruning

Code and data for the paper "Finding Transformer Circuits with Edge Pruning".

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Language: Python | License: Apache-2.0 | Stargazers: 404 | Issues: 0

VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Language: Python | License: Apache-2.0 | Stargazers: 546 | Issues: 0

FinRobot

FinRobot: An Open-Source AI Agent Platform for Financial Applications using LLMs 🚀 🚀 🚀

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1231 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 24520 | Issues: 0