karthik19967829

Karthik Ganesan's repositories

InferDoc

Generate SQUAD style dataset from raw text file and train a transformer based question answering model .This repo has code from https://github.com/facebookresearch/UnsupervisedQA and https://github.com/deepset-ai/haystack

Language:Python11 30

DSTC11-Benchmark

Language:Python8 5 1

16-884.github.io

Language:HTML000

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonNOASSERTION000

Ballerina

This repo has code base thats a fusion of BOLAA and Webarena to be build hyper-personalized agents that are aligned to your life-goals

Language:Python010

BOLAA

Language:PythonApache-2.0000

codingInterview

coding interview brushup

Language:Jupyter Notebook000

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonNOASSERTION010

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.0010

espnet_onnx

Onnx wrapper for espnet infrernce model

Language:PythonMIT000

externalcolabcode

Language:Python000

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.0000

hexa

Discovering and Achieving Goals via World Models, NeurIPS 2021

Language:PythonMIT000

hexa-benchmark

Language:Python000

karthik19967829.github.io

Language:HTML020

LongLoRA

Code and documents of LongLoRA and LongAlpaca

Language:PythonApache-2.0000

mmml-course

Language:SCSSMIT000

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language:HTMLMIT000

NexusRaven

NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.

Language:PythonApache-2.0000

NexusRaven-V2

Language:Jupyter Notebook000

pydmps

Language:PythonGPL-3.0000

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonMIT000

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonApache-2.0000

sharedtask-dialdoc2021

doc2dial data includes a set of documents from multiple domains; and conversations between an assisting agent and an end user that are grounded in the associated documents.

Language:Python010

shinjiwlab.github.io

Language:JavaScriptMIT010

soundstorm-speechtokenizer

Implementation of SoundStorm built upon SpeechTokenizer.

MIT000

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language:PythonApache-2.0000

vocode-python

🤖 Build voice-based LLM agents. Modular + open source.

Language:PythonMIT000

WCN-BERT

Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).

Language:Python000

zeno-build

Build, evaluate, analyze, and understand LLM-based apps

Language:PythonMIT000