Karthik Ganesan (karthik19967829)

karthik19967829

Geek Repo

Company:ML Researcher @ nexusflow.ai

Location:Sanfrancisco, USA

Home Page:https://www.linkedin.com/in/karthik-ganesan-b07462124/

Github PK Tool:Github PK Tool

Karthik Ganesan's repositories

InferDoc

Generate SQUAD style dataset from raw text file and train a transformer based question answering model .This repo has code from https://github.com/facebookresearch/UnsupervisedQA and https://github.com/deepset-ai/haystack

Language:PythonStargazers:11Issues:3Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Ballerina

This repo has code base thats a fusion of BOLAA and Webarena to be build hyper-personalized agents that are aligned to your life-goals

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

codingInterview

coding interview brushup

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

espnet_onnx

Onnx wrapper for espnet infrernce model

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

hexa

Discovering and Achieving Goals via World Models, NeurIPS 2021

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

LongLoRA

Code and documents of LongLoRA and LongAlpaca

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:SCSSLicense:MITStargazers:0Issues:0Issues:0

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0

NexusRaven

NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sharedtask-dialdoc2021

doc2dial data includes a set of documents from multiple domains; and conversations between an assisting agent and an end user that are grounded in the associated documents.

Language:PythonStargazers:0Issues:1Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0

soundstorm-speechtokenizer

Implementation of SoundStorm built upon SpeechTokenizer.

License:MITStargazers:0Issues:0Issues:0

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

vocode-python

🤖 Build voice-based LLM agents. Modular + open source.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

WCN-BERT

Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).

Language:PythonStargazers:0Issues:0Issues:0

zeno-build

Build, evaluate, analyze, and understand LLM-based apps

Language:PythonLicense:MITStargazers:0Issues:0Issues:0