Surya T - Secondary's repositories

ADeus

An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.

License:NOASSERTIONStargazers:0Issues:0Issues:0

AQLM

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf

License:Apache-2.0Stargazers:0Issues:0Issues:0

cog-stickers

Make stickers

License:MITStargazers:0Issues:0Issues:0

Coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:0Issues:0Issues:0

docarray

🧬 The data structure for multimodal data · Neural Search · Vector Search · Document Store

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dora

Implementation of DoRA

License:MITStargazers:0Issues:0Issues:0

easyblocks

The open-source visual builder framework.

License:AGPL-3.0Stargazers:0Issues:0Issues:0

emojis

Turn your ideas into emojis in seconds. Generate your favorite Slack emojis with just one click.

Language:TypeScriptLicense:AGPL-3.0Stargazers:0Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

hf-ai-cookbook

Open-source AI cookbook

License:Apache-2.0Stargazers:0Issues:0Issues:0

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

libfvad

Voice activity detection (VAD) library, based on WebRTC's VAD engine

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

MagicDance

MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer

Stargazers:0Issues:0Issues:0

magika

Detect file content types with deep learning

License:Apache-2.0Stargazers:0Issues:0Issues:0

mediamtx

ready-to-use RTSP server and RTSP proxy that allows to read and publish video and audio streams via UDP and TCP

Language:GoLicense:MITStargazers:0Issues:0Issues:0

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pico-tflmicro

Pico TensorFlow Lite Port

Language:C++Stargazers:0Issues:0Issues:0

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

react-native-reusables

shadcn/ui for React Native: Copy, paste, and tailor React Native components to suit your specific requirements.

License:MITStargazers:0Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

stickerbaker

Let's bake some (AI) stickers!

Stargazers:0Issues:0Issues:0

tailscale

The easiest, most secure way to use WireGuard and 2FA.

Language:GoLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

upsy

Your new mate on Slack. Powered by AI.

License:MITStargazers:0Issues:0Issues:0

wg-easy

The easiest way to run WireGuard VPN + Web-based Admin UI.

Language:JavaScriptLicense:NOASSERTIONStargazers:0Issues:1Issues:0

whisper-plus

WhisperPlus: Advancing Speech-to-Text Processing 🚀

License:Apache-2.0Stargazers:0Issues:0Issues:0

ZLUDA

CUDA on AMD GPUs

License:Apache-2.0Stargazers:0Issues:0Issues:0