suryatmodulus

Surya T - Secondary's repositories

ADeus

An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI.

NOASSERTION000

AQLM

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf

Apache-2.0000

cog-stickers

Make stickers

MIT000

Coqui-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0000

docarray

🧬 The data structure for multimodal data · Neural Search · Vector Search · Document Store

Language:PythonApache-2.0000

dora

Implementation of DoRA

MIT000

easyblocks

The open-source visual builder framework.

AGPL-3.0000

emojis

Turn your ideas into emojis in seconds. Generate your favorite Slack emojis with just one click.

Language:TypeScriptAGPL-3.0000

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

Language:PythonNOASSERTION000

hf-ai-cookbook

Open-source AI cookbook

Apache-2.0000

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonMIT000

libfvad

Voice activity detection (VAD) library, based on WebRTC's VAD engine

BSD-3-Clause000

LWM

Apache-2.0000

MagicDance

MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer

000

magika

Detect file content types with deep learning

Apache-2.0000

mediamtx

ready-to-use RTSP server and RTSP proxy that allows to read and publish video and audio streams via UDP and TCP

Language:GoMIT000

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

MIT000

oot_cog_samples

000

OpenVoice

Instant voice cloning by MyShell

Language:PythonMIT000

pico-tflmicro

Pico TensorFlow Lite Port

Language:C++000

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT000

react-native-reusables

shadcn/ui for React Native: Copy, paste, and tailor React Native components to suit your specific requirements.

MIT000

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT000

StableCascade

MIT000

stickerbaker

Let's bake some (AI) stickers!

000

tailscale

The easiest, most secure way to use WireGuard and 2FA.

Language:GoBSD-3-Clause000

upsy

Your new mate on Slack. Powered by AI.

MIT000

wg-easy

The easiest way to run WireGuard VPN + Web-based Admin UI.

Language:JavaScriptNOASSERTION010

whisper-plus

WhisperPlus: Advancing Speech-to-Text Processing 🚀

Apache-2.0000

ZLUDA

CUDA on AMD GPUs

Apache-2.0000