khursani8

Kuala Lumpur

Organizations

ai-rush-2019

utphax

Sani's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | MIT | 65140 5430

privateGPT

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language: Python | Apache-2.0 | 49730 443 992

supervision

We write your reusable computer vision tools. 💜

Language: Python | MIT | 18155 128 389

marvin

✨ Build AI interfaces that spark joy

Language: Python | Apache-2.0 | 5034 37 203

InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python | Apache-2.0 | 3174 43 49

docta

A Doctor for your data

Language: Python | NOASSERTION | 3066 117 3

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | MIT | 2898 37 197

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language: C++ | Apache-2.0 | 2618 44 391

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | MIT | 1475 27 45

ChatWaifu_Mobile

Mobile anime-style AI waifu chat app

Language: C++ | MIT | 1204 21 22

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python | MIT | 1033 17 72

webwhiz

WebWhiz allows you to create an AI chatbot that knows everything about your product and can instantly respond to your customer's queries.

Language: TypeScript | AGPL-3.0 | 891 17 73

langcorn

⛓️ Serving LangChain LLM apps and agents automagically with FastAPI. LLMOps

Language: Python | MIT | 874 8 18

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language: Python | MIT | 828 22 87

SD-CN-Animation

This script automates video stylization tasks using Stable Diffusion and ControlNet.

Language: Python | MIT | 806 15 154

MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch

Language: Python | MIT | 605 11 13

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language: Python | BSD-3-Clause | 574 18 74

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language: TeX | Apache-2.0 | 563 7 5

sleap

A deep learning framework for multi-animal pose tracking.

Language: Python | NOASSERTION | 414 22 644

udpipe

UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files

Language: C++ | MPL-2.0 | 357 28 165

ZipIt

A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training

Language: Python | MIT | 265 3 26

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Language: Python | 223 19 37

prompt-optimizer

Minimize LLM token complexity to save API costs and model computations.

Language: Python | MIT | 216 5 5

chat2plot

Chat to visualization with LLM

Language: Python | MIT | 182 5 9

efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Language: Jupyter Notebook | Apache-2.0 | 145 6 9

NaturalSpeech2

Language: Jupyter Notebook | MIT | 133 12 11

stable-diffusion-webui-daam

DAAM for Stable Diffusion Web UI

Language: Python | NOASSERTION | 90010

QuickEmbedding

Language: Python | MIT | 65 10

PyAction

A Toolkit for Video Action Recognition (Classification/Detection)

Language: Python | Apache-2.0 | 16 4 2

stable-diffusion-webui-metadata-marker

Stable Diffusion WebUI extension. Renders generation information on the output image.

Language: Python | Apache-2.0 | 13 20