Sani (khursani8)

Location: Kuala Lumpur

Home Page: khursani.win


Organizations
ai-rush-2019
utphax

Sani's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language: Python | License: MIT | Stargazers: 65140 | Issues: 543 | Issues: 0

privateGPT

Interact with your documents using the power of GPT, 100% privately, no data leaks

Language: Python | License: Apache-2.0 | Stargazers: 49730 | Issues: 443 | Issues: 992

supervision

We write your reusable computer vision tools. 💜

Language: Python | License: MIT | Stargazers: 18155 | Issues: 128 | Issues: 389

marvin

✨ Build AI interfaces that spark joy

Language: Python | License: Apache-2.0 | Stargazers: 5034 | Issues: 37 | Issues: 203

InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python | License: Apache-2.0 | Stargazers: 3174 | Issues: 43 | Issues: 49

docta

A Doctor for your data

Language: Python | License: NOASSERTION | Stargazers: 3066 | Issues: 117 | Issues: 3

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | Stargazers: 2898 | Issues: 37 | Issues: 197

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language: C++ | License: Apache-2.0 | Stargazers: 2618 | Issues: 44 | Issues: 391

Personalize-SAM

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Language: Python | License: MIT | Stargazers: 1475 | Issues: 27 | Issues: 45

ChatWaifu_Mobile

Mobile anime-style AI waifu chat app

Language: C++ | License: MIT | Stargazers: 1204 | Issues: 21 | Issues: 22

spacy-llm

🦙 Integrating LLMs into structured NLP pipelines

Language: Python | License: MIT | Stargazers: 1033 | Issues: 17 | Issues: 72

webwhiz

WebWhiz allows you to create an AI chatbot that knows everything about your product and can instantly respond to your customer's queries.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 891 | Issues: 17 | Issues: 73

langcorn

⛓️ Serving LangChain LLM apps and agents automagically with FastAPI. LLMops

Language: Python | License: MIT | Stargazers: 874 | Issues: 8 | Issues: 18

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

Language: Python | License: MIT | Stargazers: 828 | Issues: 22 | Issues: 87

SD-CN-Animation

This script automates the video stylization task using StableDiffusion and ControlNet.

Language: Python | License: MIT | Stargazers: 806 | Issues: 15 | Issues: 154

MEGABYTE-pytorch

Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

Language: Python | License: MIT | Stargazers: 605 | Issues: 11 | Issues: 13

PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Language: Python | License: BSD-3-Clause | Stargazers: 574 | Issues: 18 | Issues: 74

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language: TeX | License: Apache-2.0 | Stargazers: 563 | Issues: 7 | Issues: 5

sleap

A deep learning framework for multi-animal pose tracking.

Language: Python | License: NOASSERTION | Stargazers: 414 | Issues: 22 | Issues: 644

udpipe

UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files

Language: C++ | License: MPL-2.0 | Stargazers: 357 | Issues: 28 | Issues: 165

ZipIt

A framework for merging models solving different tasks with different initializations into one multi-task model without any additional training

Language: Python | License: MIT | Stargazers: 265 | Issues: 3 | Issues: 26

NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

prompt-optimizer

Minimize LLM token complexity to save API costs and model computations.

Language: Python | License: MIT | Stargazers: 216 | Issues: 5 | Issues: 5

chat2plot

chat to visualization with LLM

Language: Python | License: MIT | Stargazers: 182 | Issues: 5 | Issues: 9

efficientspeech

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 145 | Issues: 6 | Issues: 9
Language: Jupyter Notebook | License: MIT | Stargazers: 133 | Issues: 12 | Issues: 11

stable-diffusion-webui-daam

DAAM for Stable Diffusion Web UI

Language: Python | License: NOASSERTION | Stargazers: 90 | Issues: 0 | Issues: 10
Language: Python | License: MIT | Stargazers: 65 | Issues: 1 | Issues: 0

PyAction

A Toolkit for Video Action Recognition (Classification/Detection)

Language: Python | License: Apache-2.0 | Stargazers: 16 | Issues: 4 | Issues: 2

stable-diffusion-webui-metadata-marker

Stable diffusion WebUI extension. Renders generation information on the output image.

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 2 | Issues: 0