Sheng Zhao (nuaazs)

nuaazs

Geek Repo

Company:Nanjing University of Aeronautics and Astronautics

Location:Pavia , Italy

Github PK Tool:Github PK Tool

Sheng Zhao's starred repositories

Python

All Algorithms implemented in Python

Language:PythonLicense:MITStargazers:183677Issues:5942Issues:1458

flutter

Flutter makes it easy and fast to build beautiful apps for mobile and beyond

Language:DartLicense:BSD-3-ClauseStargazers:164297Issues:3524Issues:97896

screenshot-to-code

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Language:PythonLicense:MITStargazers:55835Issues:324Issues:287

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:48663Issues:366Issues:2957

whisper.cpp

Port of OpenAI's Whisper model in C/C++

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:30056Issues:171Issues:489

DeepFaceLive

Real-time face swap for PC streaming or video calls

Language:PythonLicense:GPL-3.0Stargazers:25784Issues:362Issues:144

LivePortrait

Bring portraits to life!

Language:PythonLicense:NOASSERTIONStargazers:11007Issues:106Issues:298

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-2-ClauseStargazers:10672Issues:128Issues:671

plantuml

Generate diagrams from textual description

Language:JavaLicense:NOASSERTIONStargazers:10238Issues:157Issues:1148

yolov10

YOLOv10: Real-Time End-to-End Object Detection

Language:PythonLicense:AGPL-3.0Stargazers:9044Issues:46Issues:368

noise-suppression-for-voice

Noise suppression plugin based on Xiph's RNNoise

Language:C++License:GPL-3.0Stargazers:4731Issues:61Issues:166

panel

Panel: The powerful data exploration & web app framework for Python

Language:PythonLicense:BSD-3-ClauseStargazers:4585Issues:57Issues:3550

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:4287Issues:49Issues:295

rnnoise

Recurrent neural network for audio noise reduction

Language:CLicense:BSD-3-ClauseStargazers:3959Issues:150Issues:197

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3646Issues:32Issues:496

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

DictionaryByGPT4

一本 GPT4 生成的单词书📚,超过 8000 个单词分析,涵盖了词义、例句、词根词缀、变形、文化背景、记忆技巧和小故事

Language:HTMLLicense:CC-BY-SA-4.0Stargazers:3119Issues:22Issues:24

DeepFilterNet

Noise supression using deep filtering

Language:PythonLicense:NOASSERTIONStargazers:2315Issues:33Issues:274

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:2303Issues:31Issues:102

avue

Avue.js2.0是基于现有的element-ui库进行的二次封装,简化一些繁琐的操作,核心理念为数据驱动视图,主要的组件库针对table表格和form表单场景,同时衍生出更多企业常用的组件,达到高复用,容易维护和扩展的框架,同时内置了丰富了数据展示组件,让开发变得更加容易

Language:VueLicense:MITStargazers:2197Issues:56Issues:578

DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Language:PythonLicense:MITStargazers:1975Issues:19Issues:46

Chinese-LLaMA-Alpaca-3

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Language:PythonLicense:Apache-2.0Stargazers:1481Issues:18Issues:69

DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Language:PythonLicense:CC-BY-4.0Stargazers:1048Issues:49Issues:146

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

gpuRIR

Python library for Room Impulse Response (RIR) simulation with GPU acceleration

Language:CudaLicense:AGPL-3.0Stargazers:470Issues:10Issues:51

wavmark

AI-based Audio Watermarking Tool

Language:PythonLicense:MITStargazers:208Issues:8Issues:14

Auto-Tuning-Spectral-Clustering

This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"

Language:PythonLicense:MITStargazers:105Issues:7Issues:2

supervoice-vall-e-2

VALL-E 2 reproduction

Language:Jupyter NotebookStargazers:68Issues:7Issues:2

SLP_NUAA

Git Repository of the Summer Lecture Program held at NUAA.

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:2Issues:0Issues:0