Capper (Boomprogrammar)

Boomprogrammar

Geek Repo

Github PK Tool:Github PK Tool

Capper's starred repositories

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49120Issues:0Issues:0

HackerDictionary

整理一些黑客蛮力攻击常用的字典

License:MITStargazers:7Issues:0Issues:0

sd-webui-regional-prompter

set prompt to divided region

Language:PythonLicense:AGPL-3.0Stargazers:1453Issues:0Issues:0

ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Language:PythonStargazers:4461Issues:0Issues:0
Language:PythonStargazers:43Issues:0Issues:0

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6741Issues:0Issues:0

Style2Paints_V3

Reimplementation of Style2Paints V3

Language:PythonStargazers:94Issues:0Issues:0

ControlLoRA

ControlLoRA: A Lightweight Neural Network To Control Stable Diffusion Spatial Information

Language:PythonLicense:Apache-2.0Stargazers:539Issues:0Issues:0

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language:PythonLicense:NOASSERTIONStargazers:22326Issues:0Issues:0

CAT

A CRF-based ASR Toolkit

Language:PythonLicense:Apache-2.0Stargazers:311Issues:0Issues:0

style2paints

sketch + style = paints :art: (TOG2018/SIGGRAPH2018ASIA)

Language:JavaScriptLicense:Apache-2.0Stargazers:17879Issues:0Issues:0

Pixeldraw

A pixel drawing application 一个像素绘画软件

Language:JavaStargazers:5Issues:0Issues:0

CIF-PyTorch

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Language:PythonLicense:Apache-2.0Stargazers:65Issues:0Issues:0

CIF-HieraDist

[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation

Language:PythonLicense:Apache-2.0Stargazers:33Issues:0Issues:0

DFSMN

Tensorflow version of DFSMN

Language:PythonStargazers:48Issues:0Issues:0

DPHuBERT

INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"

Language:PythonLicense:MITStargazers:95Issues:0Issues:0

XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Language:PythonLicense:MITStargazers:289Issues:0Issues:0

FitHuBERT

FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)

Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

SGEM

Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization (INTERSPEECH 2023 Oral Presentation)

Language:PythonLicense:MITStargazers:29Issues:0Issues:0

ContextNet

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

Language:PythonLicense:Apache-2.0Stargazers:31Issues:0Issues:0

UnsupSeg

Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)

Language:PythonLicense:MITStargazers:134Issues:0Issues:0

chinese-chatbot-corpus

中文公开聊天语料库

Language:PythonLicense:Apache-2.0Stargazers:3918Issues:0Issues:0

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Stargazers:12757Issues:0Issues:0

Emotion-Recognition-Papers

A list of papers for emotion recognition using machine learning/deep learning.

Stargazers:50Issues:0Issues:0

CLONE_DK

使用聊天记录和播客文章,基于chatGLM-6B训练自己的数字克隆的方案实现,包括用到的脚本和最后部署成前端页面的代码

Language:PythonLicense:MITStargazers:235Issues:0Issues:0

awesome-aigc

A list of awesome AIGC works

License:CC0-1.0Stargazers:534Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:4352Issues:0Issues:0

wav2vec

a simplified version of wav2vec(1.0, vq, 2.0) in fairseq

Language:PythonStargazers:118Issues:0Issues:0

chinese_speech_pretrain

chinese speech pretrained models

Language:ShellStargazers:939Issues:0Issues:0

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Language:PythonLicense:MITStargazers:1083Issues:0Issues:0