robotPin's repositories
akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
bark
🔊 Text-Prompted Generative Audio Model
chatbox
Chatbox is a desktop app for GPT/LLM that supports Windows, Mac, Linux & Web Online
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
comfyui-portrait-master-zh-cn
肖像大师 中文版 comfyui-portrait-master
Digital_Life_Server
Yet another voice assistant, but alive.
edge-tts
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
facefusion
Next generation face swapper and enhancer
gpt4free
decentralising the Ai Industry, just some language model api's...
GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
immersive-translate
Immersive Dual Web Page Translation Extension - 沉浸式双语网页翻译扩展
it-tools
Collection of handy online tools for developers, with great UX.
Make-A-Protagonist
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
photoshot
An open-source AI avatar generator web app - https://photoshot.app
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
pycorrector
pycorrector is a toolkit for text error correction. 文本纠错,Kenlm,ConvSeq2Seq,BERT,MacBERT,ELECTRA,ERNIE,Transformer,T5等模型实现,开箱即用。
Rope
GUI-focused roop
SadTalker-Video-Lip-Sync
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
SdPaint
Stable Diffusion Painting
SDT
This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR23).
stable-diffusion-webui
Stable Diffusion web UI
StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
subtitleedit
the subtitle editor :)
tts-vue
🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。
Video2Music
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)