codingfor

codingfor

Geek Repo

Location:hangzhou

Github PK Tool:Github PK Tool

codingfor's starred repositories

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:5325Issues:0Issues:0

VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Language:PythonLicense:Apache-2.0Stargazers:4926Issues:0Issues:0

ai-toolkit

Various AI scripts. Mostly Stable Diffusion stuff.

Language:PythonLicense:MITStargazers:3172Issues:0Issues:0

Stable-Diffusion

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, Kaggle, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya, Midjourney, RunPod

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:2102Issues:0Issues:0

OneTrainer

OneTrainer is a one-stop solution for all your stable diffusion training needs.

Language:PythonLicense:AGPL-3.0Stargazers:1726Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:9577Issues:0Issues:0
Language:CStargazers:1Issues:0Issues:0

FaceGrab

Batch extract known face from video/image sequence (CNN GPU with CUDA / HoG)

Language:PythonStargazers:5Issues:0Issues:0
Language:Jupyter NotebookStargazers:138Issues:0Issues:0

ComfyUI-MimicMotion

a comfyui custom node for MimicMotion

Language:PythonStargazers:333Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:305Issues:0Issues:0

MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Language:PythonLicense:NOASSERTIONStargazers:1825Issues:0Issues:0

leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:13621Issues:0Issues:0

Scrapegraph-ai

Python scraper based on AI

Language:PythonLicense:MITStargazers:15145Issues:0Issues:0

anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.

Language:JavaScriptLicense:MITStargazers:25200Issues:0Issues:0

XHS-Downloader

小红书链接提取/作品采集工具:提取账号发布、收藏、点赞、专辑作品链接;提取搜索结果作品、用户链接;采集小红书作品信息;提取小红书作品下载地址;下载小红书无水印作品文件!

Language:PythonLicense:GPL-3.0Stargazers:5394Issues:0Issues:0

RAGflow

This repo provides methods for building and evaluating Retrieval Augmented Generation (RAG) systems.

Language:PythonStargazers:16Issues:0Issues:0

FastGPT

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

Language:TypeScriptLicense:NOASSERTIONStargazers:17755Issues:0Issues:0

Transformers-for-NLP-2nd-Edition

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E including jump starting GPT-4, speech-to-text, text-to-speech, text to image generation with DALL-E, Google Cloud AI,HuggingGPT, and more

Language:Jupyter NotebookLicense:MITStargazers:796Issues:0Issues:0

tesseract

Tesseract Open Source OCR Engine (main repository)

Language:C++License:Apache-2.0Stargazers:61966Issues:0Issues:0

Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Language:PythonStargazers:600Issues:0Issues:0

Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:PythonStargazers:1796Issues:0Issues:0

whisper.cpp

Port of OpenAI's Whisper model in C/C++

Language:C++License:MITStargazers:35295Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:66988Issues:0Issues:0

llama3-Chinese-chat

Llama3、Llama3.1 中文仓库(随书籍撰写中... 各种网友及厂商微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档)

Language:PythonStargazers:3998Issues:0Issues:0

BPB-Worker-Panel

A GUI Panel providing Worker subscriptions, Fragment settings and Warp configs, providing configs for cross-platform clients using (Sing-box, Clash and Xray cores)

Language:JavaScriptLicense:GPL-3.0Stargazers:4963Issues:0Issues:0

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Language:ShellLicense:Apache-2.0Stargazers:25487Issues:0Issues:0

autosub

[NO LONGER MAINTAINED] Command-line utility for auto-generating subtitles for any video file

Language:PythonLicense:MITStargazers:4144Issues:0Issues:0

videoWater

视频批量处理,码率设置,格式转换,添加字幕,添加水印,去除水印,修改分辨率,视频剪裁,倍速播放

License:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonStargazers:72Issues:0Issues:0