itaban's starred repositories

DouyinLiveRecorder

可循环值守和多人录制的直播录制软件,支持抖音、TikTok、快手、虎牙、斗鱼、B站、小红书、pandatv、afreecatv、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、花椒、流星、Twitch等平台直播录制

Language:PythonLicense:MITStargazers:3922Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34778Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20915Issues:0Issues:0

SadTalker-Video-Lip-Sync

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

Language:PythonStargazers:1746Issues:0Issues:0

sitcom-simulator

A tool that combines ChatGPT, Stable Diffusion, FakeYou, and FreePD to create AI-generated videos.

Language:PythonLicense:MITStargazers:98Issues:0Issues:0

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language:PythonLicense:Apache-2.0Stargazers:2810Issues:0Issues:0

parler-tts

Inference and training library for high-quality TTS models.

Language:PythonLicense:Apache-2.0Stargazers:2905Issues:0Issues:0

HairFastGAN

Official Implementation for "HairFastGAN: Realistic and Robust Hair Transfer with a Fast Encoder-Based Approach"

Language:PythonLicense:MITStargazers:377Issues:0Issues:0

awesome-indie

awesome-indie 中文版 - 帮助独立开发者赚钱的资源

Stargazers:1305Issues:0Issues:0

awesome-indie

Resources for independent developers to make money

License:NOASSERTIONStargazers:9699Issues:0Issues:0

we-drawing

AI画图。每天一句**古诗词,生成 AI 图片 Powered by Bing DALL-E-3.

Language:TypeScriptLicense:MITStargazers:558Issues:0Issues:0

Mora

Mora: More like Sora for Generalist Video Generation

Language:PythonStargazers:1453Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:27615Issues:0Issues:0

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:27840Issues:0Issues:0

kungfu

Kungfu Trader

Language:C++License:Apache-2.0Stargazers:3340Issues:0Issues:0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:8084Issues:0Issues:0

aiXcoder-7B

official repository of aiXcoder-7B Code Large Language Model

Language:PythonLicense:Apache-2.0Stargazers:2153Issues:0Issues:0

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Language:PythonStargazers:1130Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:39269Issues:0Issues:0

skyvern

Automate browser-based workflows with LLMs and Computer Vision

Language:PythonLicense:AGPL-3.0Stargazers:5525Issues:0Issues:0

llamafile

Distribute and run LLMs with a single file.

Language:C++License:NOASSERTIONStargazers:17849Issues:0Issues:0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonLicense:Apache-2.0Stargazers:12681Issues:0Issues:0

MaxKB

🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统,1Panel 官方出品。

Language:PythonLicense:GPL-3.0Stargazers:8484Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24743Issues:0Issues:0

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10258Issues:0Issues:0

spring-ai

An Application Framework for AI Engineering

Language:JavaLicense:Apache-2.0Stargazers:2626Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3109Issues:0Issues:0

ai_webui

AI-WEBUI: A universal web interface for AI creation, 一款好用的图像、音频、视频AI处理工具

Language:Jupyter NotebookLicense:MITStargazers:193Issues:0Issues:0

iptv-sources

Autoupdate iptv sources

Language:TypeScriptLicense:GPL-3.0Stargazers:5672Issues:0Issues:0

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language:PythonLicense:MITStargazers:1493Issues:0Issues:0