Cheng-Lin Tsai's repositories
Agent-E
Agent driven automation starting with the web. Discord: https://discord.gg/wgNfmFuqJF
claude-dev
Autonomous software engineer right in your IDE, capable of reading/writing files, executing commands, and more with your permission every step of the way.
claude-engineer
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This tool combines the capabilities of a large language model with practical file system operations and web search functionality.
comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
continue
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
dataline
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
fish-speech
Brand new TTS solution
genai-os
Kuwa GenAI OS: An open, free, secure, and privacy-focused Generative-AI Operating System.
gptpdf
Using GPT to parse PDF
groqnotes
Groqnotes: Generate organized notes from audio using Groq, Whisper, and Llama3
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Husky-v1
Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.
Kolors
Kolors Team
llm-graph-builder
Neo4j graph construction from unstructured data using LLMs
LLM101n
LLM101n: Let's build a Storyteller
mesop
Build delightful web apps quickly in Python
micro-agent
An AI agent that writes (actually useful) code for you
MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
mindsdb
The platform for building AI from enterprise data
ml-4m
4M: Massively Multimodal Masked Modeling
nomic
Interact, analyze and structure massive text, image, embedding, audio and video datasets
OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
RapidOCR
Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. (将PaddleOCR模型做了转换,采用ONNXRuntime推理,速度很快)
RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
stable-audio-tools
Generative models for conditional audio generation
Streamer-Sales
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️
YaFSDP
YaFSDP: Yet another Fully Sharded Data Parallel