dreamhomes

沈梦家's starred repositories

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

Language:PythonGPL-3.0367900

360LayoutAnalysis

360LayoutAnaylsis, a series Document Analysis Models and Datasets deleveped by 360 AI Research Institute

Apache-2.016800

Docs2KG

Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models

Language:PythonLGPL-2.113100

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT2896700

timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Language:PythonApache-2.0306400

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language:PythonApache-2.0363700

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language:PythonApache-2.02601700

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language:PythonApache-2.01157600

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language:PythonApache-2.0221500

Qwen-Agent

Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.

Language:PythonNOASSERTION261100

one-api

OpenAI 接口管理 & 分发系统，支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元，可用于二次分发管理 key，仅单可执行文件，已打包好 Docker 镜像，一键部署，开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI.

Language:JavaScriptMIT1623200

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language:Jupyter NotebookCC-BY-4.02827500

QAnything

Question and Answer based on Anything.

Language:PythonApache-2.01060500

OpenVoice

Instant voice cloning by MyShell.

Language:PythonMIT2720700

marker

Convert PDF to markdown quickly with high accuracy

Language:PythonGPL-3.01420700

excalidraw

Virtual whiteboard for sketching hand-drawn like diagrams

Language:TypeScriptMIT7714100

Fooocus

Focus on prompting and generating

Language:PythonGPL-3.03818200

One2345plus

48700

Vary

[ECCV2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Language:Python165300

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonMIT606100

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonNOASSERTION103300

habitat-sim

A flexible, high-performance 3D simulator for Embodied AI research.

Language:C++MIT245700

agentlego

Enhance LLM agents with versatile tool APIs

Language:PythonApache-2.031000

Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.

GPL-3.0854000