MeteorMan (Vincent131499)

Vincent131499

Geek Repo

Location:hangzhou

Github PK Tool:Github PK Tool

MeteorMan's starred repositories

llama.cpp

LLM inference in C/C++

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:12389Issues:69Issues:410

sglang

SGLang is a fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:5420Issues:55Issues:541

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:5263Issues:52Issues:397

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Language:PythonLicense:Apache-2.0Stargazers:4777Issues:35Issues:133

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4494Issues:58Issues:152

gpu.cpp

A lightweight library for portable low-level GPU computation using WebGPU.

Language:C++License:Apache-2.0Stargazers:3686Issues:45Issues:22

ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:3682Issues:20Issues:1106

awesome-llm-apps

Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

Language:PythonLicense:CC0-1.0Stargazers:3463Issues:46Issues:19

llm_interview_note

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:2827Issues:37Issues:121

open-parse

Improved file parsing for LLM’s

Language:PythonLicense:MITStargazers:2413Issues:17Issues:33

jetson-containers

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

Language:PythonLicense:MITStargazers:2183Issues:48Issues:493

Linly-Talker

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬

Language:PythonLicense:MITStargazers:1845Issues:25Issues:98

llama_deploy

Deploy your agentic worfklows to production

Language:PythonLicense:MITStargazers:1742Issues:26Issues:104

RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

AnyGPT

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Language:PythonLicense:Apache-2.0Stargazers:552Issues:5Issues:75

T-MAC

Low-bit LLM inference on CPU with lookup table

Language:C++License:MITStargazers:464Issues:11Issues:40

TAG-Bench

TAG-Bench: A benchmark for table-augmented generation (TAG)

Language:PythonLicense:MITStargazers:457Issues:8Issues:1

GoMate

GoMate:RAG Framework within Reliable input,Trusted output

ai-hub-models

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Language:PythonLicense:BSD-3-ClauseStargazers:439Issues:16Issues:90
Language:PythonLicense:Apache-2.0Stargazers:318Issues:5Issues:25

OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

MooER

MooER: Open-sourced LLM for audio understanding trained on 80,000 hours of data

Language:PythonLicense:NOASSERTIONStargazers:118Issues:4Issues:11
Language:PythonLicense:Apache-2.0Stargazers:75Issues:10Issues:2

llm-deploy

大模型/LLM推理和部署理论与实践

Magic-Doc

conversion doc(pdf/html/doc/docx/ppt/pptx)to markdown

Language:PythonLicense:Apache-2.0Stargazers:31Issues:1Issues:4