Uranus (UranusSeven)

UranusSeven

Geek Repo

Company:Xprobe

Github PK Tool:Github PK Tool

Uranus's starred repositories

ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

Language:TypeScriptLicense:MITStargazers:72671Issues:405Issues:2757

Ventoy

A new bootable USB solution.

Language:CLicense:GPL-3.0Stargazers:60350Issues:644Issues:2186

lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.

Language:TypeScriptLicense:NOASSERTIONStargazers:34838Issues:172Issues:1626

chatbot-ui

AI chat for every model.

Language:TypeScriptLicense:MITStargazers:27330Issues:243Issues:935

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:22918Issues:187Issues:189

tabby

Self-hosted AI coding assistant

Language:RustLicense:NOASSERTIONStargazers:18320Issues:95Issues:558

presto

The official home of the Presto distributed SQL query engine for big data

Language:JavaLicense:Apache-2.0Stargazers:15752Issues:862Issues:6435

LibreChat

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development

Language:TypeScriptLicense:MITStargazers:15273Issues:112Issues:1026

trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Language:JavaLicense:Apache-2.0Stargazers:9869Issues:170Issues:6389
Language:PythonLicense:Apache-2.0Stargazers:8793Issues:83Issues:1802

outlines

Structured Text Generation

Language:PythonLicense:Apache-2.0Stargazers:7026Issues:44Issues:498

gateway

A Blazing Fast AI Gateway. Route to 200+ LLMs with 1 fast & friendly API.

Language:TypeScriptLicense:MITStargazers:5195Issues:35Issues:229
Language:PythonLicense:Apache-2.0Stargazers:4489Issues:49Issues:825

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Language:PythonLicense:MITStargazers:1436Issues:31Issues:76

Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

ring-flash-attention

Ring attention implementation with flash attention

ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Language:PythonLicense:MITStargazers:403Issues:8Issues:10

Consistency_LLM

[ICML 2024] CLLMs: Consistency Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:317Issues:9Issues:9

msccl

Microsoft Collective Communication Library

Language:C++License:NOASSERTIONStargazers:270Issues:13Issues:27

MatmulTutorial

A Easy-to-understand TensorOp Matmul Tutorial

Language:C++License:Apache-2.0Stargazers:218Issues:8Issues:9

long-context-attention

Sequence Parallel Attention for Long Context LLM Model Training and Inference

KIVI

KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache

Language:PythonLicense:MITStargazers:166Issues:3Issues:20

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonLicense:Apache-2.0Stargazers:147Issues:5Issues:11

libflash_attn

Standalone Flash Attention v2 kernel without libtorch dependency

Language:C++License:BSD-3-ClauseStargazers:78Issues:15Issues:4

flux

A fast communication-overlapping library for tensor parallelism on GPUs.

Language:C++License:Apache-2.0Stargazers:68Issues:6Issues:3