EmbeddedLLM's repositories
SageAttention-rocm
ROCm Quantized Attention that achieves speedups of 2.1x and 2.7x over FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
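The kernel is meant as a drop-in replacement for scaled dot-product attention. A minimal sketch, assuming this ROCm port preserves the upstream SageAttention Python API (`sageattn`):

```python
import torch
from sageattention import sageattn

# q, k, v in half precision, laid out as (batch, heads, seq_len, head_dim).
# On ROCm builds of PyTorch, the "cuda" device string maps to HIP.
q = torch.randn(1, 8, 1024, 128, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Quantized attention, used where F.scaled_dot_product_attention would be.
out = sageattn(q, k, v, tensor_layout="HND", is_causal=False)
```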
vllmWorkshop
vLLM Workshop Content
flash-attention-docker
A repository with CI/CD pipelines that build Docker images with FlashAttention pre-compiled, to speed up development and deployment of other frameworks.
vllm-rocm
A high-throughput and memory-efficient inference and serving engine for LLMs
jamaibase-ts-docs
TypeScript documentation of the JamAI SDK
aiter
AI Tensor Engine for ROCm
aiter-api-watcher
A repository that monitors the fast-changing ROCm/aiter repository and alerts users when AITER functions of interest (e.g., those used in vLLM or SGLang) have been updated past a given commit.
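A hypothetical watch-list sketch illustrating the idea; the symbol names and commit hashes below are invented for illustration, not taken from the repository:

```python
# Invented example: map each aiter symbol a downstream project depends on to
# the last upstream commit at which it was verified, so a scheduled job can
# diff newer commits and raise an alert when a watched function changes.
WATCHLIST = {
    "aiter.ops.gemm_a8w8": "abc1234",   # e.g. used by a vLLM quantized linear path
    "aiter.flash_attn.fwd": "def5678",  # e.g. used by an SGLang attention backend
}
```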
axolotl-amd
Go ahead and axolotl questions
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
etalon
LLM Serving Performance Evaluation Harness
infinity-executable
Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting a wide range of text-embedding models and frameworks.
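A minimal sketch of embedding through Infinity's Python engine, assuming the upstream `infinity_emb` API:

```python
import asyncio
from infinity_emb import AsyncEmbeddingEngine, EngineArgs

# Build an engine around a sentence-transformers-style model.
engine = AsyncEmbeddingEngine.from_args(
    EngineArgs(model_name_or_path="BAAI/bge-small-en-v1.5", engine="torch")
)

async def main():
    async with engine:  # starts the dynamic batching loop
        embeddings, usage = await engine.embed(sentences=["hello world"])
    print(len(embeddings[0]), usage)

asyncio.run(main())
```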
Liger-Kernel
Efficient Triton Kernels for LLM Training
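Liger kernels are applied by patching Hugging Face model classes before instantiation. A minimal sketch, assuming the upstream Liger-Kernel patching API:

```python
from liger_kernel.transformers import apply_liger_kernel_to_llama
from transformers import AutoModelForCausalLM

# Swap Llama's RMSNorm, RoPE, SwiGLU, and cross-entropy for Liger's Triton
# kernels; must run before the model is created so the patched classes are used.
apply_liger_kernel_to_llama()
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
```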
litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
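The core idea is one OpenAI-shaped call for every backend. A minimal sketch using litellm's `completion` API:

```python
from litellm import completion

# The model string selects the provider; the call shape stays the same.
response = completion(
    model="gpt-4o",  # or e.g. "anthropic/claude-3-5-sonnet-20240620"
    messages=[{"role": "user", "content": "Say hello."}],
)
print(response.choices[0].message.content)
```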
LLM_Sizing_Guide
A calculator to estimate LLM memory footprint, capacity, and latency on NVIDIA, AMD, and Intel hardware.
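The kind of arithmetic such a calculator performs can be sketched in a few lines; the formulas below are standard back-of-envelope estimates, not the repository's exact method:

```python
def weights_gb(n_params_billion: float, bytes_per_param: float = 2.0) -> float:
    """Weight memory in GB: parameter count times bytes per parameter."""
    return n_params_billion * bytes_per_param

def kv_cache_gb(layers: int, kv_heads: int, head_dim: int,
                tokens: int, bytes_per_elem: float = 2.0) -> float:
    """KV cache per sequence: 2 (K and V) * layers * kv_heads * head_dim * tokens."""
    return 2 * layers * kv_heads * head_dim * tokens * bytes_per_elem / 1e9

print(weights_gb(70))                 # ~140 GB of fp16 weights for a 70B model
print(kv_cache_gb(80, 8, 128, 8192))  # ~2.7 GB KV cache for one 8K-token sequence
```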
lmcache-vllm
The driver for LMCache core to run in vLLM
Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
roxl
NVIDIA Inference Xfer Library (NIXL)
skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
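A minimal sketch of launching a GPU job with SkyPilot's Python API (the CLI equivalent is `sky launch`), assuming the documented `sky.Task`/`sky.launch` interface:

```python
import sky

# Describe the job: setup runs once per node, run is the entry point.
task = sky.Task(
    setup="pip install torch",
    run="python -c 'import torch; print(torch.cuda.is_available())'",
)
task.set_resources(sky.Resources(accelerators="A100:1"))

# SkyPilot provisions the cheapest available infra that satisfies the resources.
sky.launch(task, cluster_name="demo")
```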
Star-Attention
Efficient LLM Inference over Long Sequences
torchac_rocm
ROCm Implementation of torchac_cuda from LMCache