Tat Trong Vu (tattrongvu)

tattrongvu

Geek Repo

Company:T-Systems International

Location:Hamburg, Germany

Github PK Tool:Github PK Tool

Tat Trong Vu's starred repositories

Language:JavaScriptStargazers:2Issues:0Issues:0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:34915Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonLicense:MITStargazers:5997Issues:0Issues:0

deepeval

The LLM Evaluation Framework

Language:PythonLicense:Apache-2.0Stargazers:2545Issues:0Issues:0

create-tsi

Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.

Language:TypeScriptStargazers:226Issues:0Issues:0

create-llama

The easiest way to get started with LlamaIndex

Language:TypeScriptLicense:MITStargazers:681Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7060Issues:0Issues:0

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:4197Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1137Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18435Issues:0Issues:0

asyncpg

A fast PostgreSQL Database Client Library for Python/asyncio.

Language:PythonLicense:Apache-2.0Stargazers:6792Issues:0Issues:0

S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Language:PythonLicense:Apache-2.0Stargazers:1654Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1056Issues:0Issues:0

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:15708Issues:0Issues:0

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:1273Issues:0Issues:0

state-of-open-source-ai

:closed_book: Clarity in the current fast-paced mess of Open Source innovation

Language:TeXLicense:NOASSERTIONStargazers:1479Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++License:Apache-2.0Stargazers:7729Issues:0Issues:0

LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference

Language:C++License:MITStargazers:22224Issues:0Issues:0

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:34015Issues:0Issues:0

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:165280Issues:0Issues:0

h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/

Language:PythonLicense:Apache-2.0Stargazers:11073Issues:0Issues:0

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

Language:PythonLicense:MITStargazers:8984Issues:0Issues:0

FlexGen

Running large language models on a single GPU for throughput-oriented scenarios.

Language:PythonLicense:Apache-2.0Stargazers:9095Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23905Issues:0Issues:0

pytriton

PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.

Language:PythonLicense:Apache-2.0Stargazers:695Issues:0Issues:0

ray-llm

RayLLM - LLMs on Ray

Language:PythonLicense:Apache-2.0Stargazers:1207Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54753Issues:0Issues:0

rag-stack

🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corporate oracle. Supports open-source LLMs like Llama 2, Falcon, and GPT4All.

Language:TypeScriptLicense:MITStargazers:1446Issues:0Issues:0

langflow

⛓️ Langflow is a visual framework for building multi-agent and RAG applications. It's open-source, Python-powered, fully customizable, model and vector store agnostic.

Language:JavaScriptLicense:MITStargazers:22736Issues:0Issues:0