Hamed (Dufumiza)

Dufumiza

Geek Repo

Location:Jeddah, Saudi Arabia

Github PK Tool:Github PK Tool

Hamed's repositories

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

License:Apache-2.0Stargazers:0Issues:0Issues:0

chatbot-ui

The open-source AI chat app for everyone.

License:MITStargazers:0Issues:0Issues:0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

License:Apache-2.0Stargazers:0Issues:0Issues:0

faiss

A library for efficient similarity search and clustering of dense vectors.

License:MITStargazers:0Issues:0Issues:0

promptbench

A unified evaluation framework for large language models

License:MITStargazers:0Issues:0Issues:0

LLMLingua

To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

License:MITStargazers:0Issues:0Issues:0

promptsource

Toolkit for creating, sharing and using natural language prompts.

License:Apache-2.0Stargazers:0Issues:0Issues:0

auto-cot

Official implementation for "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

License:Apache-2.0Stargazers:0Issues:0Issues:0