Huiqiang Jiang's starred repositories
open-interpreter
A natural language interface for computers
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
alignment-handbook
Robust recipes to align language models with human and AI preferences
OpenAgents
OpenAgents: An Open Platform for Language Agents in the Wild
consistencydecoder
Consistency Distilled Diff VAE
prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
SparsePrimingRepresentations
Public repo to document some SPR stuff
LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
gpt_paper_assistant
GPT4 based personalized ArXiv paper assistant bot
flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores