唐国梁Tommy's repositories
LangChain_LLM_ChatBot
A QA chatbot over local documents, built with LLMs and LangChain
Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): a local knowledge-base QA app built with Langchain and language models such as ChatGLM
Personal_Paper_Reading
Share the papers and articles that I have read
agents
An Open-source Framework for Autonomous Language Agents
ChatDB
The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".
ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs
dolma
Data and tools for generating and inspecting OLMo pre-training data.
flash-attention
Fast and memory-efficient exact attention
langchain
⚡ Building applications with LLMs through composability ⚡
LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cuts the weaknesses through ranking and integrates the strengths through fusing generation to enhance the capability of LLMs.
mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
MLproduction
Study how to deploy ML or DL models in production efficiently
neural-compressor
Provide unified APIs for SOTA model compression techniques, such as low precision (INT8/INT4/FP4/NF4) quantization, sparsity, pruning, and knowledge distillation on mainstream AI frameworks such as TensorFlow, PyTorch, and ONNX Runtime.
promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
streaming-llm
Efficient Streaming Language Models with Attention Sinks
Yi
A series of large language models trained from scratch by developers @01-ai