wzljerry

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

GPL-3.0000

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.

000

Llama-Chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

000

GPTInterviewer

GPT Interviewer - Practice interview with AI interviewer based on job descriptions and resume

MIT000

gpt4-pdf-chatbot-langchain

GPT4 & LangChain Chatbot for large PDF docs

000

search_with_lepton

Building a quick conversation-based search demo with Lepton AI.

Apache-2.0000

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

MIT000

petals

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

MIT000

localllm

Apache-2.0000

ollama

Get up and running with Llama 2, Mistral, and other large language models locally.

MIT000

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Apache-2.0000

Anton

AI-powered Resume Generation Tool

Language:PythonMIT100

ng-video-lecture

000

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Apache-2.0000