Zhilin Wang's repositories
graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
cartesia-python
The official Python library for the Cartesia API
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
langchain
🦜🔗 Build context-aware reasoning applications
Scrapegraph-ai
Python scraper based on AI
ragas
Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
txtai
💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
build-nanogpt
Video+code lecture on building nanoGPT from scratch
manim
Animation engine for explanatory math videos
rag-search
RAG Search API
search2ai
Help your LLMs go online
langgraph
Build resilient language agents as graphs.
phidata
Build AI Assistants with memory, knowledge and tools.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
Qwen1.5
Qwen1.5 is the improved version of Qwen, the large language model series developed by Qwen team, Alibaba Cloud.
Llama-Chinese
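Llama Chinese community. The Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.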
GPTInterviewer
GPT Interviewer - Practice interview with AI interviewer based on job descriptions and resume
gpt4-pdf-chatbot-langchain
GPT4 & LangChain Chatbot for large PDF docs
search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
ollama
Get up and running with Llama 2, Mistral, and other large language models locally.
MiniCPM
MiniCPM-2B: an end-side LLM that outperforms Llama2-13B.
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.