michael's repositories
verl
veRL: Volcano Engine Reinforcement Learning for LLM
zerox
Zero shot pdf OCR with gpt-4o-mini
EasySpider
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
Auto_Jobs_Applier_AIHawk
Auto_Jobs_Applier_AIHawk is a tool that automates the jobs application process. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personalized way.
LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
GoMate
GoMate:RAG Framework within Reliable input,Trusted output
Liger-Kernel
Efficient Triton Kernels for LLM Training
ChatTTS
A generative speech model for daily dialogue.
MinerU
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Weekly-Top-LLM-Papers
Curated list of weekly published LLM papers
sglang
SGLang is yet another fast serving framework for large language models and vision language models.
mem0
The memory layer for Personalized AI
graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
CodeGeeX4
CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more.
era-cot
[ACL 2024] ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis.
equibot
Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
bosszp-selenium
使用python+selenium完成对boss互联网相关岗位的数据爬取
GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
tree2retriever
Recursive Abstractive Processing for Tree-Organized Retrieval
dataherald
Interact with your SQL database, Natural Language to SQL using LLMs