HuangXinzhe's starred repositories

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.

Language: Python | License: Apache-2.0 | Stargazers: 3144 | Issues: 0

awesome-LLM-resourses

🧑‍🚀 The world's best summary of Chinese LLM resources

Stargazers: 675 | Issues: 0

HuixiangDou

HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance

Language: Python | License: BSD-3-Clause | Stargazers: 1352 | Issues: 0

WanJuan1.0

WanJuan 1.0 multimodal corpus

License: CC-BY-4.0 | Stargazers: 523 | Issues: 0

marker

Convert PDF to markdown quickly with high accuracy

Language: Python | License: GPL-3.0 | Stargazers: 15969 | Issues: 0

nano-llama31

nanoGPT style version of Llama 3.1

Language: Python | Stargazers: 1107 | Issues: 0

MinerU

A one-stop, open-source, high-quality data extraction tool that supports extraction from PDFs, webpages, and multi-format e-books.

Language: Python | License: AGPL-3.0 | Stargazers: 10195 | Issues: 0

haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search, or conversational agent chatbots.

Language: Python | License: Apache-2.0 | Stargazers: 15976 | Issues: 0

langgraph

Build resilient language agents as graphs.

Language: Python | License: MIT | Stargazers: 5336 | Issues: 0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers: 27404 | Issues: 0

storm

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Language: Python | License: MIT | Stargazers: 9995 | Issues: 0

Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): a local-knowledge-based RAG and Agent app built with Langchain and language models such as ChatGLM, Qwen, and Llama

Language: TypeScript | License: Apache-2.0 | Stargazers: 30915 | Issues: 0

Awesome-Text2SQL

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis, and more.

License: MIT | Stargazers: 1548 | Issues: 0

dive-into-llms

Dive into LLMs: a series of hands-on programming tutorials

Stargazers: 3091 | Issues: 0

gptpdf

Using GPT to parse PDF

Language: Python | License: MIT | Stargazers: 2707 | Issues: 0

Awesome-AGI

A collection of AGI learning resources (mainly covering LLM and AIGC), continuously updated.

Language: Jupyter Notebook | Stargazers: 263 | Issues: 0

genai-handbook.github.io

A roadmap for "generative AI" learning resources

Language: CSS | Stargazers: 138 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | Stargazers: 5969 | Issues: 0

llm-autoeval

Automatically evaluate your LLMs in Google Colab

Language: Python | License: MIT | Stargazers: 497 | Issues: 0

evalscope

A streamlined and customizable framework for efficient large model evaluation and performance benchmarking

Language: Python | License: Apache-2.0 | Stargazers: 155 | Issues: 0

tiny-llm-zh

Implementing a small-parameter Chinese large language model from scratch.

Language: Python | Stargazers: 158 | Issues: 0

llms-from-scratch-cn

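Build a large language model from scratch with only basic Python knowledge; build GLM4, Llama3, and RWKV6 step by step from scratch to gain an in-depth understanding of how large models work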

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 942 | Issues: 0

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 25199 | Issues: 0

llama3-from-scratch

A llama3 implementation, one matrix multiplication at a time

Language: Jupyter Notebook | License: MIT | Stargazers: 12851 | Issues: 0

unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML | License: Apache-2.0 | Stargazers: 8210 | Issues: 0

openaiee

OpenaiEE integrates multiple API services, including OpenAI, Groq, Gemini, and Claude, and can be deployed quickly on platforms such as Vercel and Netlify.

Language: JavaScript | License: MIT | Stargazers: 57 | Issues: 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model with performance approaching GPT-4o.

Language: Python | License: MIT | Stargazers: 5263 | Issues: 0

tiny-universe

A White-Box Guide to Building Large Models: a fully hand-built Tiny-Universe

Language: Python | Stargazers: 813 | Issues: 0

llm-universe

This project is an LLM application development tutorial for beginner developers. Read online: https://datawhalechina.github.io/llm-universe/

Language: Jupyter Notebook | Stargazers: 4221 | Issues: 0

pykan

Kolmogorov Arnold Networks

Language: Jupyter Notebook | License: MIT | Stargazers: 14322 | Issues: 0