dage1210

dage1210

Geek Repo

Github PK Tool:Github PK Tool

dage1210's starred repositories

ChatLM-mini-Chinese

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Language:PythonLicense:Apache-2.0Stargazers:1100Issues:0Issues:0

TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Language:PythonLicense:Apache-2.0Stargazers:548Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:63486Issues:0Issues:0

Langchain-Chatchat

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptLicense:Apache-2.0Stargazers:30913Issues:0Issues:0

awesome-llm-understanding-mechanism

awesome papers in LLM interpretability

Stargazers:215Issues:0Issues:0

statistic

collecting books, papers and docs.

Stargazers:2211Issues:0Issues:0

LLM-RAG-QA

LLM+RAG for QA

Language:PythonLicense:MITStargazers:19Issues:0Issues:0

notebooks

Notebooks using the Hugging Face libraries 🤗

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3531Issues:0Issues:0

evaluate

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Language:PythonLicense:Apache-2.0Stargazers:1941Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:131243Issues:0Issues:0

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonLicense:Apache-2.0Stargazers:7581Issues:0Issues:0

PyTorch-DDPM

500 行代码实现降噪扩散模型 DDPM,干净无依赖

Language:Jupyter NotebookStargazers:127Issues:0Issues:0

ms-swift

Use PEFT or Full-parameter to finetune 300+ LLMs or 60+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Language:PythonLicense:Apache-2.0Stargazers:3096Issues:0Issues:0
Language:PythonStargazers:208Issues:0Issues:0

Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

Language:MATLABStargazers:3143Issues:0Issues:0

BaiYang-chatGLM2-6B

(1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。

Language:PythonStargazers:45Issues:0Issues:0

chatGLM-6B-QLoRA

使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。

Language:PythonStargazers:348Issues:0Issues:0

chatgpt-comparison-detection

Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

Language:PythonStargazers:1241Issues:0Issues:0