Beast code in Giters

huziyuan14's starred repositories

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language:PythonMIT400500

RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

92500

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language:PythonMIT607000

Langchain-Chatchat（原Langchain-ChatGLM, Qwen 与 Llama 等）基于 Langchain 与 ChatGLM 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

Language:TypeScriptApache-2.02988300

grok-1

Grok open release

Language:PythonApache-2.04918100

hugging-multi-agent

A tutorial based on MetaGPT to quickly help you understand the concept of agent and muti-agent and get started with coding development. 基于MetaGPT的多智能体入门与开发教程

Language:CSS127400

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonMIT464600

aliendao

huggingface mirror download

Language:PythonMIT53800

Phi2-mini-Chinese

Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型，支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.

Language:Jupyter NotebookApache-2.043200

ChatLM-mini-Chinese

中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调，给出三元组信息抽取微调示例。

Language:PythonApache-2.0100000

NLP

Some application models of natural language processing

Language:Python600

Chinese-Llama-2-7b

开源社区第一个能下载、能运行的中文 LLaMA2 模型！

Language:PythonApache-2.0222100

Linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型；ChatFlow中文对话模型；中文OpenLLaMA模型；NLP预训练/指令微调数据集

Language:Python301200

CLUEDatasetSearch

搜索所有中文NLP数据集，附常用英文NLP数据集

Language:Python402100

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonApache-2.0445700

torchkeras

Pytorch❤️ Keras 😋😋

Language:Jupyter NotebookApache-2.0150300

ChineseNLPCorpus

中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。

Language:Python416500

albert_pytorch

A Lite Bert For Self-Supervised Learning Language Representations

Language:PythonApache-2.070800

LLM-Tuning

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

Language:HTML94200

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！

Language:Jupyter NotebookApache-2.0252400

huziyuan14

huziyuan14's starred repositories

RGB

Chinese-CLIP

RAG-Survey

FlagEmbedding

Langchain-Chatchat

grok-1

hugging-multi-agent

AppAgent

aliendao

Phi2-mini-Chinese

ChatLM-mini-Chinese

NLP

Chinese-Llama-2-7b

Linly

CLUEDatasetSearch

RedPajama-Data

torchkeras

ChineseNLPCorpus

albert_pytorch

LLM-Tuning

Alpaca-CoT

ColossalAI

P-tuning-v2

fastllm

FastChat

ChatGLM-Finetuning

chatGLM-6B-QLoRA

ChatGLM-Tuning

transformers_tasks

ChatGLM-6B