Haolin Song (shl5133)

Company: Beijing Institute of Technology

Location: Beijing, China

Home Page: https://shl5133.github.io/

Haolin Song's starred repositories

gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

gpt-engineer

Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app

Language: Python | License: MIT | Stargazers: 52163 | Issues: 510 | Issues: 478

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 32288 | Issues: 204 | Issues: 4971

Flowise

Drag & drop UI to build your customized LLM flow

Language: TypeScript | License: Apache-2.0 | Stargazers: 30547 | Issues: 250 | Issues: 1385

semantic-kernel

Integrate cutting-edge LLM technology quickly and easily into your apps

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

Language: Python | License: NOASSERTION | Stargazers: 15704 | Issues: 132 | Issues: 615

SuperAGI

<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.

Language: Python | License: MIT | Stargazers: 15363 | Issues: 174 | Issues: 408

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13731 | Issues: 114 | Issues: 1064

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Language: Python | License: MIT | Stargazers: 8791 | Issues: 97 | Issues: 181

xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Language: Python | License: NOASSERTION | Stargazers: 8468 | Issues: 75 | Issues: 537

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

metaseq

Repo for external large-scale work

Language: Python | License: MIT | Stargazers: 6464 | Issues: 112 | Issues: 294

oneflow

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

Language: C++ | License: Apache-2.0 | Stargazers: 5877 | Issues: 144 | Issues: 969

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Language: Python | License: Apache-2.0 | Stargazers: 5671 | Issues: 67 | Issues: 129

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 4080 | Issues: 40 | Issues: 351

ChatGLM-Efficient-Tuning

Efficient fine-tuning of ChatGLM-6B with PEFT

Language: Python | License: Apache-2.0 | Stargazers: 3652 | Issues: 32 | Issues: 374

MNBVC

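MNBVC (Massive Never-ending BT Vast Chinese corpus) is an ultra-large-scale Chinese corpus that aims to match the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" internet slang. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki articles, classical poetry, song lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.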

ceval

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language: Python | License: MIT | Stargazers: 1615 | Issues: 15 | Issues: 81

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1581 | Issues: 20 | Issues: 0

pandallm

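Panda is an open-source overseas Chinese large language model project launched in May 2023. It is dedicated to exploring the entire technology stack of the large-model era and aims to promote innovation and collaboration in Chinese natural language processing.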

Language: Python | License: Apache-2.0 | Stargazers: 1065 | Issues: 38 | Issues: 34

summarize-from-feedback

Code for "Learning to summarize from human feedback"

Language: Python | License: NOASSERTION | Stargazers: 982 | Issues: 147 | Issues: 21

generative-agents

An attempt to build a working, locally-running cheap version of Generative Agents: Interactive Simulacra of Human Behavior

Language: Jupyter Notebook | License: MIT | Stargazers: 924 | Issues: 29 | Issues: 8

awesome-language-agents

List of language agents based on paper "Cognitive Architectures for Language Agents"

lorahub

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Language: Python | License: MIT | Stargazers: 582 | Issues: 11 | Issues: 22

LongChat

Official repository for LongChat and LongEval

Language: Python | License: Apache-2.0 | Stargazers: 507 | Issues: 10 | Issues: 37

Visual-Chinese-LLaMA-Alpaca

Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA)

Language: Python | License: Apache-2.0 | Stargazers: 416 | Issues: 9 | Issues: 13

RLHF-Label-Tool

A tool for manually annotating and ranking response data in the RLHF stage of large-model training.

lynx-llm

paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/

Language: Python | License: Apache-2.0 | Stargazers: 227 | Issues: 8 | Issues: 6

AdaMix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.12410).

Language: Python | License: Apache-2.0 | Stargazers: 126 | Issues: 5 | Issues: 1