Reacubeth (xyjigsaw)



Location:Shanghai, China

Home Page:omegaxyz.com

Twitter:@noverfitting


Reacubeth's starred repositories

gpt4all

gpt4all: run open-source LLMs anywhere

Open-Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and can retrieve information dynamically to do so.

Language:Python | License:Apache-2.0 | Stargazers:36802 | Issues:425 | Issues:1643

HowToLiveLonger

A programmer's guide to living longer

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:Python | License:Apache-2.0 | Stargazers:29040 | Issues:342 | Issues:267

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:Python | License:Apache-2.0 | Stargazers:24437 | Issues:168 | Issues:3929

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter Notebook | License:Apache-2.0 | Stargazers:18327 | Issues:155 | Issues:467

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Language:Python | License:Apache-2.0 | Stargazers:17781 | Issues:185 | Issues:728

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

ChatALL

Chat concurrently with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, iFlytek Spark (讯飞星火), ERNIE Bot (文心一言), and more to discover the best answers

Language:JavaScript | License:Apache-2.0 | Stargazers:14565 | Issues:120 | Issues:516

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language:Python | License:Apache-2.0 | Stargazers:11869 | Issues:124 | Issues:353

Megatron-LM

Ongoing research training transformer models at scale

Language:Python | License:NOASSERTION | Stargazers:9095 | Issues:157 | Issues:560

trl

Train transformer language models with reinforcement learning.

Language:Python | License:Apache-2.0 | Stargazers:8514 | Issues:78 | Issues:950

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:Python | License:Apache-2.0 | Stargazers:4101 | Issues:112 | Issues:119

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language:Python | License:Apache-2.0 | Stargazers:4011 | Issues:40 | Issues:385

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We built this fine-tuning platform so that researchers can easily get started with and use large models. We welcome open-source enthusiasts to open any meaningful PR on this repo and integrate as many LLM-related technologies as possible.

Language:Jupyter Notebook | License:Apache-2.0 | Stargazers:2506 | Issues:37 | Issues:98

ESL-CN

Chinese translation, code implementations, and exercise solutions for The Elements of Statistical Learning (ESL).

Language:Jupyter Notebook | License:GPL-3.0 | Stargazers:2378 | Issues:70 | Issues:238

NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

ML_Notes

Formula derivations and NumPy implementations of machine learning algorithms

Language:Jupyter Notebook | Stargazers:1945 | Issues:28 | Issues:3

Orion

Orion-14B is a family of models that includes a multilingual 14B-parameter foundation LLM and a series of derived models: a chat model, a long-context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model.

Language:Python | License:Apache-2.0 | Stargazers:771 | Issues:9 | Issues:43

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language:Jupyter Notebook | License:MIT | Stargazers:613 | Issues:29 | Issues:35

EvaluationPapers4ChatGPT

Resource, Evaluation and Detection Papers for ChatGPT

HaluEval

This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.

Language:Python | License:MIT | Stargazers:333 | Issues:8 | Issues:11

Awesome-Scientific-Language-Models

A Curated List of Language Models in Scientific Domains

License:MIT | Stargazers:293 | Issues:7 | Issues:0

AutoCompressors

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

nosync-icloud

Prevent iCloud from syncing node_modules

Language:JavaScript | License:MIT | Stargazers:194 | Issues:3 | Issues:7

k2

Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024

Language:Python | License:Apache-2.0 | Stargazers:154 | Issues:5 | Issues:11

geogalactica

Code and datasets for paper "GeoGalactica: A Scientific Large Language Model in Geoscience"

Language:Python | License:Apache-2.0 | Stargazers:12 | Issues:1 | Issues:2

GPT2-Knowledge-Distillation

Knowledge distillation of a GPT student model from GPT-medium on the tiny Shakespeare dataset

Language:Jupyter Notebook | License:MIT | Stargazers:5 | Issues:0 | Issues:0