Nealcly's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 23358 | Issues: 191 | Issues: 196

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language: Python | License: MIT | Stargazers: 6119 | Issues: 68 | Issues: 151

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python | License: Apache-2.0 | Stargazers: 1763 | Issues: 21 | Issues: 179

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

llm-reasoners

A library for advanced large language model reasoning

Language: Python | License: Apache-2.0 | Stargazers: 1000 | Issues: 14 | Issues: 31

awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 806 | Issues: 8 | Issues: 18

sort-google-scholar

Sorting Google Scholar search results based on the number of citations

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language: Jupyter Notebook | License: MIT | Stargazers: 650 | Issues: 29 | Issues: 35

vec2text

utilities for decoding deep representations (like sentence embeddings) back to text

Language: Python | License: NOASSERTION | Stargazers: 650 | Issues: 13 | Issues: 42

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language: Python | License: Apache-2.0 | Stargazers: 634 | Issues: 6 | Issues: 20

inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language: C++ | License: MIT | Stargazers: 232 | Issues: 7 | Issues: 16

Selective_Context

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language: Python | License: MIT | Stargazers: 169 | Issues: 8 | Issues: 2

BotChat

Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 116 | Issues: 2 | Issues: 1

TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Language: Python | License: GPL-3.0 | Stargazers: 81 | Issues: 4 | Issues: 3

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language: Python | License: MIT | Stargazers: 70 | Issues: 4 | Issues: 8

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Language: Python | License: MIT | Stargazers: 55 | Issues: 3 | Issues: 2

GSM-Plus

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

drpo

Dataset Reset Policy Optimization

Language: Python | License: Apache-2.0 | Stargazers: 25 | Issues: 2 | Issues: 0

ALCUNA

[EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge

CaRing

Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

Language: Python | License: MIT | Stargazers: 15 | Issues: 5 | Issues: 0

MORE

Code for paper - On Diversified Preferences of Large Language Model Alignment

Language: Python | Stargazers: 13 | Issues: 2 | Issues: 0

TACS

Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts

Language: Python | License: MIT | Stargazers: 11 | Issues: 0 | Issues: 0

CoEval

CoEval: a collaborative LLM-human evaluation pipeline.

Language: Python | License: Apache-2.0 | Stargazers: 6 | Issues: 1 | Issues: 0