Nealcly

Nealcly's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language: Python | NOASSERTION | 23358 191 196

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language: Python | MIT | 6119 68 151

transformer-debugger

Language: Python | MIT | 3981 26 14

weak-to-strong

Language: Python | MIT | 2459 33 18

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language: Python | Apache-2.0 | 1763 21 179

OpenMoE

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Language: Python | 1304 14 8

llm-reasoners

A library for advanced large language model reasoning

Language: Python | Apache-2.0 | 1000 14 31

awesome-mixture-of-experts

A collection of AWESOME things about mixture-of-experts

847 21 2

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | Apache-2.0 | 806 8 18

sort-google-scholar

Sorting Google Scholar search results based on the number of citations

Language: Python | 729 17 24

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language: Jupyter Notebook | MIT | 650 29 35

vec2text

Utilities for decoding deep representations (like sentence embeddings) back to text

Language: Python | NOASSERTION | 650 13 42

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language: Python | Apache-2.0 | 634 6 20

inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language: C++ | MIT | 232 7 16

Selective_Context

Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.

Language: Python | 212 3 23

speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Language: Python | MIT | 169 8 2

BotChat

Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.

Language: Jupyter Notebook | Apache-2.0 | 116 2 1

world-model-for-language-model

Language: Python | 99 1 7

TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Language: Python | GPL-3.0 | 81 4 3

CLEX

[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models

Language: Python | MIT | 70 4 8

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Language: Python | MIT | 55 3 2

GSM-Plus

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Language: Python | 28 1 3

drpo

Dataset Reset Policy Optimization

Language: Python | Apache-2.0 | 25 20

ALCUNA

[EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge

MIT | 19 1 1

CaRing

Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

Language: Python | MIT | 15 50

Code for paper - On Diversified Preferences of Large Language Model Alignment

Language: Python | 13 20