Beast code in Giters

Xiaojian Yuan's starred repositories

circuit-breakers

4900

L1B3RT45

J41LBR34K PR0MPT5 F0R 4LL M4J0R LLM5

AGPL-3.098400

SOUL

Official repo for paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"

Language:PythonMIT900

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language:PythonMIT125600

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION2192600

negative-preference-optimization

Language:Python1400

llm_unlearn

LLM Unlearning

Language:PythonMIT8400

tofu

Landing Page for TOFU

Language:PythonMIT6000

awesome-machine-unlearning

Awesome Machine Unlearning (A Survey of Machine Unlearning)

Language:Jupyter NotebookMIT63000

LLM_unlearning

Language:Python600

awesome-llm-unlearning

A resource repository for machine unlearning in large language models

Apache-2.04500

llm-adaptive-attacks

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [arXiv, Apr 2024]

Language:ShellMIT11200

LLM-Safeguard

Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"

Language:Python4200

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language:Jinja29000

rpo

Official repository for paper, "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"

Language:Python3000

llm_attack_defense_arena

Language:Python5800

MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonMIT1355600

DeepLearing-Interview-Awesome-2024

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目

104000

grok-1

Grok open release

Language:PythonApache-2.04901500

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonApache-2.01458000

ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Language:PythonMIT5200

EasyJailbreak

An easy-to-use Python framework to generate adversarial jailbreak prompts.

Language:PythonGPL-3.028800

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.03282600

transformer-debugger

Language:PythonMIT393200

ShadowAlignment

Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models

Language:PythonApache-2.01900

weak-to-strong

Weak-to-Strong Jailbreaking on Large Language Models

Language:PythonMIT4900

CLIPInversion

What do we learn from inverting CLIP models?

Language:Python3000

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonApache-2.02409300

llm-attacks

Universal and Transferable Attacks on Aligned Language Models

Language:PythonMIT299100

SafeDecoding

Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

Language:Jupyter NotebookMIT6000