Beast code in Giters

cytan's starred repositories

arxiv_crawler

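An efficient, fast arXiv paper crawler. It downloads information on papers that fall within a specified time range, belong to specified subjects, and contain specified keywords, and translates their titles and abstracts into Chinese.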

Language: Python | 300

learning-to-refuse

Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"

600

Say-I-Dont-Know

[ICML'2024] Can AI Assistants Know What They Don't Know?

Language: Python | 5900

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | Apache-2.0 | 2926100

probing-lm-data

Official Implementation of "Probing Language Models for Pre-training Data Detection"

Language: Python | 1500

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | Apache-2.0 | 818400

UHGEval

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA

Language: Python | Apache-2.0 | 17300

MoE-SFT

🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts

Language: Python | Apache-2.0 | 3300

HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Language: Python | Apache-2.0 | 66400

mink-plus-plus

Min-K%++: Improved baseline for detecting pre-training data of LLMs https://arxiv.org/abs/2404.02936

Language: Python | MIT | 2500

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | MIT | 617800

TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Language: Python | GPL-3.0 | 9400

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.

76500

llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

87700

DCR-consistency

DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models

Language: Python | Apache-2.0 | 2100

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | Apache-2.0 | 82700

TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Language: Jupyter Notebook | Apache-2.0 | 56200

UnknownBench

Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge

Language: Jupyter Notebook | 900

FalseQA

Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"

Language: Python | 2000

R-Tuning

[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"

Language: Python | 7900

SelfAware

Do Large Language Models Know What They Don’t Know?

Language: Python | Apache-2.0 | 8200

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | 1136900

cytan17726