Yichi Zhang's starred repositories
Awesome-LLMs-in-Graph-tasks
A curated collection of research papers exploring the utilization of LLMs for graph-related tasks.
jailbreakbench
An Open Robustness Benchmark for Jailbreaking Language Models [arXiv 2024]
PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Awesome-LLM-Constrained-Decoding
A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.
open-llm-server
This Open LLM Framework serves as a powerful and flexible tool for serving endpoints for embeddings and chat completions using SOTA open source language models. By leveraging models Transformers, this enables various natural language processing (NLP) tasks to be performed via simple HTTP endpoints similar to openai endpoints.
graphrag-local-ollama
Local models support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3)- LLM & Embedding extraction
ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
EasyJailbreak
An easy-to-use Python framework to generate adversarial jailbreak prompts.
AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
modelbench
Run safety benchmarks against AI models and view detailed reports showing how well they performed.
DecodingTrust
A Comprehensive Assessment of Trustworthiness in GPT Models
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
llm-attacks
Universal and Transferable Attacks on Aligned Language Models
opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
LLM-Finetuning
LLM Finetuning with peft