Meizhen's starred repositories
Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 LLMs (phase-2 project), including 64K long-context models
LeetCode-Py
⛽️ "Algorithm Pass Handbook": a highly detailed tutorial on algorithm and data-structure fundamentals, starting from zero, with detailed solutions to 850+ LeetCode problems and 200 popular big-tech interview questions.
Fengshenbang-LM
Fengshenbang-LM is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at IDEA, serving as infrastructure for Chinese AIGC and cognitive intelligence.
ms-swift
Use PEFT or full-parameter training to fine-tune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
llm-datasets
High-quality datasets, tools, and concepts for LLM fine-tuning.
LLMTest_NeedleInAHaystack
Simple retrieval from LLMs at various context lengths to measure accuracy
self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI
machine-learning-interview
A summary of machine-learning interview questions for algorithm engineers
awesome-vision-language-pretraining-papers
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
PrefixTuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation
IncarnaMind
Connect and chat with multiple documents (PDF and TXT) through GPT-3.5, GPT-4 Turbo, Claude, and local open-source LLMs
Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch, developed by Alibaba Cloud for large-scale LLM & VLM training.
train-CLIP
A PyTorch Lightning solution for training OpenAI's CLIP from scratch.
awesome-fairness-papers
Papers on fairness in NLP
In-Context-Learning_PaperList
Paper List for In-context Learning 🌷
bias-bench
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
Mitigate-Gender-Bias-in-Image-Search
Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arxiv.org/abs/2109.05433
infoxlm_paddle
An implementation of InfoXLM's codebase and training process in PaddlePaddle