chanmuzi / Papers

A list of papers I've read for my research or interests, with short or long summaries

LLM (Large Language Model)

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.02	Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models	Griffin, Hawk	Google DeepMind	Blog
2024.02	OLMo: Accelerating the Science of Language Models	Open Source, OLMo	AI2	Markdown
2024.01	Orion-14B: Open-source Multilingual Large Language Models	Open Source, MLLM	OrionStar Inc.	Markdown
2023.05	PaLM2 Technical Report	PaLM2	Google	Blog
2023.04	Scaling Transformer to 1M tokens and beyond with RMT	Attention, RMT	Neural Networks and Deep Learning Lab, MIPT, Dolgoprudny, Russia	Blog
2023.03	GPT-4 Technical Report	GPT-4	OpenAI	Blog
2019.05	BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding	BERT	Google AI Language	Blog	NAACL-HLT 2019

Reasoning

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.02	Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models	Boosting of Thoughts (BoT)	University of Toronto	Markdown	ICLR 2024
2024.02	Chain-of-Thought Reasoning Without Prompting	CoT-decoding	Google DeepMind	Blog
2024.02	Self-Discover: Large Language Models Self-Compose Reasoning Structures	Decompose, Reasoning Structure	University of Southern California	Blog
2024.02	A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains	Benchmark, REVEAL	Google	Blog
2024.01	LLMs cannot find reasoning errors, but can correct them!	CoT	Google Research	Markdown
2023.10	Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning	Prompt Decomposition	IIT Delhi, India	Markdown	EMNLP 2023
2023.05	Learning to Reason and Memorize with Self-Notes	Self-Notes	Meta AI, NVIDIA	Blog	NeurIPS 2023 Poster
2023.04	Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System	Self-Controlled Memory (SCM)	ByteDance AI Lab	Blog
2023.04	Least-to-Most Prompting Enables Complex Reasoning in Large Language Models	least-to-most prompting	Google Research	Blog	ICLR 2023
2023.04	PINTO: Faithful Language Reasoning Using Prompt-Generated Rationales	Rationale	University of Southern California	Blog	ICLR 2023
2023.04	Answering Questions by Meta-Reasoning over Multiple Chains of Thought	CoT, Meta-Reasoning	AI2	Blog	EMNLP 2023
2023.01	Specializing Smaller Language Models towards Multi-Step Reasoning	Knowledge Distillation, Reasoning	AI2	Markdown

RAG (Retrieval Augmented Generation)

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.03	Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering	LLMQA	Renmin University	Markdown	WWW 2024
2024.01	RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval	tree, summarization	Stanford University	Markdown	ICLR 2024
2024.01	A Survey on Evaluation of Large Language Models	Evaluation, Survey	...	Markdown	ACM
2024.01	The Power of Noise: Redefining Retrieval for RAG Systems	Noise, RAG	...	Markdown	ACM
2024.01	Corrective Retrieval Augmented Generation	RAG, CRAG	University of Science and Technology of China, Google Research	Blog
2023.12	RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze!	Reranking	University of Waterloo	Markdown
2023.10	Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents	Re-Ranking	Baidu	Markdown
2023.10	Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!	Reranking	Nanyang Technological University	Markdown
2023.10	Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection	Self, RAG	University of Washington, AI2, IBM Research AI	Markdown
2023.02	Augmented Language Models: a Survey	RAG, Survey	Meta	Markdown
2021.10	BEIR: A Heterogeneous Benchmark for Zero-shot Evaluation of Information Retrieval Models	IR Benchmark	UKP-TUDA	Markdown	NeurIPS 2021

Training

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.01	Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation	RELC, RL, PPO	McGill, Toronto, Mila, Google Research	Blog	ACL 2024 under review
2024.01	Can AI Assistants Know What They Don't Know?	Hallucination	Fudan University	Blog
2024.01	Tuning Language Models by Proxy	Proxy-tuning	Allen Institute for AI	Blog
2024.01	Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models	Self-Play	...	Markdown
2024.01	Knowledge Fusion of Large Language Models	Knowledge Fusion	Tencent AI Lab	Blog
2023.12	Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision	superhuman model	OpenAI	Blog
2023.12	Select, Prompt, Filter: Distilling Large Language Models for Summarizing Conversations	Knowledge Distillation	Zoom Video Communications	Markdown	EMNLP 2023
2023.12	Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models	Self-Training, ReST	Google DeepMind	Markdown
2023.10	SELF: Self-Evolution with Language Feedback	SELF	The University of Hong Kong	Blog
2023.03	oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes	RoBERTa, Knowledge Distillation	Department of Computer Science, the University of Illinois Urbana-Champaign	Blog

Prompting

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.02	Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge	HSP, Summarization (not mentioned though)	National University of Singapore	Markdown
2024.02	A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts	Gist Memory, ReadAgent	Google DeepMind	Markdown
2024.01	Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding	Meta-Prompting	Stanford University, OpenAI	Markdown

PEFT

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.02	ResLoRA: Identity Residual Mapping in Low-Rank Adaption	ResLoRA, Merge	Microsoft	Blog
2024.02	LoRA+: Efficient Low Rank Adaptation of Large Models	LoRA+	UC Berkeley	Markdown
2023.04	LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model	LLaMA, Visual Instruction	Shanghai Artificial Intelligence Laboratory	Blog

Compression

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.01	SliceGPT: Compress Large Language Models by Deleting Rows and Columns	Model Compression	Microsoft Research	Markdown
2023.04	Learning to Compress Prompts with Gist Tokens	Gist Token, Compression	Stanford University	Blog	NeurIPS 2023
2022.05	Matryoshka Representation Learning	Matryoshka, Adaptive	Google Research	Markdown	NeurIPS 2022

Tool, Agent

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2023.04	Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models	Tools, Chameleon	University of California, Los Angeles, Microsoft Research	Blog	NeurIPS 2023

Evaluation / Benchmark / Dataset

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.03	RewardBench: Evaluating Reward Models for Language Modeling	Reward Model, DPO	AI2	Markdown
2024.02	Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation	Bonito, Instruction Tuning Dataset	Brown University	Markdown	ACL 2024 submission
2024.02	LLM-based NLG Evaluation: Current Status and Challenges	NLG, Evaluation	Peking University	Markdown
2024.02	Can Large Language Models Understand Context?	Benchmark, In-Context Learning	Apple	Markdown	Findings of EACL 2024
2024.01	Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research	Dolma, Dataset, Pretraining	AI2	Markdown
2023.05	Do Large Language Models Know What They Don't Know?	Self-Knowledge, SelfAware (Dataset)	School of Computer Science, Fudan University	Markdown
2023.04	Are Emergent Abilities of Large Language Models a Mirage?	Emergent Ability	Stanford University	Blog	NeurIPS 2023 oral
2023.04	Evaluating Verifiability in Generative Search Engines	Generative Search Engine	Stanford University	Blog	Findings of EMNLP 2023
2016.10	SQuAD: 100,000+ Questions for Machine Comprehension of Text	SQuAD, QA, Benchmark	Stanford University	Blog

Others

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2024.02	The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits	1-bit, BitNet b1.58	Microsoft Research	Blog
2024.02	LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens	LongRoPE, 2048k tokens	Microsoft Research	Markdown
2023.04	ResiDual: Transformer with Dual Residual Connections	Layer Normalization	Microsoft Research	Blog	ICLR 2024 rejected

Multimodal

Date	Title (arxiv)	Keyword	Affiliation	Note	Conference
2023.03	Visual Instruction Tuning	LLaVA, Visual Instruction Tuning	Microsoft Research	Blog	NeurIPS 2023 oral
