Welcome to Awesome AI Research

NB: This repo is a WIP and I hope to keep refreshing and extending it as time allows

Welcome to awesome-ai-research, a personal collection of research papers that have piqued my interest in the expansive field of artificial intelligence, especially focusing on Large Language Models (LLMs) and their multifaceted applications.

Artificial intelligence is a dynamic field where innovation is constant. The papers included here reflect a range of work that I've found interesting - you may or may not too! :)

Here, you'll discover papers covering everything from enhancing the reasoning abilities of AI systems to innovative approaches for training and benchmarking. Each selection is driven by a personal interest and a belief in the importance of the work, rather than a comprehensive or systematic review.

I hope this collection serves as a springboard for your exploration and inspires a deeper understanding and appreciation of artificial intelligence.

Reasoning

This section presents papers exploring advanced reasoning capabilities of Large Language Models (LLMs). Topics include self-improvement techniques, multi-agent systems, ethical reasoning, mathematical reasoning, and approaches to enhance LLMs' ability to process and generate logically coherent and contextually relevant responses. Expect insights on enhancing code generation, debugging, and solving complex algorithmic challenges.

Context window

Papers in this section address how expanding the contextual awareness of models can significantly enhance memory and attention mechanisms. This includes exploring architectures that handle long-term dependencies and infinite contexts to improve the working memory of transformers.

Extension
- 2024

Operational efficiency

Focuses on optimizing the computational efficiency and practical deployment of LLMs. This includes reducing the computational load, increasing inference speed, improving resource allocation, and strategies to minimize costs without sacrificing performance.

Hallucination/Factuality

This segment covers research on minimizing hallucinations—false or misleading information generated by LLMs. Papers propose various techniques for enhancing the fact-checking capabilities of models, ensuring more reliable and factual outputs.

2023

Specific niche/topics

Includes papers that apply LLMs to specialized fields such as data science, DevOps, and personalized learning. These studies explore how LLMs can be tailored to specific professional tasks, including automated code generation and data interpretation.

Data Science
- 2024
  - DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning (04/2024)
  - DATA INTERPRETER: AN LLM AGENT FOR DATA SCIENCE
Coding (and related)
- 2023

Evaluation/benchmarking

This section presents methodologies for assessing the performance of LLMs, particularly in handling extensive contexts and complex reasoning tasks. It includes innovative evaluation strategies that aim to provide more comprehensive and practical assessments of model capabilities.

2024
- Counting-Stars (★): A Simple, Efficient, and Reasonable Strategy for Evaluating Long-Context Large Language Models

Training

Discusses methods and strategies for the continuous training and improvement of LLMs. Topics include token efficiency, scalable training strategies, and data autonomy to enhance the learning process without extensive resources.

General
- 2024
  - RHO-1: Not All Tokens Are What You Need (04/2024)
  - ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy (03/2024)
  - Rephrasing the Web (from Apple) (01/2024)
- 2023
  - Agent LUMOS: Unified and Modular Training for Open-Source Language Agents (11/2023)
"Continuous" learning
- 2024
  - Simple and Scalable Strategies to Continually Pre-train Large Language Models
  - Larimar: Large Language Models with Episodic Memory Control

Tools/tool use

Explores how LLMs can be integrated with external tools or APIs to extend their functionality beyond pure text processing. This includes learning to interact with and manipulate external software tools or databases, thereby broadening the practical applications of LLMs.

2023
- Gorilla: Large Language Model Connected with Massive APIs (05/2023)
- GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution (07/2023)
- Tool Learning with Foundation Models (04/2023)

About

A collection of research papers (mostly), that I've personally found interesting.