jwkangmarco / LLM-narratives

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

[스터디] 거꾸로 읽는 SSL 시즌4: Large Language Models and Alignment Learning

  • 신청 Google form (신청 마감)
  • 거꾸로 읽는 SSL 이번에는 LLM and AL 분야로 넘어 왔습니다! :)
  • 2022년도 이후로 가파르게 발전하고 있는 LLM 논문에 집중하여 의미가 있었던 논문을 살펴봅니다.
  • 해당 논문에서 제시하는 메소드의 특징 그리고 역사적으로 평가되는 이유에 대해서 즐겁게 토론하는 시간을 가집니다.

기간 (예정)

  • 2024 1/27 ~ 6/15 (예정)

발표 논문 및 순서

1 주차 overview overview (참고 논문)     강재욱
2 주차 pretraining Training Compute-Optimal Large Language Models DeepMind 2022 Mar 김택민
3 주차 pretraining Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning MIT 2022 Oct 이인규
4 주차 alignment learning Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback antropic 2022 Apr 조원익
4 주차 alignment learning Constitutional AI: Harmlessness from AI Feedback antropic 2022 Dec 조원익
5 주차 alignment learning Learning to summarize from human feedback Open AI 2022 Feb 김선호
6 주차 alignment learning InstructGPT: Training language models to follow instructions with human feedback Open AI 2022 Mar 이영수
7 주차 SFT Scaling Instruction-Finetuned Language Models Google 2022 Dec 이승준
8 주차 Reward Model Scaling Laws for Reward Model Over optimization Open AI 2022 October 김기범
9 주차 Reward Model Fine-Grained Human Feedback Gives Better Rewards for Language Model Training Univ Washainton Et al 2023 Oct 김강민
10 주차 non RL approach Training Language Models with Language Feedback at Scale Univ of NY et al 2022 November 이성윤
11 주차 non RL approach DPO: Direct Preference Optimization: Your Language Model is Secretly a Reward Model Standford Univ 2023 May 이승현
12 주차 sLLM Llama 2: Open Foundation and Fine-Tuned Chat Models / Mistral 7B Meta / Mistral 2023 Jul 김보섭
13 주차 sLLM / layer extension SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling upstage 2023 Dec 조진욱
14 주차 LLM evaluation Self-critiquing models for assisting human evaluators Open AI 2022 June 김범준
15 주차 LLM evaluation Judging LLM-as-a-judge with MT-Bench and Chatbot Arena UC Berkeley et al 2023 Oct 오영화
16 주차 augmented LM Chain-of-Thought Prompting Elicits Reasoning in Large Language Models Google 2022 Jan 조성국
17 주차 augmented LM WebGPT: Browser-assisted question-answering with human feedback Open AI 2022 Jun 조용래
18 주차 augmented LM In-context retrieval-augmented language models AI21 Labs 2023 January 김재희
19 주차 augmented LM SAIL: Search-Augmented Instruction Learning MIT 2023 June 우태강
20 주차 wrap-up 전체 흐름 재정리     강재욱

관련 링크

About