Amortizing intractable inference in large language models

This repository contains code for GFlowNet fine-tuning of language models, as described in the paper

Amortizing intractable inference in large language models
Edward J. Hu*, Moksh Jain*, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin
Paper: https://arxiv.org/abs/2310.04363

BibTeX

@article{hu2023amortizing,
  title={Amortizing intractable inference in large language models},
  author={Hu, Edward J. and Jain, Moksh and Elmoznino, Eric and Kaddar, Younesse and Lajoie, Guillaume and Bengio, Yoshua and Malkin, Nikolay},
  year={2023},
  journal={arXiv preprint 2310.04363}
}

Visit the subdirectories to find code and documentation for each experiment in the paper:

Random number generation (§2): rng
Sentence continuation (§4.1): next_sentence
Story infilling (§4.2): infill_subj_arithmetic
Subjectivity classification (§4.3): infill_subj_arithmetic
Arithmetic with tool use (§4.4): infill_subj_arithmetic

Please contact us or post an issue if you have any questions.

GFNOrg / gfn-lm-tuning

Amortizing intractable inference in large language models

About

Languages