salesforce / ctrl

Conditional Transformer Language Model for Controllable Generation

Home Page: https://arxiv.org/abs/1909.05858

benchmarking with GPT-2

leejason opened this issue

Any suggestions for benchmarking CTRL against GPT-2? For example, loss value, perplexity (PPL), or any other metric for measuring text generation quality?

Not a direct answer to your question, but this (timely) article by @chiphuyen is really good:

https://thegradient.pub/understanding-evaluation-metrics-for-language-models/
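For reference, here is a minimal sketch of how loss and PPL relate, assuming you can already get per-token natural-log probabilities out of each model on the same evaluation text. The function names (`perplexity`, `bits_per_char`) are made up for illustration and are not part of the CTRL or GPT-2 code. Since the two models use different vocabularies, per-token PPL is not directly comparable between them, so the second function normalizes by character count instead.

```python
import math


def perplexity(token_log_probs):
    """Per-token perplexity: exp of the mean negative log-likelihood.

    token_log_probs: natural-log probabilities the model assigned
    to each token of the evaluation text (hypothetical input).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


def bits_per_char(token_log_probs, num_chars):
    """Total negative log-likelihood in bits, divided by character count.

    Normalizing by characters rather than tokens keeps the number
    comparable across models with different tokenizers.
    """
    if num_chars <= 0:
        raise ValueError("num_chars must be positive")
    total_nll_nats = -sum(token_log_probs)
    return total_nll_nats / (math.log(2) * num_chars)
```

Feeding both models the same text and comparing `bits_per_char` (lower is better) is one way to sidestep the tokenizer mismatch. It only measures how well each model predicts the text, not the quality of what it generates.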

Very helpful, thanks!