tcbegley / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

[nanoChatGPT] Don't shuffle val set?

tcbegley opened this issue

See here

This is probably fine as is. We don't perform a full pass over the validation set when calculating validation metrics; we sample a limited number of batches instead. Some randomness is therefore desirable, so that we don't always validate on the same subset of the validation data.
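To make the reasoning concrete, here is a minimal sketch in plain Python. It is not the code from this repo: `sample_batches`, `covered`, the toy `val_set`, and the batch sizes are all hypothetical names and values chosen for illustration. It compares how much of a validation set gets looked at over repeated evaluations when only the first few batches are drawn each time, with and without shuffling.

```python
import random


def sample_batches(dataset, batch_size, num_batches, shuffle, seed=None):
    """Return the first `num_batches` batches of `dataset` (hypothetical helper).

    If `shuffle` is True, the index order is permuted before batching,
    so a different subset is drawn for each seed.
    """
    indices = list(range(len(dataset)))
    if shuffle:
        random.Random(seed).shuffle(indices)
    batches = []
    for b in range(num_batches):
        chunk = indices[b * batch_size:(b + 1) * batch_size]
        batches.append([dataset[i] for i in chunk])
    return batches


# Toy stand-in for a validation set: 100 examples identified by their index.
val_set = list(range(100))


def covered(shuffle, num_evals=20):
    """Collect every example seen across `num_evals` partial evaluations."""
    seen = set()
    for step in range(num_evals):
        batches = sample_batches(
            val_set, batch_size=4, num_batches=2, shuffle=shuffle, seed=step
        )
        for batch in batches:
            seen.update(batch)
    return seen


unshuffled_coverage = covered(shuffle=False)
shuffled_coverage = covered(shuffle=True)

print(f"unshuffled: {len(unshuffled_coverage)} distinct examples seen")
print(f"shuffled:   {len(shuffled_coverage)} distinct examples seen")
```

Without shuffling, every evaluation sees the same 8 examples (2 batches of 4), so the metric only ever reflects that slice. With shuffling, each evaluation draws a different subset, which makes individual estimates noisier but covers far more of the validation data over time.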