What's the largest PPO model size and context length that have been trained successfully with this library? Can you also share some performance metrics (i.e. GPU count, training time) if possible?
okuchaiev opened this issue · comments
Oleksii Kuchaiev commented
What's the largest PPO model size and context length that have been trained successfully with this library? Can you also share some performance metrics (i.e. GPU count, training time) if possible?
Originally posted by @panyi121 in #70 (comment)