What's the largest PPO model size and context length that have been trained successfully with this library? Can you also share some performance metrics (e.g., GPU count, training time) if possible?

okuchaiev opened this issue 8 months ago

Originally posted by @panyi121 in #70 (comment)