PaLM-rlhf-pytorch Roadmap

Question

PaLM-rlhf-pytorch Roadmap

HappyPony opened this issue 2 years ago · comments

Hi,

Unfortunately, comments in this thread An Open-Source Version of ChatGPT is Coming sound too technical to my ears. I would like to have a summary, a roadmap on who, what, with which means is starting or wants to start now. So that I would have an idea in the role as a user, when can I get involved to - just like I am currently doing for ChatGPT OpenAI, train the OpenSource language model.

Phil Wang · Answer 1 · Fri Jan 20 2023 00:56:36 GMT+0800 (China Standard Time)

@HappyPony if you aren't doing a phd, the only way to participate is from the data angle. there is also potentially room to contribute in building the application for collecting human feedback to train the rewards model, but right now it is uncertain if this approach will be usurped by something like RL"AI"F, as Anthropic is promoting

i would suggest joining Laion and just helping out with Yannic Kilcher's similar efforts

Phil Wang · Answer 2 · Fri Jan 20 2023 01:07:57 GMT+0800 (China Standard Time)

@HappyPony if you truly want to understand what is going on beneath the surface, without getting a graduate degree, i highly recommend starting with fast.ai, before working your way into transformers and reinforcement learning

HappyPony · Answer 3 · Fri Jan 20 2023 01:16:20 GMT+0800 (China Standard Time)

thank you for the feedback @lucidrains. For understanding - I have no ambition to contribute greatly in the development of the models or algorithms. Although I have a university degree in physics and programming experience. But I am interested to contribute as a tester. And I'd like to have an idea of the timescales involved until there is something to test ;-)

Phil Wang · Answer 4 · Fri Jan 20 2023 01:28:26 GMT+0800 (China Standard Time)

@HappyPony yea, i would say, go mingle with the people doing the real work and see what they need

mainly Laion and CarperAI at this point