Lightning-AI / litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Home Page: https://lightning.ai

support for older models

qwenzo opened this issue

Hello,

I was wondering if it is straightforward to bring older models such as GPT-2 to lit-gpt.
If so, what files/configs do I need to change?

Thank you!

Good point, and it should be. I use GPT-2 a lot myself privately, and it'd be nice to have it in LitGPT as well.

I think the architecture is similar to GPTNeo, so you can probably copy and adapt the GPTNeo config (see the sketches after this list). The general todo list I use for adding new configs is:

  • Implement model download
  • Implement HF checkpoint conversion
  • Make sure generate.py produces reasonable outputs
  • Update model_download docs
  • Test pretraining
  • Test finetuning
    • Full finetuning
    • LoRA
    • Adapter + Adapter v2
  • Add tests
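For illustration, here is a minimal sketch of what a GPT-2 (124M "small") entry might look like, assuming the dict-style entries used in litgpt's `config.py`. The hyperparameters are the published GPT-2 small values; the field names (`hf_config`, `parallel_residual`, etc.) and the Hub org are assumptions to verify against the actual `Config` dataclass. One likely gap to watch for: GPT-2 uses learned absolute position embeddings rather than rotary embeddings, so position handling may need its own code path.

```python
# Hypothetical GPT-2 "small" entry in the style of litgpt's config.py dicts.
# Hyperparameters are the published GPT-2 values; field names are assumptions.
gpt2_small = dict(
    name="gpt2",
    hf_config=dict(org="openai-community", name="gpt2"),  # assumed Hub location
    block_size=1024,          # GPT-2 context length
    vocab_size=50257,         # GPT-2 BPE vocabulary
    n_layer=12,
    n_head=12,
    n_embd=768,
    bias=True,                # GPT-2 uses biases in its linear layers
    parallel_residual=False,  # sequential residual blocks, unlike GPT-NeoX
)
```

The larger variants only change the depth/width knobs: gpt2-medium is 24 layers / 16 heads / 1024 embedding dim, gpt2-large is 36 / 20 / 1280, and gpt2-xl is 48 / 25 / 1600.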
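For the HF checkpoint conversion step, one GPT-2-specific detail is worth calling out: HF's GPT-2 implementation uses `Conv1D` modules, which store their weights transposed relative to `torch.nn.Linear`, so every attention/MLP weight matrix needs a transpose during conversion. A rough sketch for a single block, with placeholder target names (the real litgpt checkpoint layout will differ):

```python
import torch

def convert_block(hf_state: dict[str, torch.Tensor], i: int) -> dict[str, torch.Tensor]:
    """Convert one HF GPT-2 transformer block to nn.Linear-style weights.

    The target key names ("block.{i}.attn.qkv", ...) are placeholders for
    illustration, not litgpt's actual checkpoint layout.
    """
    hf = f"transformer.h.{i}"
    out = {}
    for src, dst in [
        (f"{hf}.attn.c_attn", f"block.{i}.attn.qkv"),   # fused QKV projection
        (f"{hf}.attn.c_proj", f"block.{i}.attn.proj"),
        (f"{hf}.mlp.c_fc",    f"block.{i}.mlp.fc"),
        (f"{hf}.mlp.c_proj",  f"block.{i}.mlp.proj"),
    ]:
        # HF Conv1D weights are (in_features, out_features); nn.Linear expects
        # (out_features, in_features), hence the transpose.
        out[f"{dst}.weight"] = hf_state[f"{src}.weight"].t()
        out[f"{dst}.bias"] = hf_state[f"{src}.bias"]
    # LayerNorm parameters carry over unchanged.
    for src, dst in [
        (f"{hf}.ln_1", f"block.{i}.norm_1"),
        (f"{hf}.ln_2", f"block.{i}.norm_2"),
    ]:
        out[f"{dst}.weight"] = hf_state[f"{src}.weight"]
        out[f"{dst}.bias"] = hf_state[f"{src}.bias"]
    return out
```

The embeddings (`wte`, `wpe`) and the final `ln_f` copy over directly, and GPT-2 ties its output head to `wte`, so no separate head weight needs converting.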