lhenault / simpleAI

An easy way to host your own AI API and expose alternative models, while being compatible with "open" AI clients.

Home Page: https://pypi.org/project/simple-ai-server/

Adding Falcon GPT model - how to?

NicoJuicy opened this issue

How would I start adding the falcon-40b and falcon-7b models?

Am I correct that I'd just copy and rename e.g. the GPT-NeoXT-Chat-Base-20B example?

And then:

  • Adjust the 'get_models.py' to point at the relevant Hugging Face model
  • Adjust the 'models.toml' file (or create one from the template)

Would that work?
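For what it's worth, here is a minimal sketch of what the Hugging Face side of such a change might look like, assuming the copied example loads the model with transformers' Auto classes. The surrounding simpleAI plumbing is omitted; only the model swap is shown, and `MODEL_ID` is just the current Hub identifier for Falcon.

```python
# Hypothetical sketch of the model-loading part of a copied example.
# File layout and helper names follow the question, not the actual repo.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "tiiuae/falcon-7b"  # or "tiiuae/falcon-40b"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # Falcon weights are published in bfloat16
    trust_remote_code=True,      # Falcon shipped with custom modelling code at release
    device_map="auto",           # spread layers across the available GPUs
)
```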

Hi @NicoJuicy, thanks for the question.

I'd rather start from the MPT examples; they're a bit more advanced (transformers models seemed to have memory leaks on GPU, and that's addressed in those examples). Otherwise, yes, that's the way to go.

You might also want to edit how chat input is formatted during preprocessing for chat models, and check if the device_map fits your needs.
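As a rough illustration of the kind of preprocessing tweak meant here: the exact prompt template Falcon expects isn't standardized, so the `format_chat_prompt` helper and the "User:/Assistant:" convention below are assumptions rather than simpleAI's actual code.

```python
def format_chat_prompt(messages: list[dict]) -> str:
    """Turn OpenAI-style chat messages into a single Falcon prompt string.

    Hypothetical helper: adjust the role labels and separators to whatever
    template works best for the Falcon variant you deploy.
    """
    lines = []
    for message in messages:
        role = message.get("role", "user").capitalize()
        lines.append(f"{role}: {message['content']}")
    lines.append("Assistant:")  # cue the model to answer
    return "\n".join(lines)
```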

Thanks for the info. Some work has popped up, so let's see if I can get back to this later on!

Nice work fyi!