google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Home Page:https://ai.google.dev/gemma

how to finetune with gemma model?

runningabcd opened this issue · comments

I have already downloaded the gemma-7b-it model from Hugging Face, but I can't find a script for fine-tuning it with my own data.

How do I do SFT with Gemma? Can you tell me the SFT data format?

@pengchongjin

Hi there. Unfortunately, this repo doesn't provide fine-tuning features.

Here are a few alternatives that might fit your needs:

  1. The Gemma model card in Vertex Model Garden includes a few notebooks that demonstrate how to fine-tune the model and then deploy it to Vertex endpoints.
  2. The Gemma model card on Kaggle includes a few notebooks that use KerasNLP for fine-tuning.
  3. Hugging Face demonstrates how to fine-tune with TRL in this blog post.

Hope it helps.
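On the SFT data format question: this repo doesn't define one, but Gemma's instruction-tuned variants use `<start_of_turn>`/`<end_of_turn>` turn markers, and many fine-tuning recipes render each prompt/response pair with that template. A minimal sketch (the exact template your chosen training stack expects may differ, so treat this as an illustration rather than an official format):

```python
# Hedged sketch: render one SFT example with Gemma-style turn markers.
# The template below matches Gemma's documented chat turns, but check
# what your specific trainer or notebook expects before using it.

def format_sft_example(prompt: str, response: str) -> str:
    """Render a single prompt/response pair as one training string."""
    return (
        "<start_of_turn>user\n"
        f"{prompt}<end_of_turn>\n"
        "<start_of_turn>model\n"
        f"{response}<end_of_turn>"
    )

example = format_sft_example(
    "What is the capital of France?",
    "The capital of France is Paris.",
)
print(example)
```

The tokenizer typically prepends the `<bos>` token itself, so it is usually left out of the formatted string.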

@pengchongjin Is it possible to implement a class for fine-tuning the model inside this repo, similar to what is done in llama-recipes?

Are there any tutorials for fine-tuning the 7b-it-quant model?

Hi @aliasneo1

There are a few tutorials that demonstrate fine-tuning the gemma-2b model. You can follow a similar procedure to fine-tune the gemma-7b-it variant.
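Whichever tutorial you follow, most fine-tuning stacks accept training data as a JSONL file with one record per line. A minimal sketch of preparing such a file (the field names "prompt" and "response" are an assumption here; check the schema your chosen trainer actually expects):

```python
import json
from pathlib import Path

# Illustrative records; replace with your own data. The "prompt" and
# "response" field names are assumptions -- different trainers and
# notebooks expect different schemas.
records = [
    {"prompt": "Translate 'hello' to French.", "response": "bonjour"},
    {"prompt": "What is 2 + 2?", "response": "4"},
]

path = Path("sft_data.jsonl")
with path.open("w", encoding="utf-8") as f:
    for record in records:
        f.write(json.dumps(record, ensure_ascii=False) + "\n")

# Read the file back to confirm it parses as one JSON object per line.
loaded = [
    json.loads(line)
    for line in path.read_text(encoding="utf-8").splitlines()
]
print(len(loaded))
```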

Here are some resources: