How many decoder layer needs to add in EAGLE?

Question

How many decoder layer needs to add in EAGLE?

yjdy opened this issue 4 months ago · comments

Thanks for the great work. I find a different between code and paper

In the paper, it say eagle trains a decoder layer.
In the code, cnet.py class Model (https://github.com/SafeAILab/EAGLE/blob/main/model/cnets.py#L491)

self.layers = nn.ModuleList([LlamaDecoderLayer(config, index) for index in range(config.num_hidden_layers)])

It needs to add a decoder layer for every hidden layer.

Please tell me which one is correct.

Best regards

Chengyuan Li · Answer 1 · Thu Feb 29 2024 18:39:48 GMT+0800 (China Standard Time)

This config is not the config file in your llm directory, check the config.json in your ea_model_path, config.num_hidden_layers should be 1.

yjdy · Answer 2 · Fri Mar 01 2024 08:58:04 GMT+0800 (China Standard Time)

Thanks for the response.
If I need to train eagle on different LLM, I think I should prepare different config file.
I just need to copy the config from the LLM I want to train on and change the num_hidden_layers to 1, right?

Chengyuan Li · Answer 3 · Fri Mar 01 2024 11:17:08 GMT+0800 (China Standard Time)

Not 100% sure, you better check the code in train/main.py

yuhuili · Answer 4 · Mon Mar 04 2024 00:17:20 GMT+0800 (China Standard Time)

If I need to train eagle on different LLM, I think I should prepare different config file. I just need to copy the config from the LLM I want to train on and change the num_hidden_layers to 1, right?

You are right.

yuhuili · Answer 5 · Mon Mar 04 2024 00:23:04 GMT+0800 (China Standard Time)

I find a different between code and paper.

Thank you for your interest. As @cyLi-Tiger mentioned, the configuration of EAGLE is different from the base LLM.