About project structure

Question

About project structure

zsnoob opened this issue 3 months ago · comments

Thank you for your work! Regarding the project structure, I would like to know the design purposes of the modeling_eagle and ea_model source files? It appears that both describe the structure of the original model and a single decoder layer. Perhaps modeling_eagle is specifically designed for inference using a custom model?

yuhuili · Answer 1 · Mon Apr 01 2024 22:51:11 GMT+0800 (China Standard Time)

Perhaps modeling_eagle is specifically designed for inference using a custom model?

Yes, modeling_eagle can be used to accelerate any model in the transformers library. To use modeling_eagle, you need to slightly modify the code (pre-allocated KV cache and tree mask, refer to modeling_llama_kv.py and modeling_Mixtral_kv.py for examples).