huggingface / optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools

Home Page:https://huggingface.co/docs/optimum/main/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

advice for simple onnxruntime script for ORTModelForVision2Seq (or separate encoder/decoder)

eduardatmadenn opened this issue · comments

I am trying to use implement this class in C++ because unfortunately I didn't find any C++ implementation for this.

Therefore, my current approach is to revert this class and the auxiliary classes to a simple onnxruntime prediction, to make things easier to port to C++.

Does anyone have any advice in this matter? Thank you