Benchmarking Phi-3 on CPU

Question

Benchmarking Phi-3 on CPU

bkaruman opened this issue 2 months ago · comments

Bhargavi Karumanchi commented 2 months ago

Hi,

Is there a tool to benchmark Phi-3 model performance on CPU?

Nat Kershaw (MSFT) · Answer 1 · Thu May 23 2024 05:12:19 GMT+0800 (China Standard Time)

Yes, we have a benchmarking tool here: https://github.com/microsoft/onnxruntime-genai/blob/main/benchmark/python/benchmark_e2e.py

Bhargavi Karumanchi · Answer 2 · Thu May 23 2024 05:44:27 GMT+0800 (China Standard Time)

@natke

This seems to be failing for Phi-3 model from HF - https://huggingface.co/microsoft/Phi-3-mini-128k-instruct-onnx/tree/main/cpu_and_mobile/cpu-int4-rtn-block-32

Baiju Meswani · Answer 3 · Thu May 23 2024 05:48:32 GMT+0800 (China Standard Time)

I think the script needs to be updated.

Could you change the line 21 in benchmark_phi3.py to

params.set_search_options(**{...})

Nat Kershaw (MSFT) · Answer 4 · Thu May 23 2024 08:01:38 GMT+0800 (China Standard Time)

@aciddelgado Can you please take a look?

Bhargavi Karumanchi · Answer 5 · Thu May 23 2024 08:58:39 GMT+0800 (China Standard Time)

I think the script needs to be updated.

Could you change the line 21 in benchmark_phi3.py to

params.set_search_options(**{...})

That doesn't work either

aciddelgado · Answer 6 · Fri May 24 2024 03:27:45 GMT+0800 (China Standard Time)

Are you using genai installed from pip or building from main? It looks like you are using an outdated version of the benchmarking script as I was able to run the phi-3 mini model from hf with the latest version of the benchmarking script found in the main branch of this repo. Also, did you make any further changes to benchmark_e2e.py to create your benchmark_phi3.py script?

Bhargavi Karumanchi · Answer 7 · Fri May 24 2024 09:18:53 GMT+0800 (China Standard Time)

Are you using genai installed from pip or building from main? It looks like you are using an outdated version of the benchmarking script as I was able to run the phi-3 mini model from hf with the latest version of the benchmarking script found in the main branch of this repo. Also, did you make any further changes to benchmark_e2e.py to create your benchmark_phi3.py script?

Moving to the latest commit worked. benchmark_phi3.py is same as the benchmarking script from the repo.

Nat Kershaw (MSFT) · Answer 8 · Sat May 25 2024 07:43:15 GMT+0800 (China Standard Time)

Great! Let us know how you get on.