iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

Home Page: http://iree.dev/

[EPIC][CPU] Enable predictable performance on mixed-types GEMM using data-tiling

hanhanW opened this issue · comments

This EPIC tracks all the related work.

Tasks related to core functionality

Tasks

Tasks related to performance improvements

llama2 specific tasks

Tasks

@Max191 I think you have some local patches and ideas that are required for the mixed-types data-tiling work; could you add them to the tasklist accordingly?

@bjacob please help update this if you have ongoing/TODO tasks in mind.

@MaheshRavishankar I created an epic to help us better understand what needs to be done for mixed-types data-tiling and the work we've been doing.

Thank you all for all the awesome work!

For small tasks, adding a brief description to the tasklist is good enough. For large tasks, it would be good if you could create an issue/epic. It's not necessary to do it now, but please help add a brief description. Thank you!