yang123vc / matrix-multiply-optimization

Used cache blocking, parallelizing, loop unrolling, register blocking, loop ordering, and SSE instructions to optimize the multiplication of large matrices to 55 gFLOPS

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

Used cache blocking, parallelizing, loop unrolling, register blocking, loop ordering, and SSE instructions to optimize the multiplication of large matrices to 55 gFLOPS


Languages

Language:C 100.0%