yang123vc / matrix-multiply-optimization

Used cache blocking, parallelizing, loop unrolling, register blocking, loop ordering, and SSE instructions to optimize the multiplication of large matrices to 55 gFLOPS

Geek Repo

Github PK Tool

This repository is not active

About

Used cache blocking, parallelizing, loop unrolling, register blocking, loop ordering, and SSE instructions to optimize the multiplication of large matrices to 55 gFLOPS

Languages

Language:C 100.0%