ashvardanian / AssemblyStats

A research project highlighting the rarity of SIMD instructions in modern software

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PyAssemblyStats

PyAssemblyStats is Python project, that analyzes the binaries found in the default system paths. The aim is to understand, how often do our favorite tools use SIMD instructios, that make up a huge part of modern CPUs. The results are astonishing and show, that often less than 1% of generated lines even reference vector registers.

The full report is available in this article in the Unum blog.

Usage

  • main.py generates the main.json with stats about differnt binaries.
  • topic_modeling.ipynb shows, how the clusters of SIMD instruction can be recognized using Latend Dirichlet Allocation.

Clusters

TODO

About

A research project highlighting the rarity of SIMD instructions in modern software


Languages

Language:Jupyter Notebook 92.8%Language:Python 7.2%