Dumb matmul bench

The only purpose is showing that matmul in Anaconda is done using Intel MKL library and performance is exactly the same between C++ and Python code calling into it. All code besides bench.sh is written by qwen3-coder-480b.

On my machine results are these:

> ./bench.sh
=== Building
g++ -O3 -m64 -march=native -std=c++17 matrix_mult.cpp -I/opt/intel/oneapi/mkl/latest/include -L/opt/intel/oneapi/mkl/latest/lib/intel64 -lmkl_rt -lpthread -lm -ldl  -o matrix_mult
=== C++
MKL version: 2023.0
Matrix size: 16384x16384
Time taken: 6.15184 seconds
Performance: 1429.83 GFLOPS
=== Python
Blas implementation: mkl-sdl
Generating 16384x16384 matrices...
Multiplying 16384x16384 matrices...
Matrix size: 16384x16384
Time taken: 6.349443 seconds
Performance: 1385.33 GFLOPS

To build install MKL using your system package manager. I used python from recent Anaconda download.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Makefile		Makefile
README.md		README.md
bench.sh		bench.sh
matrix_mult.cpp		matrix_mult.cpp
matrix_mult.py		matrix_mult.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Dumb matmul bench

About

Uh oh!

Releases

Packages

Languages

lazy/dumb-matmul-bench

Folders and files

Latest commit

History

Repository files navigation

Dumb matmul bench

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages