Skip to content
This repository was archived by the owner on Jan 26, 2023. It is now read-only.
/ optimize-gemm Public archive

Optimize single-threaded general matrix multplication

License

Notifications You must be signed in to change notification settings

axmat/optimize-gemm

Repository files navigation

Optimize single-threaded General Matrix Multiplication (GEMM) fo two square matrices

Transpose the second matrix

Blocking

Tiling

Vectorization with std::experimental::simd

Automatic vectorization on Clang

OpenBLAS

Build

mkdir build
cd build
cmake .. -DCMAKE_CXX_COMPILER=clang++
cmake --build . --

Run the benchmarks

export $OMP_NUM_THREADS=1
./bench-gemm [dim_size]

About

Optimize single-threaded general matrix multplication

Topics

Resources

License

Stars

Watchers

Forks