MatrixMul

A task done for the unit "GPU Architecture and Programming (ENG722S2)". Implements tiled matrix multiplication in CUDA, through two methods.

Kernel 1: Matrix dimensions must be multiples of BLOCK_SIZE
Kernel 2: Matrix dimensions can be arbitrary (at the cost of a slight drop in performance)

cutil.h

This task was supposed to use cutil.h, however support for that has been dropped in CUDA 5.x. Included in the CUDA SDK is helper_functions.h, which is meant to replace the functionality of the deprecated cutil.h. As a result, cutilmk2.h was created which replicates some of the missing functions by calling helper_functions.h.

NVIDIA

As was part of the assignment, much of the original source was based upon code samples from NVIDIA. In particular:

matrixmul.cu
matrixmul.h
matrixmul_gold.cpp

Though matrixmul.cu was modified substantially to include the following functionality:

Timing metrics
Multiple kernel invocations
Kernel selection
Matrix generation parameters

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.gitignore		.gitignore
README.md		README.md
cutilmk2.cu		cutilmk2.cu
cutilmk2.h		cutilmk2.h
matrixmul.cu		matrixmul.cu
matrixmul.h		matrixmul.h
matrixmul_gold.cpp		matrixmul_gold.cpp
matrixmul_kernel.cu		matrixmul_kernel.cu
matrixmul_kernel.h		matrixmul_kernel.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MatrixMul

cutil.h

NVIDIA

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MatrixMul

cutil.h

NVIDIA

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages