matrixvecmult executes kernels MatrixVectorMul1n which compute c = A * b where A is stored in row major form and c and b are vectors

Using device GeForce GT 330M
CPU took 0.01191 s

Testing MatrixVectorMul1
WorkGroupSize = 64 GlobalSize 100032
Finished kernel execution
Average kernel execution time 0.126011
Found 0 different entries

Testing MatrixVectorMul2
WorkGroupSize = 64 GlobalSize 3840
Finished kernel execution
Average kernel execution time 0.0410887
Found 0 different entries

Testing MatrixVectorMul3
WorkGroupSize = 64 GlobalSize 3840
Finished kernel execution
Average kernel execution time 0.0448893
Found 84943 different entries
0 entries more than 0.001%

Testing MatrixVectorMul4
WorkGroupSize = 64 GlobalSize 3840
Finished kernel execution
Average kernel execution time 0.0313644
Found 81490 different entries
0 entries more than 0.001%

Testing MatrixVectorMul5
WorkGroupSize = 64 GlobalSize 3840
Finished kernel execution
Average kernel execution time 0.0252405
Found 81430 different entries
0 entries more than 0.001%
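
For readers who want to see what the log is checking, here is a minimal sketch in Python of a CPU reference for c = A * b with A in row-major form, plus a comparison that yields the two counts reported per kernel. The log does not show the program's actual source, so the function names (matvec_row_major, compare) and the relative-error definition of "more than 0.001%" are assumptions made for illustration only.

```python
def matvec_row_major(a, b, rows, cols):
    """Compute c = A * b, with A given as a flat row-major list of rows*cols values."""
    c = []
    for i in range(rows):
        base = i * cols  # row i occupies a[base : base + cols]
        c.append(sum(a[base + j] * b[j] for j in range(cols)))
    return c


def compare(reference, result, rel_tol=1e-5):
    """Return (entries that differ at all, entries whose relative error exceeds rel_tol).

    rel_tol=1e-5 corresponds to 0.001%; treating the threshold as a relative
    error is an assumption, since the log does not define it.
    """
    different = 0
    over_tol = 0
    for ref, got in zip(reference, result):
        if ref != got:
            different += 1
            scale = abs(ref) if ref != 0 else 1.0  # avoid dividing by zero
            if abs(ref - got) / scale > rel_tol:
                over_tol += 1
    return different, over_tol


# A is 2 x 3 in row-major order: [[1, 2, 3], [4, 5, 6]]
a = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
b = [1.0, 0.0, -1.0]
c = matvec_row_major(a, b, rows=2, cols=3)
print(c)  # [-2.0, -2.0]

# A result that differs only by a tiny rounding-sized amount in one entry
print(compare(c, [-2.0, -2.0000001]))  # (1, 0)
```

Under this reading, a kernel can report tens of thousands of "different entries" and still have 0 entries beyond 0.001%. That pattern would be consistent with small floating-point rounding differences, for example from a different summation order, although the log itself does not confirm the cause.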