When compiling and running you must NOT use gollum itself but one of the nodes node1 to node8. Present the results as a table of run time and spedup (wrt OpenMP on 1 thread). Speedup is defined as T1/Tp where T1 is the time on 1 thread and Tp is the time on p threads. Comment on the results.
The preferred way to time the codes is to use omp_set_num_threads(number) to set the threads and omp_get_wtime() to time the relevant part e.g.
omp_set_num_threads(s);
start = omp_get_wtime();
#pragma omp parallel ...
PUT PARALLEL CODE HERE
end = omp_get_wtime();
printf("time=:%.16g number of threads: %d\n",end - start, s);
Implement and check the time to do a matrix-matrix product of a 1500x900 matrix A with a 900x1200 matrix B of doubles using 1,2,3,4,5,6, 7 and 8 threads on a gollum node.
Define Aij = (i+1)*(j+1) and Bij = 1/((double) (i+1)* (double)(j+1)).
The result matrix C=A*B should be Cij= 900*(double)(i+1)/(double) (j+1) . You should check that the result is correct in each case by comparing A*B with a matrix C with these values.
Modify the code to run similar tests using static with default chuck size, dynamic with chunk size 1 and guided scheduling with chunk size 10.
You may turn in either one program that does all 3 types of scheduling or three seperate programs.
List the various run times and speedup in seperate tables for each type of scheduling and comment on the speedup of each scheduling type and draw conclusions as to which is preferable (and in what circumstances).
Note that to implement this program you will need to increase the default stack size for both the original thread and the OMP threads. If you do not you will get a segmentation error. To do this you need to execute shell commands similar to:
ulimit -s unlimited
export OMP_STACKSIZE=16M
Hello,
expert in parallel programming using C++ and OpenMP here. I can implement a matrix matrix OpenMP parallel algorithm for you as per the attached specifications.
The code will be 100% original, no part of the code will be copied from Internet.
Thanks and let me know if you are interested to work with me,
Paul
Have been working with Linux Kernel Programming since more than 2 years. Have deep insights of C/C++ programming.
Have experience of developing device drivers for USB 3 devices on Xilinx Zynq Embedded Platform
I can complete your project within your given deadline.
I can
* have the source code ready in a few days.
* also give you a makefile so that you can compile the code easily.
* also help you get the code compiling on gollum (or whatever your cluster is called).
I've worked with OpenMP and OpenMPI for my masters thesis. So i'm familiar with what is being asked here.