Preview

Advanced Engineering Research (Rostov-on-Don)

Advanced search

Study of access template to graphics engine GM effect on the performance

https://doi.org/10.12737/19687

Abstract

The work objective is to study the effect of the graphical processor unit computational cores load level and memory access pattern on the memory bus bandwidth and scaling acceleration. The research subject is the problem of scalability of the parallel computing performance and acceleration. The following hypothesis is checked: while processing images for multi-core shared-memory systems, Gustafson - Barsis’s law is more crucial than the memory access template at the underloading of the GPU cores. The research methodology is a computational experiment with further analysis of the obtained results. The conclusions are as follows. The suggested hypothesis is proved. For that, a series of experiments on various heterogeneous computational systems with OpenCL standard support is conducted. The application field of the results obtained includes the development of algorithms and software for the highly parallel computer systems. The memory access template starts to place certain restrictions on the algorithm efficiency only when the load level of the computational cores is sufficient. Video cards with the private memory show more stable results in comparison to those which share memory with the central processing unit.

About the Authors

RomanV. . Arzumanyan
Institute of Computer Technology and Information Security, Southern Federal University
Russian Federation


Alexander I. Sukhinov
Don State Technical University
Russian Federation


References

1. OpenCL Programming Guide for the CUDA Architecture. NVIDIA corporation. Santa Clara: NVIDIA, 200, 60 p.

2. AMD Accelerated Parallel Processing OpenCL Programming Guide. Advanced Micro Devices. Sunnyvale: ADM, 2013, 294 p.

3. Mali T600 Series OpenCL GPU Developer Guide. ARM. Available at:

4. http://infocenter.arm.com/help/topic/com.arm.doc.dui0538e/DUI0538E_mali_t600_opencl_dg.pdf (accessed: 16.04.16).

5. Sukhinov, А.I. Dvumernye skhemy rasshchepleniya i nekotorye ikh prilozheniya. [Two-dimensional splitting schemes and some of their applications.] Moscow: MAKS Press, 2005, 408 p. (in Russian).

6. Nikolayev, I.A., Sukhinov, А.I., Kharina, O.D. O rasparallelivanii treugol'nykh iteratsionnykh metodov na spetsializirovannoy mnogoprotsessornoy sisteme. [On parallel application of triangular iterative methods in a special-purpose multiprocessor system.] Avtomatika i Telemekhanika, 1986, iss. 5, pp. 135–142 (in Russian).

7. Sukhinov, А.I. Lokal'no-dvumernye skhemy dlya resheniya mnogomernykh parabolicheskikh uravneniy na vychislitel'nykh sistemakh matrichnogo tipa. [Locally two-dimensional schemes for solving multidimensional parabolic equations in computer systems of matrix type.] Izvestiya VUZ. Matematika, 1984, no. 11, pp. 45–53 (in Russian).

8. Padua, D., ed. Encyclopedia of Parallel Computing. New York: Springer, 2011, 2176 p.

9. Quinn, M.-J. Parallel Programming in C with MPI and OpenMP. New York: McGraw-Hill, 2003, 516 p.

10. AMD Graphic Core Next. Advanced Micro Devices. AMD Fusion Developer Summit 2013. Available at: http://developer.amd.com/wordpress/media/2013/06/2620_final.pdf (accessed 16.04.16).

11. Global Internet Phenomena Report. Sandvine. Available at: https://www.sandvine.com/trends/global-internetphenomena/ (accessed: 16.04.16).


Review

For citations:


Arzumanyan R., Sukhinov A.I. Study of access template to graphics engine GM effect on the performance. Vestnik of Don State Technical University. 2016;16(2):105-110. (In Russ.) https://doi.org/10.12737/19687

Views: 492


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 2687-1653 (Online)