Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a.

Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a Chip (HPPC 2009)

1) Introduction 2) What is StarPU ? 3) How to define and to build performance models ? 4) Build history-based performance models dynamically 5) Experimental validation 6) Conclusion

 Multi-core architectures featuring specialized accelerator ◦ Those are getting an increasing amount of attention. ◦ This success will probably influence the design of future High Performance Computing hardware.  Homogeneous multi-core system → Heterogeneous multi-core system  Static prediction → Dynamic prediction

 Auto-tuning performance prediction approach ◦ based on performance history tables dynamically built during the application run.

 A runtime system for task scheduling on heterogeneous multi-core architecture.  The design of StarPU is organized around three main components: ◦ An unified execution model. ◦ A data management library. ◦ A scheduling framework.

 Define performance model : ◦ We need to decide which parameters the model should depend on. ◦ Find relationship between these parameters.

 Build performance model : ◦ It is common to use specific pre-calibration program to build those model. ◦ It is however possible to design a model based on the amount of computations per task, and to calibrate the parameters by the means of a regression. ◦ StarPU can therefore automatically calibrate parametric models, either at runtime using linear regression models or offline in the case of non- linear models.

 Regression analysis will be create a model of dependent variable and independent variable.  In the model, we can be prediction value of dependent variable by independent variable.  General cases are linear regression and non- linear regression

 Measuring tasks' duration.  Identifying task kinds.  Feeding and looking up from the model.

 Each computational kernel is associated with a hash table per architecture.  Steps : 1.A task is submitted to StarPU 2.It computes its hash. 3.Consults the hash table corresponding to the proper kernel-architecture pair to retrieve the average execution time previously measured for this kind of task. 4.Update hash table, and save the new hash table to a file. (These performance models are persistent between different runs.)

 Environment : ◦ They have implemented these automatic model calibration mechanisms in StarPU. ◦ Multi-core CPU, GPU, Cell processor(SPU)

Performance feedback tools

 We have proposed a generic approach to seamlessly build history-based performance models.  It has been implemented within the StarPU runtime system with the support of its integrated data management library, and we have shown how StarPU's performance feedback tools help the programmer to analyze whether the resulting performance prediction are relevant or not.

Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a.

Similar presentations

Presentation on theme: "Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a.

Similar presentations

Presentation on theme: "Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a."— Presentation transcript:

Similar presentations

About project

Feedback