Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/29287

TítuloA framework for efficient execution of data parallel irregular applications on heterogeneous systems
Autor(es)Ribeiro, Roberto
Barbosa, João
Santos, Luís Paulo
Palavras-chaveHeterogeneous systems
Irregular applications
Efficiency
Programming productivity
Data2015
EditoraWorld Scientific Publishing
RevistaParallel Processing Letters
CitaçãoRoberto Ribeiro, João Barbosa, and Luís Paulo Santos, Parallel Process. Lett. 25, 1550004
Resumo(s)Exploiting the computing power of the diversity of resources available on heterogeneous systems is mandatory but a very challenging task. The diversity of architectures, execution models and programming tools, together with disjoint address spaces and di erent computing capabilities, raise a number of challenges that severely impact on application performance and programming productivity. This problem is further compounded in the presence of data parallel irregular applications. This paper presents a framework that addresses development and execution of data parallel irregular applications in heterogeneous systems. A uni ed task-based programming and execution model is proposed, together with inter and intra-device scheduling, which, coupled with a data management system, aim to achieve performance scalability across multiple devices, while maintaining high programming productivity. Intradevice scheduling on wide SIMD/SIMT architectures resorts to consumer-producer kernels, which, by allowing dynamic generation and rescheduling of new work units, enable balancing irregular workloads and increase resource utilization. Results show that regular and irregular applications scale well with the number of devices, while requiring minimal programming e ort. Consumer-producer kernels are able to sustain signi cant performance gains as long as the workload per basic work unit is enough to compensate overheads associated with intra-device scheduling. This not being the case, consumer kernels can still be used for the irregular application. Comparisons with an alternative framework, StarPU, which targets regular workloads, consistently demonstrate signi cant speedups. This is, to the best of our knowledge, the rst published integrated approach that successfully handles irregular workloads over heterogeneous systems.
TipoArtigo
URIhttps://hdl.handle.net/1822/29287
DOI10.1142/S0129626415500048
ISSN0129-6264
1793-642X
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:DI/CCTC - Artigos (papers)

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
ppl2014.pdfDocumento principal511,45 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID