GPGPU-9 - General-Purpose GPU

Call for Papers

Scope

The goal of this workshop is to provide a forum to discuss new and emerging general-purpose purpose programming environments and platforms, as well as evaluate applications that have been able to harness the horsepower provided by these platforms. This year’s workshop is particularly interested in exploring new heterogeneous GPU platforms, new forms of concurrency, and novel/irregular applications that can leverage these platforms. Papers are being sought on many aspects of GPUs, including (but not limited to):

GPU applications
GPU programming environments
GPU runtime systems
GPU compilation
GPU architectures
Multi-GPU systems
GPU power/efficiency
GPU reliability
GPU benchmarking/measurements
Heterogeneous GPU platforms that incorporate GPUs

Important Dates

Papers due: December 1, 2015 (paper submission closed)
Notification: December 21, 2015
Final paper due: January 4, 2015

Submissions

All submissions must be made electronically through the EasyChair system. Full paper submissions must be in PDF formatted for US lettersize paper. They must not exceed 10 pages (all inclusive) in standard ACM two-column conference format (preprint mode, with page number). Templates for ACM format are available for Microsoft Word, and LaTeX here (use the 9 pt template). Authors can choose to reveal their identity (or not) in submitted papers. All accepted papers will be published in the ACM Online Conference Proceedings Series. For questions, contact David Kaeli kaeli@ece.neu.edu.

Travel Awards

US National Science Foundation (NSF) Support

US National Science Foundation (NSF) Support The US National Science Foundation has provided funding to support student attendance at PPoPP 2016. Applicants must be registered students at accredited US academic institutions. Successful applicants will be reimbursed for approved expenses, including travel, accommodation, and reasonable meal expenses. All reimbursements will require original receipts. Instructions on filing for reimbursement will be provided to successful applicants.

Application can be made at http://goo.gl/forms/cK0gOAIOHP.

Support from NSF is provided through Grant CCF-1552229. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

ACM SIGPLAN PAC Funds

The ACM SIGPLAN PAC Funds support travel and accommodation for Students who have a paper at the conference or associated workshops, see http://www.sigplan.org/PAC/ for additional details.

PPoPP Scholarship

For students who are not eligible for the NSF support or PAC funding; PPoPP has set up a scholarship. If your institution is unable to fund you for attending the conference; please fill this form: http://goo.gl/forms/cK0gOAIOHP Priority will be given to :

Students who submitted a paper to PPoPP
Students who submitted a paper at a workshop
Students who wish to attend the conference

We will require a letter of recommendation from your Advisor. Notifications will be sent early in March.

Time Zone

The program is currently displayed in (GMT) Belfast.

Use conference time zone: (GMT) BelfastSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

You're viewing the program in a time zone which is different from your device's time zone change time zone

Sat 12 Mar
Displayed time zone: Belfast change

09:00 - 10:30	AlgorithmsGPGPU-9 at Mitre

09:00 45m Talk		Keynote: Runtime Aware Architectures GPGPU-9 Mateo Valero
09:45 20m Talk		GPU Centric Extensions for Parallel Strongly Connected Components Computation GPGPU-9 Shrinivas Devshatwar , Madhur Amilkanthwar , Rupesh Nasre IIT Madras, India
10:05 20m Talk		General-Purpose Join Algorithms for Large Graph Triangle Listing on Heterogeneous Systems GPGPU-9 Daniel Zinn , Haicheng Wu , Jin Wang , Molham Aref , Sudhakar Yalamanchili

11:00 - 12:30	Heterogenous Languages, Extensions and RuntimesGPGPU-9 at Mitre

11:00 20m Talk		Designing High Performance Communication Runtime for GPU Managed Memory: Early Experiences GPGPU-9 Dip Sankar Banerjee , Khaled Hamidouche , Dhabaleswar K. Panda
11:20 20m Talk		Multi-Stage Programming for GPUs in Modern C++ using PACXX GPGPU-9 Michael Haidl , Michel Steuwer , Tim Humernbrum , Sergei Gorlatch
11:40 20m Talk		Simplifying Programming and Load Balancing of Data Parallel Applications on Heterogeneous systems GPGPU-9 Borja Pérez , Jose Luis Bosque , Ramón Beivide

14:00 - 15:30	Tasking and SchedulingGPGPU-9 at Mitre

14:00 45m Talk		Keynote: Working Together to Build the Heterogeneous Processing Ecosystem GPGPU-9 Andrew Richards
14:45 20m Talk		Implementing Directed Acyclic Graphs with the Heterogeneous System Architecture GPGPU-9 Sooraj Puthoor , Ashwin Aji , Shuai Che , Mayank Daga , Wei Wu , Bradford M. Beckmann , Gregory Rodgers
15:05 20m Talk		GPUpIO: The Case for I/O-Driven Preemption on GPUs GPGPU-9 Lior Zeno , Avi Mendleson , Mark Silberstein

16:00 - 17:30	Matrix-based and Stencil OptimizationGPGPU-9 at Mitre

16:00 40m Talk		A Systems Perspective on GPU Computing: A Tribute to Karsten Schwan GPGPU-9 Naila Farooqui
16:40 20m Talk		Performance Portable GPU Code Generation for Matrix Multiplication GPGPU-9 Toomas Remmelg , Thibaut Lutz , Michel Steuwer , Christophe Dubach University of Edinburgh
17:00 20m Talk		Effective Resource Management for Enhancing Performance of 2D and 3D Stencils on GPUs GPGPU-9 Prashant Singh Rawat , Changwan Hong , Mahesh Ravishankar , Vinod Grover , Louis-Noël Pouchet Ohio State University, P. Sadayappan Ohio State University
17:20 10m Talk		Wrap up GPGPU-9

GPGPU '16- Proceedings of the 9th Annual Workshop on General Purpose Processing using Graphics Processing Unit

Full Citation in the ACM Digital Library

SESSION: Algorithm

Runtime aware architectures

Mateo Valero

GPU centric extensions for parallel strongly connected components computation

Shrinivas Devshatwar
Madhur Amilkanthwar
Rupesh Nasre

General-purpose join algorithms for large graph triangle listing on heterogeneous systems

Daniel Zinn
Haicheng Wu
Jin Wang
Molham Aref
Sudhakar Yalamanchili

SESSION: Heterogenous languages, extensions and runtimes

Performance portable GPU code generation for matrix multiplication

Toomas Remmelg
Thibaut Lutz
Michel Steuwer
Christophe Dubach

Multi-stage programming for GPUs in C++ using PACXX

Michael Haidl
Michel Steuwer
Tim Humernbrum
Sergei Gorlatch

Simplifying programming and load balancing of data parallel applications on heterogeneous systems

Borja Pérez
José Luis Bosque
Ramón Beivide

SESSION: Tasking and scheduling

Working together to build the heterogeneous processing ecosystem

Andrew Richards

Implementing directed acyclic graphs with the heterogeneous system architecture

Sooraj Puthoor
Ashwin M. Aji
Shuai Che
Mayank Daga
Wei Wu
Bradford M. Beckmann
Gregory Rodgers

GPUpIO: the case for I/O-driven preemption on GPUs

Lior Zeno
Avi Mendelson
Mark Silberstein

SESSION: Stencil optimization

A systems perspective on GPU computing: a tribute to Karsten Schwan

Naila Farooqui

Designing high performance communication runtime for GPU managed memory: early experiences

Dip Sankar Banerjee
Khaled Hamidouche
Dhabaleswar K. Panda

Effective resource management for enhancing performance of 2D and 3D stencils on GPUs

Prashant Singh Rawat
Changwan Hong
Mahesh Ravishankar
Vinod Grover
Louis-Noël Pouchet
P. Sadayappan

Keynote: Runtime Aware Architectures

Mateo Valero, Universitat Politécnica de Catalunya

Abstract

In the last years the traditional ways to keep the increase of hardware performance to the rate predicted by the Moore’s Law vanished. When uni-cores were the norm, hardware design was decoupled from the software stack thanks to a well defined Instruction Set Architecture (ISA). This simple interface allowed developing applications without worrying too much about the underlying hardware, while computer architects proposed techniques to aggressively exploit Instruction-Level Parallelism (ILP) in superscalar processors. Current multi-cores are designed as simple symmetric multiprocessors on a chip. While these designs are able to compensate the clock frequency stagnation, they face multiple problems in terms of power consumption, programmability, resilience or memory. The solution is to give more responsibility to the runtime system and to let it tightly collaborate with the hardware. The runtime has to drive the design of future multi-cores architectures. In this talk, we introduce an approach towards a Runtime-Aware Architecture (RAA), a massively parallel architecture designed from the runtime’s perspective.

Biography

Mateo Valero, http://www.bsc.es/cv-mateo/, obtained his Telecommunication Engineering Degree from the Technical University of Madrid (UPM) in 1974 and his Ph.D. in Telecommunications from the Technical University of Catalonia (UPC) in 1980. He is a professor in the Computer Architecture Department at UPC, in Barcelona. His research interests focuses on high performance architectures. He has published approximately 700 papers, has served in the organization of more than 300 International Conferences and he has given more than 400 invited talks. He is the director of the Barcelona Supercomputing Centre, the National Centre of Supercomputing in Spain.

Dr. Valero has been honoured with several awards. Among them, the Eckert-Mauchly Award 2007 by the IEEE and ACM; Seymour Cray Award 2015 by IEEE; Harry Goode Award 2009 by IEEE: ACM Distinguished Service Award 2012; Euro-Par Achievement Award 2015; the Spanish National Julio Rey Pastor award, in recognition of research in Mathematics; the Spanish National Award “Leonardo Torres Quevedo” that recognizes research in engineering; the “King Jaime I” in basic research given by Generalitat Valenciana; the Research Award by the Catalan Foundation for Research and Innovation and the “Aragón Award” 2008 given by the Government of Aragón. He has been named Honorary Doctor by the University of Chalmers, by the University of Belgrade, by the Universities of Las Palmas de Gran Canaria, Zaragoza, Complutense de Madrid, Cantabria and Granada in Spain and by the University of Veracruz in Mexico. “Hall of the Fame” member of the ICT European Program (selected as one of the 25 most influents European researchers in IT during the period 1983-2008. Lyon, November 2008)

In December 1994, Professor Valero became a founding member of the Royal Spanish Academy of Engineering. In 2005 he was elected Correspondant Academic of the Spanish Royal Academy of Science, in 2006 member of the Royal Spanish Academy of Doctors, in 2008 member of the Academia Europaea and in 2012 Correspondant Academic of the Mexican Academy of Sciences. He is a Fellow of the IEEE, Fellow of the ACM and an Intel Distinguished Research Fellow.

In 1998 he won a “Favourite Son” Award of his home town, Alfamén (Zaragoza) and in 2006, his native town of Alfamén named their Public College after him.

Keynote: Working Together to Build the Heterogeneous Processing Ecosystem

Andrew Richards, Codeplay Software Ltd,

Abstract

We can now say that almost all future performance improvements will come from heterogeneous acceleration. But the reality of building successful software and platforms is that no one company or individual can create everything. That means we need to provide standard platforms, interfaces, tools, languages and components that interoperate. Only through open standards can we all innovate in our own specialist areas. What are the challenges, opportunities and requirements of working together to allow software components, languages, tools and processor cores from different researchers and companies to play nice together?

Biography

As well as being CEO and Founder of Codeplay Software Ltd, Andrew is also the Chair of the Tools and System Runtime working groups of the HSA Foundation and the Chair of the SYCL™ for OpenCL™ Group of the Khronos Group. After graduating from Cambridge University with a degree in Computer Science and Physics, Andrew started his career in the 8-bit days writing videogames, before researching compiler technology and founding Codeplay in 2002. Codeplay have been producing compilers for games consoles, special-purpose processors and GPUs since then. Today Codeplay is a world-leading specialist in GPU compiler technology, working on high-end mobile graphics and research into future graphics and processing technologies.

General-Purpose GPUGPGPU-9

Call for Papers

Program Display Configuration

Sat 12 MarDisplayed time zone: Belfast change

Links to accepted papers

GPGPU '16- Proceedings of the 9th Annual Workshop on General Purpose Processing using Graphics Processing Unit

SESSION: Algorithm

SESSION: Heterogenous languages, extensions and runtimes

SESSION: Tasking and scheduling

SESSION: Stencil optimization

Keynotes

David Kaeli

Northeastern University

United States

John Cavazos

University of Delaware

Yifan Sun

Northeastern University

Tor Aamodt

UMBC

Jose Luis Abellan

UCAM

Chris Batten

Cornell University

Martin Burtscher

Texas State University

Neal Crago

NVIDIA

Lieven Eeckhout

U. of Ghent

Christian Fensch

Heriot-Watt University

United Kingdom

Björn Franke

University of Edinburgh

Germany

Xin Fu

U. of Houston

Isaac Gelado

NVIDIA

Michael Gerndt

TUM

Lee Howes

Facebook

Wei Chung Hsu

National Chiao Tung U.

Wen-mei Hwu

UIUC

Byunghyun Jang

U. of Mississippi

Daniel Jimenez

Texas A&M

Adwait Jog

William and Mary

United States

Ajay Joshi

Boston U.

John Kim

KAIST

Rainer Leupers

Aachen U.

James Lin

Shanghai Jiao Tong U.

Mikel Luján

University of Manchester

Simon McIntosh-Smith

University of Bristol, UK

United Kingdom

Avi Mendelson

Technion

Perhaad Mistry

AMD

Nacho Navarro † 2016

UPC

Oscar Plata

Department of Computer Architecture at the University of Malaga, Spain

Tim Rogers

Purdue U.

Norm Rubin

NVIDIA

Sat 12 Mar
Displayed time zone: Belfast change

Nacho Navarro^{† 2016}