Compilers, Hands-Off My Hands-On Optimizations (WPMVP 2016 - - 3rd Workshop on Programming Models for SIMD/Vector Processing)

Sat 12 - Wed 16 March 2016 Barcelona, Spain

Track

WPMVP 2016

Time Zone

The program is currently displayed in (GMT) Belfast.

Use conference time zone: (GMT) BelfastSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Sun 13 Mar 2016 12:00 - 12:30 at Mallorca - Session 2 - Programming Models Chair(s): Joel Falcou

Abstract

Achieving high performance for compute bounded numerical kernels typically requires an expert to hand select an appropriate set of Single-instruction multiple-date (SIMD) instructions, then statically scheduling them in order to hide their latency while avoiding register spilling in the process. Unfortunately, this necessary amount of control of the code forces the expert to trade programming abstraction for performance which is why many performance critical kernels are written in assembly language. An alternative is to either resort to auto-vectorization (see Figure 1) or to use intrinsic functions, both features offered by compilers. However, in both scenarios the expert loses control over which instructions are selected, which optimizations are applied to the code and moreover how the instructions are scheduled for a target architecture. Ideally, the expert would need assembly-like control over their SIMD instructions beyond what intrinsics provide while maintaining a C-level abstraction for the non-performance critical parts. In this paper, we bridge the gap between performance an ab- straction for SIMD instructions through the use of custom macro intrinsics that provide the programmer control over the instruction selection, and scheduling, while leveraging the compiler to manage the registers. This provides the best of both assembly and vector intrinsics programming so that a programmer can obtain high performance implementations within the C programming language.

Time Zone

The program is currently displayed in (GMT) Belfast.

Use conference time zone: (GMT) BelfastSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Sun 13 Mar
Displayed time zone: Belfast change

11:00 - 12:30	Session 2 - Programming ModelsWPMVP at Mallorca Chair(s): Joel Falcou LRI, Université Paris-Sud

11:00 30m Talk		Support for Data Parallelism in the CAL Actor Language WPMVP Essayas Gebrewahid Halmstad University
11:30 30m Talk		An Evaluation of Current SIMD Programming Models for C++ WPMVP Angela Pohl TU Berlin
12:00 30m Talk		Compilers, Hands-Off My Hands-On Optimizations WPMVP Richard Veras Carnegie Mellon University

Compilers, Hands-Off My Hands-On Optimizations

Sun 13 Mar
Displayed time zone: Belfast change

Richard Veras

Carnegie Mellon University

Tracks

Workshops

Compilers, Hands-Off My Hands-On Optimizations

Program Display Configuration

Program Display Configuration

Sun 13 MarDisplayed time zone: Belfast change

Richard Veras

Carnegie Mellon University

Sun 13 Mar
Displayed time zone: Belfast change