Deep neural networks (DNNs) have recently achieved extraordinary results in domains such as computer vision and speech recognition. An essential element of this success has been the introduction of high performance computing (HPC) techniques into the critical step of training the neural network. This paper describes the implementation and analysis of a network-agnostic and convergence-invariant coarse-grain parallelization of the DNN training algorithm. The coarse-grain parallelization is achieved by exploiting batch-level parallelism. This strategy does not depend on specialized or optimized libraries, so the optimization is immediately available for accelerating DNN training. The proposal is also compatible with multi-GPU execution without altering the convergence rate of the algorithm. The parallelization has been implemented in Caffe, a state-of-the-art DNN framework. The paper describes the code transformations required for the parallelization and identifies the performance factors that limit the approach. We show competitive performance results for two state-of-the-art computer vision datasets, MNIST and CIFAR-10. In particular, on a 16-core Xeon E5-2667v2 at 3.30 GHz we observe speedups of 8× over the sequential execution, at performance levels similar to those obtained by the GPU-optimized Caffe version on an NVIDIA K40 GPU.
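The abstract's core idea is batch-level parallelism that leaves the sequential algorithm's convergence behaviour untouched. Below is a minimal OpenMP sketch of that idea, not the paper's actual Caffe implementation: the mini-batch is split across threads, per-thread gradients are reduced, and a single averaged weight update is applied. The functions sample_gradient and sgd_step are hypothetical placeholders standing in for a real forward/backward pass and solver step.

```cpp
// Sketch of batch-level parallelism (hypothetical code, not the paper's Caffe
// changes): split a mini-batch across OpenMP threads, reduce per-thread
// gradients, apply one averaged update so the result matches the sequential run.
#include <omp.h>
#include <cmath>
#include <cstdio>
#include <vector>

// Placeholder per-sample gradient; stands in for a forward/backward pass.
static std::vector<double> sample_gradient(const std::vector<double>& w, int i) {
    std::vector<double> g(w.size());
    for (size_t k = 0; k < w.size(); ++k)
        g[k] = std::sin(0.1 * i + static_cast<double>(k)) * w[k];  // arbitrary stand-in
    return g;
}

static void sgd_step(std::vector<double>& w, int batch, double lr) {
    std::vector<double> grad(w.size(), 0.0);

    #pragma omp parallel
    {
        std::vector<double> local(w.size(), 0.0);

        // Each thread accumulates gradients for a disjoint slice of the batch.
        #pragma omp for nowait
        for (int i = 0; i < batch; ++i) {
            std::vector<double> g = sample_gradient(w, i);
            for (size_t k = 0; k < g.size(); ++k) local[k] += g[k];
        }

        // Reduce the private buffers into the shared gradient.
        #pragma omp critical
        for (size_t k = 0; k < grad.size(); ++k) grad[k] += local[k];
    }

    // One averaged update, identical to what the sequential code would apply.
    for (size_t k = 0; k < w.size(); ++k)
        w[k] -= lr * grad[k] / batch;
}

int main() {
    std::vector<double> weights(1000, 0.5);
    for (int step = 0; step < 10; ++step)
        sgd_step(weights, /*batch=*/256, /*lr=*/0.01);
    std::printf("w[0] after 10 steps: %f\n", weights[0]);
    return 0;
}
```

Compiled with g++ -fopenmp and run with increasing thread counts, a sketch like this illustrates in miniature the kind of batch-level multi-core scaling the paper reports, while producing the same weight updates as the single-threaded loop.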
Mon 14 Mar (times shown in the Belfast time zone)

Session 10:00 - 11:15, Main conference:
- 10:00 (25m) Talk: Coarse Grain Parallelization of Deep Neural Networks
- 10:25 (25m) Talk: High Performance Model Based Image Reconstruction. Xiao Wang (Purdue University, USA), Amit Sabne (School of Electrical and Computer Engineering, Purdue University), Sherman Kisner (High Performance Imaging LLC), Anand Raghunathan, Charles Bouman, Samuel Midkiff (School of Electrical and Computer Engineering, Purdue University)
- 10:50 (25m) Talk: Exploiting Accelerators for Efficient High Dimensional Similarity Search