Monday, Sept. 7

From 8:40

8:40

Opening

9:30 – 10:30

Invited talks (2 slots x 40 min.)
Chairperson: Ewa Deelman
 The Future of Advanced Computing  Daniel Reed
 Parallel Processing for Large Scale Community Detection in Social and Bio Medical Networks  Boleslaw Szymanski

10:30 – 11:00

11:00 – 12:40

Contributed papers in parallel (4 slots x 25 min.)

Track A: WS on GPU Computing
Chairperson: Enrique S. Quintana-Orti
 Revisiting the Gauss-Huard Algorithm on Graphics Accelerators  P. Benner, P. Ezzatti, E. S. Quintana-Orti and A. Remon-Gomez
 Increasing arithmetic intensity in multigrid methods on GPUs using block smoothers  M. Bolten and O. Letterer
 Optimized CUDA-based PDE Solver for Reaction Diffusion Systems on Arbitrary Surfaces  S. M. Descombes, D. S. Dhillon and M. Zwicker
 Comparing Different Programming Approaches for SpMV-Operations on GPUs  J. P. Ecker, R. Berrendorf, J. Razzaq, S. E. Scholl and F. Mannuss

Track B: Main Track: Numerical Algorithms and Parallel Scientific Computing
Chairperson: Marc Baboulin
 A bucket sort algorithm for the particle-in-cell method on manycore architectures  A. Jocksch, F. Hariri, T. M. Tran, S. Brunner, C. Gheller and L. Villard
 Experience on vectorizing Lattice Boltzmann kernels for multi- and many-core architectures  E. Calore, N. Demo, S. F. Schifano and R. Tripiccione
 Performance analysis of the Kahan-enhanced scalar product on current multicore processors  J. Hofmann, D. Fey, J. Eitzinger, G. Hager, G. Wellein and M. Riedmann
 Performance Analysis of the Chebyshev Basis Conjugate Gradient Method on the K Computer  Y. Kumagai, A. Fujii, T. Tanaka, Y. Hirota, T. Fukaya, T. Imamura and R. Suda

Track C: Main Track: Applications of Parallel Computing
Chairperson: Alex Druinsky
 Synthetic Signature Program for Performance Scalability  J. Panadero, A. Wong, D. Rexachs and E. Luque
 FEniCS-HPC: Automated predictive high-performance finite element computing  J. Jansson, J. Hoffman and N. Jansson
 Accelerating NWChem Coupled Cluster through dataflow-based Execution  H. McCraw, A. Danalis, G. Bosilca and J. Dongarra
 Parallelization and Optimization of a CAD Model Processing Tool from the Automotive Industry to Distributed Memory Parallel Computers  L. F. Ayuso, J. J. Durillo, T. Fahringer, B. Kornberger and M. Schifko

Track D: Main Track: Non-numerical Algorithms
Chairperson: Marian Bubak
 Running Time Prediction for Web Search Queries  O. Rojas, V. G. Costa and M. Marin
 Comparison of Large Graphs Using Distance Information  W. Czech, W. Mielczarek and W. Dzwinel
 Fast Incremental Community Detection on Dynamic Graphs  A. Zakrzewska and D. Bader
 A Diffusion Process for Graph Partitioning: its Solutions and their Refinement  A. Jocksch

Track E: Workshop on Models, Algorithms and Methodologies for Hybrid Parallelism in New HPC Systems
Chairpersons: Marco Lapegna and Giuliano Laccetti
 Virtualizing CUDA enabled GPGPUs on ARM clusters  R. Montella, G. Giunta, G. Laccetti, M. Lapegna, C. Palmieri, C. Ferraro and V. Pelliccia
 A Distributed Hash Table for Shared Memory  W. Oortwijn, T. van Dijk and J. van de Pol
 Mathematical Approach to the Performance Evaluation of Matrix-matrix Multiply Algorithm on a Two Level Parallel Architecture  L. D'Amore, V. Mele, G. Laccetti and A. Murli
 How to mitigate node failures in hybrid parallel applications  M. Szpindler

12:40 – 13:40

13:40 – 15:40

Invited talks (3 slots x 40 min. in parallel)
Track A:
Chairperson: Boleslaw Szymanski
 Parallel Data Analytics and the Role of Stratified Data Placement  Srinivasan Parthasarathy
 Massive-Scale Graph Analytics  David A. Bader
 Algorithmic time, energy, and power tradeoffs in graph computations  Richard Vuduc
Track B:
Chairperson: Jeffrey Vetter
 Directive-Based Parallel Programming in an Age of Diversity  Barbara Chapman
 Parallel Computing: from traditional execution time minimization to multi-objective optimization  Rizos Sakellariou
 Adding New Flavours to GEMM: ARM big.LITTLE Architectures and Fault Tolerance  Enrique S. Quintana-Orti

15:40 – 16:10

16:10 – 17:50

Contributed papers in parallel (4 slots x 25 min.)

Track A: WS on Parallel Computational Biology
Chairperson: Roman Wyrzykowski
 Performance analysis of a parallel, multi-node pipeline for DNA sequencing  D. Decap, J. Reumers, Ch. Herzeel, P. Costanza and J. Fostier
 Engineering the Computation of Minimal Absent Words  C. Barton, A. Heliou, L. Mouchard and S. Pissis
 Accelerating 3D Protein Structure Similarity Searching on Microsoft Azure Cloud with Local Replicas of Macromolecular Data  D. Mrozek and B. Malysiak-Mrozek
 A Finite Automata Approach for Large-scale DNA analysis on Multicore Architectures  S. Memeti and S. Pllana

Track B: Main Track: Numerical Algorithms and Parallel Scientific Computing
Chairperson: Przemyslaw Stpiczynski
 Dense Symmetric Indefinite Factorization on GPU Accelerated Architectures  M. Baboulin, J. Dongarra, A. Remy, S. Tomov and I. Yamazaki
 A Parallel Multi-Threaded Solver for Symmetric Positive Definite Bordered Band Linear Systems  P. Benner, P. Ezzatti, E. S. Quintana-Orti and A. Remon-Gomez
 Comparative Performance Analysis of Coarse Solvers for Algebraic Multigrid on Leading Multicore Architectures  A. Druinsky, P. Ghysels, S. Li, O. Marques, S. Williams, A. Barker, D. Kalchev and P. Vassilevski
 LU Preconditioning for Overdetermined Sparse Least Squares Problems  G. Howell and M. Baboulin

Track C: Main Track: Environments and Tools for Distributed/Cloud/Grid Computing
Chairperson: Krzysztof Zieliński
 Distributed Computing Infrastructure as a Tool for e-Science  J. Kitowski, K. Wiatr, Ł. Dutka, M. Twardy, T. Szepieniec, M. Sterzel, R. Słota and R. Pająk
 A lightweight approach for deployment of scientific workflows in cloud infrastructures  B. Balis, M. Bubak, K. Figiela, M. Malawski and M. Pawlik
 Scalable Distributed Two-Layer Block Based Datastore  A. Krechowicz, S. Deniziak, M. Bedla, A. Chrobot and G. Łukawski
 Metadata Organization and Management for Globalization of Data Access with onedata  M. Wrzeszcz, T. Lichoń, R. Słota, K. Zemek, K. Trzepla, Ł. Opioła, D. Nikolow, Ł. Dutka, R. Słota and J. Kitowski

Track D: Main Track: Non-numerical Algorithms
Chairperson: Franciszek Seredyński
 A Parallel Algorithm for LZW decompression, with GPU implementation  S. Funasaka, K. Nakano and Y. Ito
 A Parallel FDFM Approach for Breaking Weak RSA Keys using the FPGA  X. Zhou, K. Nakano and Y. Ito
 Parallel Induction of Nondeterministic Finite Automata  T. Jastrząb, Z. J. Czech and W. Wieczorek

18:20

