A1: High Performance Numerical Solving - 1
-
Kuniyoshi ABE, Seiji FUJINO
Application of Eisenstat-SSOR Preconditioner to Realistic Stress Analysis Problem by Parallel Cache-Cache Computing (Download Abstract) -
Susumu YAMADA, Toshiyuki IMAMURA, Masahiko MACHIDA
Communication avoiding Neumann expansion preconditioner for LOBPCG method: Convergence property of exact diagonalization method for Hubbard model (Download Abstract)
B1: Parallel Systems for Physics and Simulations - 1
-
William Robert SAUNDERS, Eike Hermann MÜLLER, James GRANT
Long range forces in a performance portable Molecular Dynamics framework (Download Abstract) -
Marco MEONI, Raffaele PEREGO, Nicola TONELLOTTO
Popularity-based caching of CMS datasets (Download Abstract)
C1-REPARA: Third International Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms - 1
-
Rafael Asenjo, Jose Nunez-Yanez, Mohammad Hosseinabady, Angeles Navarro, Andres Rodriguez, Ruben Gran and Dario Suarez
Simultaneous Multiprocessing on a FPGA+CPU Heterogeneous System-On-Chip (Download Abstract) -
Leonardo Gazzarri
A tool to support FastFlow program design (Download Abstract)
D1-ParaFPGA: Keynote - Christian Pilato
-
Christian PILATO
Bridging the Gap between Software and Hardware Designers (Download Abstract)
A2: High Performance Numerical Solving - 2
-
Iain BETHUNE, Andreas GLOESS, Juerg HUTTER, Alfio LAZZARO, Hans PABST, Fiona REID
Porting of the DBCSR library for Sparse Matrix-Matrix Multiplications to Intel Xeon Phi systems (Download Abstract) -
Valeria MELE, Emil CONSTANTINESCU, Luisa CARRACCIUOLO, Luisa D'AMORE
Performance Prediction of a Parallel-in-Time Solver based on MGRIT. (Download Abstract)
B2: Parallel Systems for Physics and Simulations - 2
-
Alessandro COLOMBO, Liberato DE CARO, Davide Emilio GALLI
Memetic Phase Retrieval and HPC for the Imaging of Matter at Atomic Resolution (Download Abstract) -
Ferdinando AURICCHIO, Marco FEDELE, Marco FERRETTI, Adrien LEFIEUX, Rodrigo ROMAROWSKI, Luigi SANTANGELO, Alessandro VENEZIANI
Benchmarking a hemodynamics application on Intel based HPC systems: preliminary results (Download Abstract)
C2-REPARA: Third International Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms - 2
-
Dalvan Griebler
Higher-Level Parallelism Abstractions for Video Applications with SPar (Download Abstract) -
Rafael Asenjo, Alejandro Villegas, Angeles Navarro and Oscar Plata
Towards a Software Transactional Memory for heterogeneous CPU-GPU processors (Download Abstract)
D2-ParaFPGA: High Level Synthesis techniques and applications
-
Mohammad HOSSEINABADY, Jose NUNEZ-YANEZ
Pipelined Streaming Computation of Histogram in FPGA OpenCL (Download Abstract) -
Nuno PAULINO, Luís REIS, João M.P. CARDOSO
On Coding Techniques for Targeting FPGAs via OpenCL (Download Abstract) -
Ying Hao XU LIN, Miquel VIDAL, Beñat AREJITA, Javier DIAZ, Carlos ALVAREZ, Daniel JIMÉNEZ GONZÁLEZ, Xavier MARTORELL BOFILL, Filippo MANTOVANI
Implementation of the K-means algorithm on heterogeneous devices: a use case based on an industrial dataset (Download Abstract)
A3: High Performance Numerical Solving - 3
-
Siegfried COOLS, Jeffrey CORNELIS, Wim VANROOSE
On parallel performance and numerical stability of the pipelined Conjugate Gradient and BiCGStab algorithms (Download Abstract) -
Ambra ABDULLAHI HASSAN, Valeria CARDELLINI, Salvatore FILIPPONE
Solving Sparse Linear Systems of Equations using CAF (Download Abstract) -
Toshiyuki IMAMURA, Daichi MUKUNOKI, Yusuke HIROTA, Susumu YAMADA, Masahiko MACHIDA
Design Towards Modern High Performance LA Library Enabling Heterogeneity and Flexible Data Formats (Download Abstract) -
Luca BERGAMASCHI, Angeles MARTINEZ
Spectral acceleration of parallel iterative eigensolvers for large scale scientific computing (Download Abstract) -
Alejandro LAMAS DAVIÑA, Xavier CARTOIXÀ, Jose E. ROMAN
Scalable block-tridiagonal eigensolvers in the context of electronic structure calculations (Download Abstract) -
Sirine MARRAKCHI, Mohamed JEMNI
Solving Sparse Triangular Systems on a Multicore Machine (Download Abstract)
B3: Parallel Systems for Physics and Simulations - 3
-
Michael OBERSTEINER, Alfredo PARRA HINOJOSA, Mario HEENE, Hans-Joachim BUNGARTZ, Dirk PFLÜGER
A Highly-Scalable, Algorithm-Based Fault-Tolerant Solver for Gyrokinetic Plasma Simulations (Download Abstract) -
Olga OLKHOVSKAYA, Vladimir GASILOV, Mikhail YAKOBOVSKIY, Alexey KOTELNIKOV
Parallel ray tracing algorithm for numerical analysis in radiative media physics (Download Abstract) -
Giuseppe CIACCIO, Valerio CALVELLI, Fabio DI BENEDETTO
A Parallel Simulator of Quench in Superconducting Magnets (Download Abstract) -
Jonas SUKYS, Mira KATTWINKEL
SPMC: Scalable Python Markov Chain Monte Carlo with application to Bayesian parameter inference in stochastic ecological models (Download Abstract) -
Hong GUO, Aiqing ZHANG, Zeyao MO
A Parallel Module for Multiblock Structured Grids in JASMIN and its Applications (Download Abstract) -
Keiichiro FUKAZAWA, Takeshi SOGA, Takayuki UMEDA, Takeshi NANRI
Performance Evaluation and Optimization of MagnetoHydroDynamic Simulation for Planetary Magnetosphere with Xeon Phi KNL (Download Abstract)
C3-REPARA: Third International Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms - 3
-
J. D. Garcia Sanchez
Invited: GrPPI a general purpose parallel pattern interface (demo) (Download Abstract) -
M. Torquati
Invited: Design patterns matching parallel benchmarks (Download Abstract)
D3-ParaFPGA: Spatial computing
-
Thomas JANSON, Udo KEBSCHULL
Highly Parallel Lattice QCD Wilson Dirac Operator with FPGAs (Download Abstract) -
Nadeen Yassir GEBARA, Kermin FLEMING
Spatial Memory Trace Prediction (Download Abstract)
A4: Real-Time and Adaptive Systems - 1
-
Alessandro FANFARILLO, Davide DEL VENTO, Patrick NICHOLS
Optimizing Communication and Synchronization in CAF Applications (Download Abstract) -
Luis A. GARCÍA-GONZÁLEZ, César R. GARCÍA-JACAS, Liesner ACEVEDO-MARTINEZ, Rafael TRUJILLO-RASUA, Dirk ROOSE
Self-scheduling for Heterogeneous Distributed Tasks (Download Abstract)
B4: Energy Awareness and Efficiency - 1
-
Mark ENDREI, Chao JIN, Minh DINH, David ABRAMSON, Heidi POXON, Luiz DEROSE, Bronis R DE SUPINSKI
A Bottleneck-centric Tuning Policy for Optimizing Energy in Parallel Programs (Download Abstract) -
Daniele CESARINI, Andrea BARTOLINI, Luca BENINI
Energy Saving and Thermal Management Opportunities in a Workload-Aware MPI Runtime for a Scientific HPC Computing Node (Download Abstract)
D4-E-Aware: Energy Aware Scientific Computing on low power and heterogeneous architectures - 1
-
Filippo MANTOVANI, Enrico CALORE
Multi-node advanced performance and power analysis with Paraver (Download Abstract)
A5: Real-Time and Adaptive Systems - 2
-
Daniele DE SENSI, Peter KILPATRICK, Massimo TORQUATI
State-Aware Concurrency Throttling (Download Abstract) -
Ludek KUCERA
On architecture for future petascale computing (Download Abstract)
B5: Energy Awareness and Efficiency - 2
-
Patrick SCHIFFMANN, Dirk MARTIN, Gundolf HAASE, Günter OFFNER
Optimizing a RBF Interpolation Solver for Energy on Heterogeneous Systems (Download Abstract) -
Stefano CHERUBIN, Giovanni AGOSTA, Imane LASRI, Erven ROHOU, Olivier SENTIEYS
Implications of Reduced-Precision Computations in HPC: Performance, Energy and Error (Download Abstract)
C5: GPU computing - 2
-
Peter BENNER, Martin KÖHLER, Carolin PENKE
GPU Accelerated Storage Efficient Implementation of the QR Decomposition (Download Abstract)
D5-E-Aware: Energy Aware Scientific Computing on low power and heterogeneous architectures - 2
-
Enrico CALORE, Alessandro GABBANA, Sebastiano Fabio SCHIFANO, Raffaele TRIPICCIONE
Energy-efficiency evaluation of Intel KNL for HPC workloads (Download Abstract) -
Roberto ALFIERI, Sebastiano BERNUZZI, Pablo GALAVIZ, Albino PEREGO, David RADICE
Numerical relativity with many-core architectures (Download Abstract)
A6: Real-Time and Adaptive Systems - 3
-
Gal OREN, Guy MALAMUD
CalCul: A Python-based Workspace for High-Performance Parameters-Survey in Scientific Legacy Codes (Download Abstract) -
Marco GREBE, Tilman LACKO, Rita LOOGEN
Comparing Actor System Topologies and Parameters Using BeCoMe (Download Abstract)
B6: Energy Awareness and Efficiency - 3
-
Anamika CHOWDHURY, Madhura KUMARASWAMY, Michael GERNDT
Design-time Analysis for the READEX Tool Suite (Download Abstract) -
Giandomenico SPEZZANO, Andrea VINCI
A nature-inspired, anytime and parallel algorithm for Big Data stream clustering (Download Abstract)
C6: GPU computing - 3
-
Carlos CARRASCAL-MANZANARES, Alexandre IMPERIALE, Gilles ROUGERON, Vincent BERGEAUD, Lionel LACASSAGNE
A fast implementation of a multidomain spectral finite elements method on CPU and GPU applied to ultrasound propagation (Download Abstract) -
José I. ALIAGA, Ruyman REYES, Mehdi GOLI
SYCL-BLAS: Combining expression trees and kernel fusion on heterogeneous systems (Download Abstract)
D6-E-Aware: Energy Aware Scientific Computing on low power and heterogeneous architectures - 3
-
Piero VICINI
Large scale low power architectures computing system: status of ExaNeSt and EuroExa projects (Download Abstract) -
Andrea BIAGIONI
The brain on low power scalable architectures: efficient simulation of cortical slow waves and asynchronous states (Download Abstract) -
Lucia MORGANTI, Daniele CESINI, Andrea FERRARO, Elena CORNI, Antonio FALABELLA, Luca LAMA
The INFN COSA project experience and low power computing and storage (Download Abstract)
B8: High Performance Graph Analytics - 1
-
Thomas MESSI NGUÉLÉ, Maurice TCHUENTE, Jean-François MÉHAUT
Using Complex-Network Properties For Efficient Graph Analysis (Download Abstract) -
Mattia D'ANTONIO, Paolo D'ONORIO DE MEO, Giuseppe FIAMENI, Claudio CACCIARI
Characterization of genomic data using graph databases (Download Abstract)
C8: Load Balancing and Fault Tolerance - 1
-
Christian NEUGEBAUER, Rudolf BERRENDORF, Florian MANNUSS
Improving the Performance of Parallel SpMV Operations on NUMA Systems with Adaptive Load Balancing (Download Abstract) -
Steffen HIRSCHMANN, Malte BRUNN, Dirk PFLÜGER, Colin W. GLASS
Load balancing with p4est for Short-Range Molecular Dynamics with ESPResSo (Download Abstract)
D8: Compiler Directives for Parallel Computing - 1
-
Aidan Bernard Gerard CHALK, Alin Marin ELENA, Luke MASON
Task Based Parallelism with OpenMP: A Case Study with DL_POLY_4. (Download Abstract) -
James S. WILLIS, Matthieu SCHALLER, Pedro GONNET
An efficient SIMD implementation of pseudo-Verlet lists for neighbour interactions in particle-based codes (Download Abstract)
A9: Efficient I/O and Networking
-
Samantha Vanessa ADAMS, Olga ABRAMKINA, Yann MEURDESOIF, Mike REZNY
Parallel IO in the LFRic Infrastructure (Download Abstract) -
Andrew David BROWN, Simon William MOORE, David Barrie THOMAS, Andrey MOKHOV, Jeffrey Stephen REEVE, Tom KAZMIERSKI
Distributed event-based computing (Download Abstract)
B9: High Performance Graph Analytics - 2
-
Adrian Marek KŁUSEK, Witold DZWINEL
Efficient multi GPU implementation of exact and approximated k-Nearest Neighbour Search (Download Abstract) -
Katerina DIMITRAKOPOULOU, Nikolaos M. MISSIRLIS
Optimal Diffusion for load balancing in regular graphs (Download Abstract)
C9: Load Balancing and Fault Tolerance - 2
-
Thomas GONÇALVES, Marc PÉRACHE, Frédéric DESPREZ, Jean-François MÉHAUT
Dynamic Load Balancing of Monte Carlo Particle Transport Applications on HPC Clusters (Download Abstract) -
Dai YANG, Josef WEIDENDORFER, Carsten TRINITIS, Tilman KÜSTNER
Enabling Application-Integrated Proactive Fault Tolerance (Download Abstract)
D9: Compiler Directives for Parallel Computing - 2
-
Bronson MESSER, Thomas PAPATHEODORE
Exploiting Hierarchical Parallelism in an Astrophysical Equation of State using OpenACC and OpenMP (Download Abstract) -
Andrea CRIVELLINI, Matteo FRANCIOLINI
On the Implementation of OpenMP and Hybrid MPI/OpenMP Parallelization Strategies for an Explicit DG Solver (Download Abstract)
A10: Parallel Solutions for AI and Machine Learning
-
Jack DENNIS, Lei HUANG, William LIM, Hsiang-Huang WU, Yuzhong YAN
Implementing Deep Neural Networks on Fresh Breeze (Download Abstract) -
Giuseppe FIAMENI, Riccardo ZANELLA
A performance study of machine and deep learning frameworks on CINECA HPC systems (Download Abstract)
B10: Parallel Programming and Clouds
-
Vaidy SUNDERAM
Adaptive Execution of Parallel Programs on Grids and Clouds (Download Abstract) -
Fabio TORDINI, Marco ALDINUCCI, Paolo VIVIANI, Ivan MERELLI, Pietro LIÒ
Scientific Workflows on Clouds with Heterogeneous and Preemptible Instances (Download Abstract)
C10: GPU and accelerators
-
Paul F BAUMEISTER, Benedikt ROMBACH, Thorsten HATER, Sabine GRIESSBACH, Lars HOFFMANN, Markus BUEHLER, Dirk PLEITER
Strategies for Forward Modelling of Infrared Radiative Transfer on GPUs (Download Abstract) -
José FLICH, Alessandro CILARDO, Mario KOVAČ, Rafael TORNERO, Jose Maria MARTÍNEZ, Tomas PICORNELL
Deeply Heterogeneous Many-Accelerator Infrastructure for HPC Architecture Exploration (Download Abstract)
D10-EDGE: IoT and Edge Computing - 1
-
Jose NUNEZ-YANEZ
FPGAs for high-productivity and low-power edge computing (Download Abstract)
A11: High-level Parallel Programming Models
-
Dalvan GRIEBLER, Luiz Gustavo FERNANDES
Towards Distributed Parallel Programming Support for the SPar DSL (Download Abstract) -
Fabian WREDE, Breno Augusto DE MELO MENEZES, Luis Filipe DE ARAUJO PESSOA, Bernd HELLINGRATH, Fernando BUARQUE DE LIMA NETO, Herbert KUCHEN
High-level Parallel Implementation of Swarm Intelligence-based Optimization Algorithms with Algorithmic Skeletons (Download Abstract)
B11: Array Programming
-
Victoriano MONTESINOS CÁNOVAS, José Manuel GARCÍA CARRASCO
Vectorization Strategies for Ant Colony Optimization on Intel Architectures (Download Abstract) -
Ludomir OTESKI, Guillaume COLIN DE VERDIÈRE, Sylvain CONTASSOT-VIVIER, Stéphane VIALLE, Juliette RYAN
A GPU Based Optimization Strategy Efficient on Other Modern Architectures (Download Abstract)
D11-EDGE: IoT and Edge Computing - 2
-
Blesson VARGHESE, Nan WANG, Jianyu LI, Dimitrios S. NIKOLOPOULOS
Edge-as-a-Service: Towards Distributed Cloud Architectures (Download Abstract) -
Alexandros PATRAS, Spyros LALIS
Flexible Distributed Computing Across End-Devices, the Edge and the Cloud (Download Abstract) -
Kai CHEN, Blesson VARGHESE, Dimitrios S. NIKOLOPOULOS
Power Modelling for Heterogeneous Cloud-Edge Data Centers (Download Abstract) -
Christos KALOGIROU, Panos KOUTSOVASILIS, Manolis MAROUDAS, Christos D. ANTONOPOULOS, Spyros LALIS, Nikolaos BELLAS
Edge and Cloud Provider Cost Minimization by Exploiting Extended Voltage and Frequency Margins (Download Abstract)