Aurélien Cavelan
Aurélien Cavelan
E-mail confirmado em unibas.ch
Título
Citado por
Citado por
Ano
Assessing general-purpose algorithms to cope with fail-stop and silent errors
A Benoit, A Cavelan, Y Robert, H Sun
ACM Transactions on Parallel Computing (TOPC) 3 (2), 13, 2016
382016
Optimal resilience patterns to cope with fail-stop and silent errors
A Benoit, A Cavelan, Y Robert, H Sun
IEEE International Parallel and Distributed Processing Symposium, 202-211, 2016
352016
Towards optimal multi-level checkpointing
A Benoit, A Cavelan, V Le Fèvre, Y Robert, H Sun
IEEE Transactions on Computers 66 (7), 1212-1226, 2016
232016
Which verification for soft error detection?
L Bautista-Gomez, A Benoit, A Cavelan, SK Raina, Y Robert, H Sun
IEEE 22nd International Conference on High Performance Computing (HiPC), 2-11, 2015
162015
Coping with silent and fail-stop errors at scale by combining replication and checkpointing
A Benoit, A Cavelan, F Cappello, P Raghavan, Y Robert, H Sun
Journal of Parallel and Distributed Computing 122, 209-225, 2018
122018
Resilience for stencil computations with latent errors
A Fang, A Cavelan, Y Robert, AA Chien
2017 46th International Conference on Parallel Processing (ICPP), 581-590, 2017
112017
Identifying the right replication level to detect and correct silent errors at scale
A Benoit, A Cavelan, F Cappello, P Raghavan, Y Robert, H Sun
Proceedings of the 2017 Workshop on Fault-Tolerance for HPC at Extreme Scale …, 2017
112017
Assessing the impact of partial verifications against silent data corruptions
A Cavelan, SK Raina, Y Robert, H Sun
44th International Conference on Parallel Processing (ICPP), 440-449, 2015
112015
Coping with recall and precision of soft error detectors
L Bautista-Gomez, A Benoit, A Cavelan, SK Raina, Y Robert, H Sun
Journal of Parallel and Distributed Computing 98, 8-24, 2016
82016
Multi-level checkpointing and silent error detection for linear workflows
A Benoit, A Cavelan, Y Robert, H Sun
Journal of Computational Science, 2017
72017
Voltage overscaling algorithms for energy-efficient workflow computations with timing errors
A Cavelan, Y Robert, H Sun, F Vivien
Proceedings of the 5th Workshop on Fault Tolerance for HPC at eXtreme Scale …, 2015
72015
Two-level checkpointing and verifications for linear task graphs
A Benoit, A Cavelan, Y Robert, H Sun
IEEE International Parallel and Distributed Processing Symposium Workshops …, 2016
6*2016
Combining checkpointing and replication for reliable execution of linear workflows with fail-stop and silent errors
A Benoit, A Cavelan, FM Ciorba, V Le Fèvre, Y Robert
International Journal of Networking and Computing 9 (1), 2-27, 2019
52019
Combining checkpointing and replication for reliable execution of linear workflows
A Benoit, A Cavelan, FM Ciorba, V Le Fevre, Y Robert
2018 IEEE International Parallel and Distributed Processing Symposium …, 2018
52018
When Amdahl Meets Young/Daly
A Cavelan, J Li, Y Robert, H Sun
IEEE International Conference on Cluster Computing (CLUSTER), 203-212, 2016
52016
Algorithm-Based Fault Tolerance for Parallel Stencil Computations
A Cavelan, FM Ciorba
2019 IEEE International Conference on Cluster Computing (CLUSTER), 1-11, 2019
32019
Resilient n-body tree computations with algorithm-based focused recovery: Model and performance analysis
A Cavelan, A Fang, AA Chien, Y Robert
International Workshop on Performance Modeling, Benchmarking and Simulation …, 2017
32017
Optimal checkpointing period with replicated execution on heterogeneous platforms
A Benoit, A Cavelan, V Le Fèvre, Y Robert
Proceedings of the 2017 Workshop on Fault-Tolerance for HPC at Extreme Scale …, 2017
32017
rDLB: A novel approach for robust dynamic load balancing of scientific applications with independent tasks
A Mohammed, A Cavelan, FM Ciorba
2019 International Conference on High Performance Computing & Simulation …, 2019
22019
SPH-EXA: Optimizing Smoothed Particle Hydrodynamics for Exascale Computing
FM Ciorba, L Mayer, RM Cabezón, D Imbert, D Guerrera, A Cavelan, ...
Project Poster at the 34th International Conference on High Performance …, 2019
22019
O sistema não pode executar a operação agora. Tente novamente mais tarde.
Artigos 1–20