Follow
Thomas Naughton
Title
Cited by
Cited by
Year
Proactive fault tolerance using preemptive migration
C Engelmann, GR Vallee, T Naughton, SL Scott
2009 17th Euromicro International Conference on Parallel, Distributed and …, 2009
1502009
A survey of MPI usage in the US exascale computing project
DE Bernholdt, S Boehm, G Bosilca, M Gorentla Venkata, RE Grant, ...
Concurrency and Computation: Practice and Experience 32 (3), e4851, 2020
1192020
A framework for proactive fault tolerance
G Vallee, K Charoenpornwattana, C Engelmann, A Tikotekar, ...
2008 Third International Conference on Availability, Reliability and …, 2008
922008
A comparison of Amazon Web Services and Microsoft Azure cloud platforms for high performance computing
C Kotas, T Naughton, N Imam
2018 IEEE International Conference on Consumer Electronics (ICCE), 1-4, 2018
892018
System-level virtualization for high performance computing
G Vallee, T Naughton, C Engelmann, H Ong, SL Scott
16th Euromicro Conference on Parallel, Distributed and Network-Based …, 2008
852008
Checkpoint/restart of virtual machines based on Xen
G Vallee, T Naughton, H Ong, SL Scott
Proceedings of the High Availability and Performace Computing Workshop …, 2006
652006
Fault injection framework for system resilience evaluation: fake faults for finding future failures
T Naughton, W Bland, G Vallee, C Engelmann, SL Scott
Proceedings of the 2009 workshop on Resiliency in high performance, 23-28, 2009
502009
System management software for virtual environments
G Vallée, T Naughton, SL Scott
Proceedings of the 4th international conference on Computing frontiers, 153-160, 2007
502007
Oscar clusters
J Mugler, T Naughton, SL Scott, B Barret, A Lumsdaine, JM Squyres, ...
Linux Symposium, 387, 2003
492003
A log-scaling fault tolerant agreement algorithm for a fault tolerant MPI
J Hursey, T Naughton, G Vallee, RL Graham
Recent Advances in the Message Passing Interface: 18th European MPI Users …, 2011
432011
Open Source Cluster Application Resources (OSCAR): design, implementation and interest for the [computer] scientific community
B Des Ligneris, S Scott, T Naughton, N Gorsuch
Proceeding of 17th Annual International Symposium on High Performance …, 2003
412003
Evaluation of fault-tolerant policies using simulation
A Tikotekar, G Vallée, T Naughton, SL Scott, C Leangsuksun
2007 IEEE International Conference on Cluster Computing, 303-311, 2007
402007
An analysis of hpc benchmarks in virtual machine environments
A Tikotekar, G Vallée, T Naughton, H Ong, C Engelmann, SL Scott
Euro-Par 2008 Workshops-Parallel Processing: VHPC 2008, UNICORE 2008, HPPC …, 2009
352009
Effects of virtualization on a scientific application running a hyperspectral radiative transfer code on virtual machines
A Tikotekar, G Vallée, T Naughton, H Ong, C Engelmann, SL Scott, ...
Proceedings of the 2nd workshop on System-level virtualization for high …, 2008
352008
Effects of virtualization on a scientific application running a hyperspectral radiative transfer code on virtual machines
A Tikotekar, G Vallée, T Naughton, H Ong, C Engelmann, SL Scott, ...
Proceedings of the 2nd workshop on System-level virtualization for high …, 2008
352008
Scalable and fault tolerant failure detection and consensus
A Katti, G Di Fatta, T Naughton, C Engelmann
Proceedings of the 22nd European MPI Users' Group Meeting, 1-9, 2015
302015
Toward a performance/resilience tool for hardware/software co-design of high-performance computing systems
C Engelmann, T Naughton
2013 42nd International Conference on Parallel Processing, 960-969, 2013
272013
System-level virtualization research at oak ridge national laboratory
SL Scott, G Vallée, T Naughton, A Tikotekar, C Engelmann, H Ong
Future Generation Computer Systems 26 (3), 304-307, 2010
202010
OSCAR meta-package system
J Mugler, T Naughton, SL Scott
19th International Symposium on High Performance Computing Systems and …, 2005
202005
Efficient checkpointing of virtual machines using virtual machine introspection
F Aderholdt, F Han, SL Scott, T Naughton
2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2014
182014
The system can't perform the operation now. Try again later.
Articles 1–20