Dr. Leonardo A. Bautista Gomez (Leo)

Team Leader and Senior Researcher at Barcelona Supercomputing Center
Address: Placa Eusebi Guell 1-3, 08034, Barcelona, Spain
Email: leonardo (dot) bautista (at) bsc (dot) es

CV generated on March 29, 2024



Education

  • Ph. D. in Science :
  • PhD. in Math. & Computing Sciences at the Tokyo Institute of Technology, Tokyo, Japan. (Sep. 2012)
  • Master in Science :
  • Master in Distributed Systems at the Universite Pierre & Marie Curie, Paris, France. (Sep. 2009)
  • Bachelor in Science :
  • Bachelor in Computer Sciences at the Universite Pierre & Marie Curie, Paris, France. (Jun. 2006)
  • High School Diploma :
  • Bachillerato at the French-Nicaraguan School Victor Hugo, Managua, Nicaragua. (Nov. 2001)
  • Secondary School :
  • Elementary and Secondary at the Swiss School Refous, Bogota, Colombia. (Nov. 2000)

    Work Appointments

  • Senior Researcher :
  • Senior Researcher at Barcelona Supercomputing Center, Barcelona, Spain. Leading research on memory reliability for low-power devices and hierarchical memories. Interest on error injection analysis and approximate computing. Continuous integration and maintenance of the multilevel checkpoint library FTI. (Mar. 2016 - Ongoing)
  • Postdoctoral Researcher :
  • Postdoctoral Researcher at Argonne National Laboratory, Lemont, IL, USA. Research in resilience covering silent data corruption detection through the use of machine learning. Integration of SDC detectors into a resilience library. Improvement and maintenance of FTI as a research and production tool. (Apr. 2013 - Feb. 2016)
  • Postdoctoral Appointee :
  • Postdoctoral Appointee at Tokyo Institute of Technology, Tokyo, Japan. Development and evaluation of a library integrating fast multilevel checkpointing and failure prediction with proactive checkpointing. Partial parallelization study of an application simulating the Fukushima power plant accident. (Oct. 2012 - Mar. 2013)

    Awards and Fellowships

  • TCSC Early Career :
  • 2016 IEEE Technical Committee on Scalable Computing (TCSC) Award for Excellence in Scalable Computing for Early Career Researchers. (December 2016)
  • Marie Curie Fellowship :
  • Marie Sklodowska-Curie Actions Fellowship granted by the European Commission for the two-years project "DURO : Deep-memory Ubiquity, Reliability and Optimization". (February 2016)
  • George Michael Fellow :
  • ACM/IEEE George Michael Memorial High Performance Computing Ph.D. Fellow at Supercomputing Conference 2011 (SC11), Honorable Mention. (November 2011)
  • SC11 Perfect Score :
  • Special Certificate of Recognition for achieving a perfect score at the Supercomputing Conference 2011, "FTI : High Performance Fault Tolerance Interface for Hybrid Systems". (November 2011)
  • JSPS PhD Fellowship :
  • Japanese Society for the Promotion of Science (JSPS), Research Fellowships for Young Scientists (Doctoral Course), from April 2010 until March 2012 (About JPY 3'100,000/year). (April 2010)
  • First National Debate :
  • First Place Winner in the 1st National High School Speech and Debate Tournament in Managua, Nicaragua. (May 2001)

    Languages

  • Spanish :
  • Native (Teaching Experience)
  • French :
  • Fluent (DELF, DALF, DEUG)
  • English :
  • Fluent (IELTS 8.0 - C1)
  • Japanese :
  • Intermediate (JLPT N5, JLPT N4)

    Other Unrelated Skills

  • Private Pilot :
  • Licensed Private Pilot (About 50 flight hours).
  • Boat Skipper :
  • Licensed Recreational Boat Skipper in Spain (PER).
  • Open Water Diver :
  • Licenced (PADI) Open Water Diver (OWD).
  • Triathlon Finisher :
  • Olympic Triathlon (Chicago 2013, 2014, 2015, Barcelona 2016, 2017, 2018, 2019) Sprint Triathlon (Barcelona 2021).
  • Emergency Responce :
  • Certified Emergency First Responder.

    Peer-Reviewed Journal Papers

  • IJHPCA'21 :
  • Resiliency in Numerical Algorithm Design for Extreme Scale Simulations - Agullo, E., Altenbernd, M., Anzt, H., Bautista-Gomez, L., Benacchio, T., Bonaventura, L., Bungartz, H.J., Chatterjee, S., Ciorba, F.M., DeBardeleben, N. and Drzisga, D., International Journal of High Performance Computing Applications (IJHPCA).
  • FUTGEN'20 :
  • Extending the OpenCHK Model with advanced checkpoint features - Marcos Maroñas, Sergi Mateo, Kai Keller, Leonardo Bautista-Gomez, Eduard Ayguadé, Vicenç Beltran, Journal on Future Generation Computer Systems (FUTGEN).
  • Book - Chapter 4 :
  • Chapter 4 : The Mont-Blanc Prototype - Filippo Mantovani, Daniel Ruiz, Leonardo Bautista, Vishal Metha, Fabio Banchelli, Nikola Rajovic, Eduard Ayguade, Jesus Labarta, Mateo Valero, Alejandro Rico Carro, Alex Ramirez Bellido, Markus Geimer, Daniele Tafani, Contemporary High Performance Computing, From Petascale toward Exascale (Book chapter).
  • PARCO'19 :
  • Checkpoint/Restart Approaches for a Thread-Based MPI Runtime - Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Pérache, Patrick Carribault, Julien Jaeger, Allen D Malony, Sameer Shende, Journal on Parallel Computing, Systems and Applications (PARCO).
  • SUSCOM'18 :
  • Exploring The Capabilities of Support Vector Machines in Detecting Silent Data Corruptions - Omer Subasi, Sheng Di, Leonardo Bautista-Gomez, Prasanna Balaprakash, Osman Unsal, Jesus Labarta, Adrian Cristal, Sriram Krishnamoorthy, Franck Cappello, Journal on Sustainable Computing, Informatics and Systems (SUSCOM).
  • TPDS'17 :
  • Toward General Software Level Silent Data Corruption Detection for Parallel Applications - Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, Franck Cappello, IEEE Transactions on Parallel and Distributed Systems (TPDS).

    International Peer-Reviewed Conference Papers

  • HiPC'21 :
  • Towards Zero-Waste Recovery and Zero-Overhead Checkpointing in Ensemble Data Assimilation - Kai Keller and Leonardo Bautista-Gomez, IEEE 28th International Conference on High Performance Computing, Data, and Analytics 2021 (HiPC'21), Bangalore, India.
  • BCCA'21 :
  • Discovering the Ethereum2 P2P Network - Mikel Cortes-Goicoechea, Leonardo Bautista-Gomez, The 3rd IEEE International Conference on Blockchain Computing and Applications 2021 (BCCA'21), Tartu, Estonia.
  • BRAINS'21 :
  • Resource Analysis of Ethereum 2.0 Clients - Mikel Cortes-Goicoechea, Luca Franceschini, Leonardo Bautista-Gomez, The 3rd IEEE Conference on Blockchain Research and Applications for Innovative Networks and Services 2021 (BRAINS'21), Paris, France.
  • IOLTS'21 :
  • FPGA Checkpointing for Scientific Computing - Marc Perello Bacardit, Leonardo Bautista-Gomez and Osman Unsal, The 27th IEEE International Symposium on On-Line Testing and Robust System Design 2021 (IOLTS'21), Naples, Italy.
  • HPDC'21 :
  • An Oracle for Guiding Large-Scale Model/Hybrid Parallel Training of Convolutional Neural Networks - Albert Njoroge Kahira, Truong Thao Nguyen, Leonardo Bautista-Gomez, Ryousei Takano, Rosa M Badia, Mohamed Wahib, ACM Symposium on High-Performance Parallel and Distributed Computing 2021 (HPDC'21), Stockholm, Sweden.
  • CCGrid'21 :
  • Co-Designing Multi-Level Checkpoint Restart for MPI Applications - Konstantinos Parasyris, Giorgis Georgakoudis, Leonardo Bautista-Gomez and Igansio Laguna, International Symposium on Cluster Cloud and Grid Computing 2021 (CCGrid'21), Melbourne, Australia.
  • HPCS'20 :
  • A Study of Checkpointing in Large Scale Training of Deep Neural Networks - Elvis Rojas, Albert N. Kahira, Esteban Meneses, Leonardo Bautista-Gomez and Rosa M Badia, The International Conference on High Performance Computing and Simulation 2020 (HPCS'20), Barcelona, Spain.
  • HiPC'20 :
  • Design and Study of Elastic Recovery in HPC Applications - Kai Keller, Konstantinos Parasyris and Leonardo Bautista-Gomez, IEEE 27th International Conference on High Performance Computing, Data, and Analytics 2020 (HiPC'20), Pune, India.
  • CCGrid'20 :
  • Checkpoint Restart Support for Heterogeneous HPC Applications - Konstantinos Parasyris, Kai Keller and Leonardo Bautista-Gomez, International Symposium on Cluster Cloud and Grid Computing 2020 (CCGrid'20), Melbourne, Australia.
  • EuroPar'19 :
  • Checkpointing Kernel Executions of MPI+ CUDA Applications - Max Baird, Sven-Bodo Scholz, Artjoms Sinkarovs and Leonardo Bautista-Gomez, European Conference on Parallel Processing 2019 (EuroPar'19), Gottingen, Germany.
  • CCGrid'19 :
  • Application-Level Differential Checkpointing for HPC Applications with Dynamic Datasets - Kai Keller and Leonardo Bautista-Gomez, International Symposium on Cluster Cloud and Grid Computing 2019 (CCGrid'19), Larnica, Cyprus.
  • ICPADS'17 :
  • Portable Topology-Aware MPI-I/O - Rob Latham, Leonardo Bautista-Gomez, Pavan Balaji, IEEE 23rd International Conference on Parallel and Distributed Systems 2017 (ICPADS'17), Shenzen, China.
  • SC'16 :
  • Unprotected Computing : A Large-Scale Study of DRAM Raw Error Rate on a Supercomputer - Leonardo Bautista-Gomez, Ferad Zyulkyarov, Simon McIntosh-Smith, Osman Unsal, International Conference for High Performance Computing, Networking, Storage, and Analysis 2016 (SC'16), Salt Lake City, UT, USA. (Acceptance rate is 18.3%)
  • IGSC'16 :
  • Monitoring strategies for scalable dynamic checkpointing - Swann Perarnau, Leonardo Bautista-Gomez, Seventh International Green and Sustainable Computing Conference 2016 (IGSC'16), Hangzhou, China.
  • Cluster'16 :
  • Adaptive performance-constrained in situ visualization of atmospheric simulations - Matthieu Dorier, Robert Sisneros, Leonardo Bautista Gomez, Tom Peterka, Leigh Orf, Lokman Rahmani, Gabriel Antoniu, Luc Bougé, IEEE International Conference on Cluster Computing 2016 (Cluster'16), Taipei, Taiwan.
  • EuroPar'16 :
  • Exploring partial replication to improve lightweight silent data corruption detection for HPC applications - Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, Franck Cappello, European Conference on Parallel Processing 2016 (EuroPar16), Grenoble FRANCE.
  • IPDPS'16 :
  • Reducing Waste in Extreme Scale Systems through Introspective Analysis - Leonardo Bautista-Gomez, Ana Gainaru, Swann Perarnau, Devesh Tiwari, Saurabh Gupta, Christian Engelmann, Franck Cappello and Marc Snir,, 30th IEEE International Parallel and Distributed Processing Symposium 2016 (IPDPS'16), Chicago, IL, USA.
  • CCGrid'16 :
  • Spatial Support Vector Regression to Detect Silent Errors in the Exascale Era - Omer Subasi, Sheng Di, Leonardo Bautista-Gomez, Prasanna Balaprakash, Osman Unsal, Jesus Labarta, Adrian Cristal, Franck Cappello, International Symposium on Cluster Cloud and Grid Computing 2016 (CCGrid'16), Cartagena, Colombia.
  • HiPC'15 :
  • Which Verification for Soft Error Detection? - Leonardo Bautista-Gomez, Anne Benoit, Aurelien Cavelan, Saurabh K. Raina, Yves Robert and Hongyang Sun, 22nd IEEE International Conference on High Performance Computing 2015 (HiPC'15), Bangalore, INDIA. (Acceptance rate 23%)
  • EuroMPI'15 :
  • Detecting Silent Data Corruption for Extreme-Scale MPI Applications - Leonardo Bautista-Gomez, Franck Cappello, 22nd European MPI Users' Group Meeting, 2015 (EuroMPI'15), Bordeaux, FRANCE.
  • Cluster'15 :
  • Detecting and correcting data corruption in stencil applications through multivariate interpolation - Leonardo Arturo Bautista Gomez, Franck Cappello, IEEE International Conference on Cluster Computing 2015 (Cluster'15)
  • HPCC'15 :
  • Exploiting Spatial Smoothness in HPC Applications to Detect Silent Data Corruption - Leonardo Bautista-Gomez, Franck Cappello, International Conference on High Performance Computing and Communications 2015 (HPCC'15), New York, NY, USA.
  • HPDC'15 :
  • Lightweight silent data corruption detection based on runtime data analysis for HPC applications - Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, Franck Cappello, Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing 2015 (HPDC'15).
  • SC'14 :
  • Optimization of Multi-level Checkpoint Model with Uncertain Execution Scales - Sheng Di, Leonardo Bautista-Gomez, Franck Cappello, International Conference for High Performance Computing, Networking, Storage, and Analysis 2014 (SC'14), New Orleans, LU, USA. (Acceptance rate 20.8%)
  • IPDPS'14 :
  • Optimization of Multi-level Checkpoint Model for Large Scale HPC Applications - Sheng Di, Mohamed Slim Bouguerra, Leonardo Bautista-Gomez, Franck Cappello, 28th IEEE International Parallel and Distributed Processing Symposium 2014 (IPDPS'14), Phoenix, AZ, USA. (Acceptance rate 21.1%)
  • SNA-MC'13 :
  • SAMPSON Parallel Computation for Sensitivity Analysis of TEPCO's Fukushima Daiichi Nuclear Power Plant Accident - Marco Pellegrini, Leonardo Bautista-Gomez, Naoya Maruyama, Masanori Naitoh, Satoshi Matsuoka, Franck Cappello, Joint International Conference on Supercomputing in Nuclear Applications and Monte Carlo 2013 (SNA-MC'13), Paris, France.
  • IPDPS'13 :
  • Improving the computing efficiency of HPC systems using a combination of proactive and preventive checkpointing - Mohamed Slim Bouguerra, Ana Gainaru, Leonardo Bautista-Gomez, Franck Cappello, Naoya Maruyama, Satoshi Matsuoka, IEEE International Parallel and Distributed Processing Symposium 2013 (IPDPS'13), Boston, MA, USA. (Acceptance rate 21.0%)
  • Cluster'12 :
  • Hierarchical Clustering Strategies for Fault Tolerance in Large Scale HPC Systems - Leonardo Bautista-Gomez, Thomas Ropars, Naoya Maruyama, Franck Cappello, Satoshi Matsuoka, IEEE International Conference on Cluster Computing 2012 (Cluster'12), Beijing, China.
  • EuroPar'12 :
  • Scalable Reed-Solomon-based Reliable Local Storage for HPC Applications in IaaS Clouds - Leonardo Bautista-Gomez, Bogdan Nicolae, Naoya Maruyama, Franck Cappello, Satoshi Matsuoka, International European Conference on Parallel and Distributed Computing 2012 (EuroPar'12), Rhodes Island, Greece. (Acceptance rate 32.8%)
  • SC'11 :
  • FTI: high performance Fault Tolerance Interface for hybrid systems - Leonardo Bautista-Gomez, Naoya Maruyama, Dimitri Komatitsch, Seiji Tsuboi, Franck Cappello, Satoshi Matsuoka, Takeshi Nakamura, International Conference for High Performance Computing, Networking, Storage, and Analysis 2011 (SC'11), Seattle, WA, USA. (Acceptance rate 21.0%)
  • HiPC'10 :
  • Low-overhead Diskless Checkpoint for Hybrid Computing Systems - Leonardo Bautista-Gomez, Naoya Maruyama, Akira Nukada, Franck Cappello, Satoshi Matsuoka, International Conference on High Performance Computing 2010 (HiPC'10), Goa India. (Acceptance rate 19.2%)
  • CCGrid'10 :
  • Distributed Diskless Checkpoint for Large Scale Systems - Leonardo Bautista-Gomez, Naoya Maruyama, Franck Cappello, Satoshi Matsuoka, International Symposium on Cluster Cloud and Grid Computing 2010 (CCGrid'10), Melbourne Australia. (Acceptance rate 23.2%)

    Short Papers, Workshops and Posters

  • HiPC'21 :
  • Accelerating checkpoint/restart with lossy methods (Poster at Student Research Symposium) - Kevser Ildes, Athanasios Kastoras, Kai Keller, Leonardo Bautista-Gomez, 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC'21), Bangalore, India.
  • FTXS'21 :
  • Accelerating checkpoint/restart with lossy methods (Extended Abstract) - Kevser Ildes, Athanasios Kastoras, Kai Keller, Leonardo Bautista-Gomez, 2021 IEEE/ACM 11th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS'21), St. Louis, USA.
  • BRAINS'21 :
  • Discovering the Ethereum2 P2P Network (Poster) - Mikel Cortes-Goicoechea, Leonardo Bautista-Gomez, The 3rd IEEE Conference on Blockchain Research and Applications for Innovative Networks and Services (BRAINS'21), Paris, France.
  • DATE'20 :
  • LEGaTO: Low-Energy, Secure, and ResilientToolset for Heterogeneous Computing (Workshop Paper) - Behzad Salami, Konstantinos Parasyris, Adrian Cristal, Osman Unsal, Xavier Martorell, Paul Carpenter, Raul De La Cruz, Leonardo Bautista, Daniel Jimenez, Carlos Alvarez, Saber Nabavi, Sergi Madonar, Miquel Pericàs, Pedro Trancoso, Mustafa Abduljabbar, Jing Chen, Pirah Noor Soomro, Madhavan Manivannan, Micha von dem Berge, Stefan Krupop, Frank Klawonn, Amani Mihklafi, Sigrun May, Tobias Becker, Georgi Gaydadjiev, Hans Salomonsson, Devdatt Dubhashi, Oron Port, Yoav Etsion, Le Quoc Do, Christof Fetzer, Martin Kaiser, Nils Kucza, Jens Hagemeyer, René Griessl, Lennart Tigges, Kevin Mika, Arne Hüffmeier, Marcelo Pasin, Valerio Schiavoni, Isabelly Rocha, Christia , Design, Automation and Test in Europe Conference 2020, European Projects Track (DATE'20), Grenoble, France.
  • PDML'19 :
  • Accelerating Hyperparameter Optimisation with PyCOMPSs (Workshop Paper) - Albert Njoroge Kahira, Leonardo Bautista Gomez, Javier Conejero, and Rosa M. Badia, 1st Workshop on Parallel and Distributed Machine Learning 2019 (PDML'19), Kyoto, Japan.
  • FTXS'18 :
  • Towards Ad Hoc Recovery For Soft Errors (Workshop Paper) - Nuria Losada, Leonardo Bautista-Gomez, Kai Keller, Osman Unsal, 2018 IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS'18), Dallas, USA.
  • PMBS'18 :
  • Approximating a Multi-Grid Solver (Workshop Paper) - Valentin Le Fevre, Leonardo Bautista-Gomez, Osman Unsal, Marc Casas, 2018 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS'18), Dallas, USA.
  • MCHPC'18 :
  • On the Applicability of PEBS based Online Memory Access Tracking for Heterogeneous Memory Management at Scale - Aleix Roca Nonell, Balazs Gerofi, Leonardo Bautista-Gomez, Dominique Martinet, Vicenç Beltran Querol, Yutaka Ishikawa, Proceedings of the Workshop on Memory Centric High Performance Computing, (MCHPC'18), Dallas, USA.
  • WOPSSS'18 :
  • Performance study of non-volatile memories on a high-end supercomputer (Workshop Paper) - Leonardo Bautista-Gomez, Kai Keller, Osman Unsal, Workshop on Performance and Scalability of Storage Systems 2018 (WOPSSS'18), Frankfurt, Germany.
  • ATCET'18 :
  • Training Deep Neural Networks with Low Precision Input Data: A Hurricane Prediction Case Study (Workshop Paper) - Albert Kahira, Leonardo Bautista-Gomez, Rosa M. Badia, Workshop on Approximate and Transprecision Computing on Emerging Technologies 2018 (ATCET'18), Frankfurt, Germany.
  • SC'16 :
  • Software-Level Fault Tolerant Framework for Task-Based Applications (Poster) - Joy Yeh, Grzegorz Pawelczak, James Sewart, James Price, Ferad Zyulkyarov, Leonardo Bautista-Gomez, Osman Unsal, Simon McIntosh-Smith, International Conference for High Performance Computing, Networking, Storage, and Analysis 2016 (SC'16), Salt Lake City, UT, USA.
  • FTS'15 :
  • Detecting and Correcting Data Corruption in Stencil Applications through Multivariate Interpolation (Workshop Paper) - Leonardo Bautista-Gomez, Franck Cappello, International Workshop on Fault Tolerant Systems (FTS'15), Chicago, IL, USA.
  • HPDC'15 :
  • Lightweight Silent Data Corruption Detection Based on Runtime Data Analysis for HPC Applications (Short Paper) - Eduardo Berrocal, Leonardo Bautista-Gomez, Sheng Di, Zhiling Lan, Franck Cappello, International Symposium on High-Performance Parallel and Distributed Computing (HPDC'15), Portland, OR, USA. (Acceptance rate 27%)
  • SC'14 :
  • Towards Effective Detection of Silent Data Errors for HPC Applications (Poster) - Sheng Di, Eduardo Berrocal, Katherine Heisey, Leonardo Bautista-Gomez, Rinku Gupta, Franck Cappello, International Conference for High Performance Computing, Networking, Storage, and Analysis 2014 (SC'14), New Orleans, LU, USA. (Acceptance rate 39%)
  • PMBS'14 :
  • Analysis of the Tradeoffs between Energy and Run Time for Multilevel Checkpointing (Workshop Paper) - Prasanna Balaprakash, Leonardo A. Bautista Gomez, Mohamed-Slim Bouguerra, Stefan M. Wild, Franck Cappello and Paul D. Hovland, International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems 2014 (PMBS14), New Orleans, LA, USA. (Acceptance rate 28%)
  • Cluster'14 :
  • Energy-Performance Tradeoffs in Multilevel Checkpoint Strategies (Poster) - Leonardo A. Bautista Gomez, Prasanna Balaprakash, Mohamed-Slim Bouguerra, Stefan M. Wild, Franck Cappello and Paul D. Hovland, IEEE Cluster conference 2014 (Cluster'14), Madrid, SPAIN.
  • DATE'14 :
  • GPGPUs: How to Combine High Computational Power with High Reliability (Embedded Tutorial) - Leonardo Bautista-Gomez, Franck Cappello, Luigi Carro, Nathan DeBardeleben, Bo Fang, Sudhanva Gurumurthi, Karthik Pattabiraman, Paolo Rech, Design, Automation & Test in Europe (DATE'14), Dresden, Germany.
  • PPoPP'14 :
  • Detecting Silent Data Corruption through Data Dynamic Monitoring for Scientific Applications (Poster) - Leonardo Bautista-Gomez, Franck Cappello, 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming 2014 (PPoPP'14), Orlando, FL, USA. (Acceptance rate 26.2%)
  • BigData'13 :
  • Improving Floating Point Compression through Binary Masks (Short Paper) - Leonardo Bautista-Gomez, Franck Cappello, IEEE International Conference on Big Data 2013 (IEEE-BigData'13), Santa Clara, CA, USA. (Acceptance rate 38.2%)

    Student Supervision and Mentoring

  • Mikel Cortes Goicoechea :
  • PhD Supervisor of Mikel Cortes Goicoechea on his PhD on P2P network Analysis for next generation Blockchains at Polytechnical University of Catalunya, Spain. (Oct. 2021 - Ongoing)
  • Kai Keller :
  • PhD Supervisor of Kai Keller on his PhD on Resilience for Data Assimilation at Extreme Scale at Polytechnical University of Catalunya, Spain. (Oct. 2019 - Ongoing)
  • Marc Perello :
  • Supervising Marc Perello during his internship on fault tolerance for FPGAs and adaptation for deep learning frameworks at BSC, Spain. (Oct. 2019 - Ongoing)
  • Kevser Ildes :
  • Mentor of Kevser Ildes during the PRACE Summer of HPC on lossy techniques for checkpoint/restart at BSC, Spain. (Jul. 2021 - Aug. 2021)
  • Athanasios Kastoras :
  • Mentor of Athanasios Kastoras during the PRACE Summer of HPC on lossy techniques for checkpoin/restart at BSC, Spain. (Jul. 2021 - Aug. 2021)
  • Jakub Raczynski :
  • Mentor of Jakub Raczynski during the PRACE Summer of HPC on distributed model parallelism for deep learning at BSC, Spain. (Jul. 2021 - Aug. 2021)
  • Mehmet Erciyes :
  • Mentor of Mehmet Erciyes during the PRACE Summer of HPC on distributed model parallelism for deep learning at BSC, Spain. (Jul. 2021 - Aug. 2021)
  • Albert Kahira :
  • PhD Supervisor of Albert Kahira on his PhD on Resilience for Machine Learning Workflows at Polytechnical University of Catalunya, Spain. (Nov. 2017 - Jul. 2021)
  • Mikel Cortes Goicoechea :
  • Supervising Mikel Cortes Goicoechea (Graduate Student at Pais Vasco University, during his internship on Blockchain scalability at BSC, Spain. (Aug. 2020 - Jun. 2021)
  • Luca Franceschini :
  • Supervising Luca Franceschini (Graduate Student at the Polytechnical University of Catalunya, during his internship on Blockchain scalability at BSC, Spain. (Sep. 2020 - Nov. 2020)
  • Elvis Rojas Ramirez :
  • Supervising Elvis Rojas Ramirez (PhD Student at Instituto de Technologia de Costa Rica), during his internship on resilience for Deep Learning at BSC, Spain. (Jan. 2020 - Mar. 2020)
  • Dr Konstantinos Parasyris :
  • Supervising Dr Konstantinos Parasyris during his Postdoc on Resilience for low-power high-performance accelerators at BSC, Spain. (Nov. 2018 - Jan. 2020)
  • Pak Markthub :
  • Supervising Pak Markthub (PhD Student at Tokyo Institute of Technology), during his internship on Streaming GPU Checkpointing at BSC, Spain. (May 2018 - Jun. 2018)
  • Dr Oguz Kaya :
  • Supervising Dr Oguz Kaya (PostDoctoral Appointee at INRIA), during his research visit on Approximating Tensors at BSC, Spain. (May 2018 - Jun. 2018)
  • Max Baird :
  • Supervising Max Baird (PhD Student at Heriot-Watt University), during his internship on Intra-kernel GPU checkpointing at BSC, Spain. (Apr. 2018 - May 2018)
  • Nuria Losada :
  • Supervising Nuria Losada (PhD Student at University A Coruna), during her internship on Complementing FTI with ABFT techniques at BSC, Spain. (Oct. 2017 - Nov. 2017)
  • Maxime Kermarker :
  • Supervising Maxime Kermarker (Graduate Student), during his internship on the MPC implementation of FTI at BSC, Spain. (Jun. 2017 - Jul. 2017)
  • Valentin Lefevre :
  • Supervising Valentin Lefevre (PhD Student at ENS Lyon), during his internship on Approximate Computing for Multigrid Methods at BSC, Spain. (Apr. 2017 - May 2017)
  • Rodrigo Arias :
  • Supervising Rodrigo Arias (Graduate Student), during his internship on Approximate Computing for Principal Component Analysis at BSC, Spain. (Jan. 2017 - Apr. 2017)
  • Albert Kahira :
  • Supervising Albert Kahira (Graduate Student), during his internship on Resilience of Neural Networks at BSC, Spain. (Jul. 2016 - Aug. 2016)
  • Mohamed Gaalich :
  • Supervising Mohamed Gaalich (Graduate Student), during his internship on Adapting FTI to multiple I/O formats at BSC, Spain. (Jun. 2016 - Jul. 2016)
  • Omer Subasi :
  • Supervising Omer Subasi (PhD student at Barcelona Supercomputing Center), during his internship on Evaluating Support Vector Machines for HPC Resilience at ANL, USA. (Aug. 2015 - Nov. 2015)
  • Eduardo Berrocal :
  • Supervising Eduardo Berrocal (PhD student at Illinois Institute of Technology), during his internship on Spatio-temporal Analysis for Silent Error detection at ANL, USA. (May 2015 - Aug. 2015)
  • Javier Iparraguirre :
  • Mentoring Javier Iparraguirre during the SC14 Broader Engagement program at New Orleans, USA. (Nov. 2014 - Nov. 2014)
  • Eduardo Berrocal :
  • Supervising Eduardo Berrocal (PhD student at Illinois Institute of Technology), during his internship in Online Data Analytics for Silent Error detection at ANL, USA. (May 2014 - Sep. 2014)
  • Adele Villiermet :
  • Supervising Adele Villiermet (Graduate Student), during her internship on Multilevel Checkpointing for Large Scale Application at CINES, France. (Jun. 2013 - Aug. 2013)

    Teaching, Tutorials, Summer Schools and Hackathons

  • Blockchain Class :
  • Master course on Sharding and Proof-of-Stake for Blockchain Scalability (Eth2) at the UPC, Barcelona, Spain. (Apr. 2021)
  • Blockchain Class :
  • Master course on Sharding for Blockchain Scalability (Ethereum Sharding) at the UPC, Barcelona, Spain. (Apr. 2020)
  • Tutorial at CCGrid'19 :
  • Multilevel Checkpointing for Extreme Scale Applications tutorial at CCGrid'19, Larnaca, Cyprus. (May 2019)
  • Blockchain Class :
  • Master course on Sharding for Blockchain Scalability (Ethereum Sharding) at the UPC, Barcelona, Spain. (Apr. 2019)
  • Tutorial at HiPEAC'19 :
  • Multilevel Checkpointing for Extreme Scale Applications tutorial at HiPEAC'19, Valencia, Spain. (Jan. 2019)
  • 3rd BSC Hackathon :
  • Organizer of the Hackathon on Parallel Programming, GPU Computing and Software Security at BSC, Barcelona Spain. (Dec. 2018)
  • Blockchain Class :
  • Postgraduate course on Sharding for Blockchain Scalability (Ethereum Sharding) at the UPC, Barcelona, Spain. (Jun. 2018)
  • Tutorial at HiPEAC'18 :
  • Multilevel Checkpointing for Extreme Scale Applications tutorial at HiPEAC'18, Manchester, UK. (Jan. 2018)
  • 2nd BSC Hackathon :
  • Organizer of the Hackathon on Parallel Programming, GPU Computing and Software Security at BSC, Barcelona Spain. (Nov. 2017)
  • NESUS Winter School :
  • Multilevel Checkpointing for large scale applications tutorial at the Resilience Winter School, Calabria, Italy. (Jan. 2017)
  • 1st BSC Hackathon :
  • Organizer of the Hackathon on Parallel Programming, GPU Computing and Software Security at BSC, Barcelona Spain. (Oct. 2016)
  • JLESC Summer School :
  • Resilience Summer School (Checkpointing and fault tolerance at large scale) at the 5th JLESC Workshop, Lyon, France. (Jun. 2016)
  • Tutorial at CCGrid'16 :
  • Multilevel checkpointing for large scale applications Tutorial at CCGrid'2016, Cartagena, Colombia. (May 2016)
  • MontBlanc 2 Tutorial :
  • Multilevel Checkpointing for large scale applications tutorial at the Mont-Blanc 2 Resilience school at BSC, Barcelona Spain. (Mar. 2016)
  • Instructor at UIUC :
  • Instructor of the CS 498 class, (Hot Topics in HPC - Fault Tolerance and Checkpointing) at UIUC, Urbana-Champaign, USA. (Apr. 2011)
  • Private teacher :
  • Private teacher in sciences (Mathematics, Physics, Quemistry) for high school students, Acadomia, Paris, France. (May 2007 - Mar. 2009)

    Outreach Activities and Public Dissemination

  • Altraradio Divulgation :
  • Once a month I present a topic about computing technology on a program on the Altraradio Radio station, Spain. (2020-2021)
  • Reshaping Science :
  • Presenting the talk "Blockchain and applications" at the Reshaping Science event organized by the Societat Catalana Nanociència i Nanotecnologia Barcelona, Spain. (Oct. 2021)
  • Magazine "Physics World" :
  • Explain about errors in supercomputer induced by cosmic rays on the article "Cosmic challenge: protecting supercomputers from an extraterrestrial threat". (Jul. 2021)
  • Newspaper "El Periodico" :
  • Explain about cryptocurrency energy consumption on newspaper article, El Periodico "Bitcóin y las criptomonedas consumen más energía que países enteros", Spain. (Mar. 2021)
  • Impact Africa Network :
  • Presenting the talk "Blockchain : Virtual Chains for real Freedom" at the Impact Africa Network from Kenya, Virtual. (Mar. 2021)
  • ANL Postdoc Symposium :
  • Participation at the Argonne National Laboratory Postdoc Symposium Academic Career Panel, Virtual. (Nov. 2020)
  • Open Barcelona 2019 :
  • Present and explain the supercomputer Marenostrum 4 during the Open Barcelona festival, Barcelona, Spain. (Oct. 2019)
  • Science Festival 2019 :
  • Presenting the talk "Blockchain : Virtual Chains for real Freedom" at the Science Festival, Barcelona, Spain. (Jun. 2019)
  • Pint of Science 2019 :
  • Presenting the talk "Blockchain : Virtual chains for real freedom" at the Pint of Science, Barcelona, Spain. (May. 2019)
  • Devcentralised Meetup :
  • Presenting the talk "Learning about Ethereum Sharding" at the Devcentralised Meetup, Barcelona, Spain. (Nov. 2018)
  • Open Barcelona 2018 :
  • Present and explain the supercomputer Marenostrum 4 during the Open Barcelona festival, Barcelona, Spain. (Oct. 2018)
  • EuroMPI 2018 :
  • Present and explain the supercomputer Marenostrum 4 during the EuroMPI Conference, Barcelona, Spain. (Sep. 2018)
  • CONACyT Symposium 2018 :
  • Presenting the talk "How to get rich by doing a PhD" at the CONACyT-Catalunya Symposium, Barcelona, Spain. (Jun. 2018)
  • Science Festival 2018 :
  • Presenting the talk "What happens when supercomputers get sick" at the Science Festival, Barcelona, Spain. (Jun. 2018)
  • Hipeac PhD Symposium :
  • Presenting the talk "How to get rich by doing a PhD" at the HiPEAC PhD Symposium, Gottenburg, Sweden. (May. 2018)
  • Pint of Science 2018 :
  • Presenting the talk "What happens when supercomputers get sick" at the Pint of Science, Barcelona, Spain. (May. 2018)
  • BSC PhD Symposium :
  • Presenting the talk "How to get rich by doing a PhD" at the PhD Symposium at BSC, Barcelona, Spain. (May. 2018)
  • BSC Career Day :
  • Presenting the talk "How to get rich by doing a PhD" at the Career Day at BSC, Barcelona, Spain. (Feb. 2018)
  • Open Barcelona 2017 :
  • Present and explain the supercomputer Marenostrum 4 during the Open Barcelona festival, Barcelona, Spain. (Oct. 2017)
  • Centroamerica University :
  • Presenting the talk "Supercomputers and high-performance Computing" to Undergraduate Students at Centroamerica University, Managua, Nicaragua. (Aug. 2015)
  • "Esta Noche" TV show :
  • Talking about Supercomputers and high-performance Computing on the TV show "Esta Noche", Nicaragua. (Aug. 2015)

    Chairmanships and Committees

  • Tutorials Chair :
  • ACM Supercomputing Conference (SC'20)
  • Local Chair :
  • International Conference in Supercomputing (ICS'20)
  • Local Chair :
  • Field Programmable Logic & Applications (FPL'19)
  • Local Chair :
  • EuroMPI Conference (EuroMPI'18)
  • Program Chair :
  • Workshop on Fault-Tolerance Systems (FTS'17)
  • Web Chair :
  • IEEE International Conference on Cluster Computing (Cluster'17)
  • Co-Editor :
  • Journal on Micromachines (Special Issue on Machine Learning for System Diagnosis)
  • Guest Editor :
  • Journal Concurrency and Computation (Special Issue)
  • Organizing Committee :
  • Workshop on High Performance Machine Learning (HPML'19) (HPML'20)
  • Organizing Committee :
  • Workshop on Dependable and Resilient Many-Core and Exascale Computing (DRMEC 2019)
  • BOF Committee :
  • ACM Supercomputing Conference (SC'17), (SC'19)
  • Workshops Committee :
  • ACM Supercomputing Conference (SC'21)
  • Posters Committee :
  • ACM Supercomputing Conference (SC'18)
  • Program Committee :
  • IEEE International Conference on Cluster Computing (CLUSTER'17) (CLUSTER'18) (CLUSTER'21)
  • Program Committee :
  • Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems 2021 (CHEOPS'21)
  • Program Committee :
  • International Workshop on OpenCL and SYCL (IWOCL-SYCL'21)
  • Program Committee :
  • IEEE International Parallel & Distributed Processing Symposium (IPDPS'18) (IPDPS'19) (IPDPS'20) (IPDPS'22)
  • Program Committee :
  • Workshop On Performance and Scalability of Storage Systems (WOPSSS'20)
  • Program Committee :
  • International Workshop on openCL (IWOCL'17), (IWOCL'18), (IWOCL'19), (IWOCL'20)
  • Program Committee :
  • Workshop on Fault-Tolerance for HPC at Extreme Scale (FTXS'13), (FTXS'14), (FTXS'15), (FTXS'16), (FTXS'17), (FTXS'18), (FTXS'19)
  • Program Committee :
  • Workshop on Fault-Tolerance Systems (FTS'15), (FTS'16), (FTS'17), (FTS'18)
  • Program Committee :
  • Colombia Computing Congress (CCC'16), (CCC'17), (CCC'18)
  • Program Committee :
  • International Conference on Bioinspired Intelligence, HPC for natural and health sciences (IWOBI'18)
  • Program Committee :
  • Latinamerican Conference on High Performance Computing (CARLA'18) (CARLA'19)
  • Program Committee :
  • IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID'16), (CCGRID'17)
  • Program Committee :
  • International Workshop on Resilience and/or Energy-aware techniques for High-Performance Computing (RE-HPC'16).
  • Technical Referee :
  • Journal of Signal Processing Systems
  • Technical Referee :
  • International Journal of Computational Fluid Dynamics
  • Technical Referee :
  • International Journal on Cluster Computing
  • Technical Referee :
  • International Journal on Parallel Computing (PARCO)
  • Technical Referee :
  • International Journal IEEE Transactions on Parallel and Distributed Systems (TPDS)
  • Technical Referee :
  • International Journal Transactions on Architecture and Code Optimization (TACO), ACM Society
  • Technical Referee :
  • International Journal Transactions on Cloud Computing (TCC), IEEE Computer Society

    International Mobility and Internships

  • Research Visit :
  • Research visit on Deep Learning Scalability at Tokyo Institute of Technology, Tokyo, Japan. (Jul. 2019 - Aug. 2019)
  • Research Visit :
  • Research visit on Approximate Computing and Machine Learning at Tokyo Institute of Technology, Tokyo, Japan. (Jul. 2018 - Aug. 2018)
  • PhD. Internship :
  • Internship in Erasure Codes (Reed-Solomon) for Cloud Computing and Distributed Storage at NCSA/UIUC, Urbana, USA. (Sep. 2011 - Dec. 2011)
  • PhD. Internship :
  • Internship in Reliable Clustering for Petascale Computing, trade-off between resilience and performance of communications at LRI, Orsay, France. (Jun. 2011 - Sep. 2011)
  • PhD. Internship :
  • Internship in Scalable Multilevel Checkpointing for HPC Applications at NCSA/UIUC, Urbana-Chapaign, USA. (Mar. 2011 - Jun. 2011)
  • Master Internship :
  • Internship in Diskless Checkpointing for scientific codes running in Supercomputers at INRIA/Tokyo Institute of Technology, Tokyo, Japan. (Apr. 2009 - Sep. 2009)
  • Master Internship :
  • Internship in 3D-visualization of internet traffic and internet attacks (i.e., DoS Attacks) at the NII, Tokyo, Japan. (Jun. 2008 - Sep 2008)
  • Master Internship :
  • Internship in distributed model checking for formal verification of critical codes at the LIP6, Paris, France. (Mar. 2008 - May 2008)

    Other Presentations

  • February 2021 :
  • Invited Sucess Story Talk 12th Workshop of the JLESC, Knoxville, USA.
  • April 2019 :
  • 9th Workshop of the JLESC, Knoxville, USA.
  • August 2018 :
  • Invited talk at the Tokyo Institute of Technology
  • July 2018 :
  • Panel at the PASC Conference, Basel, Switzerland.
  • July 2018 :
  • Invited talk at Ethereum Sharding Workshop, Berlin, Germany.
  • April 2018 :
  • 8th Workshop of the JLESC, Barcelona, Spain.
  • November 2017 :
  • Invited talk at EoCoE Face to Face Meeting, Toulouse, France.
  • August 2017 :
  • Invited talk at RIKEN, Tokyo, Japan.
  • August 2017 :
  • Invited talk at Tokyo Institute of Technology, Tokyo, Japan.
  • July 2017 :
  • 7th Workshop of the JLESC, Urbana-Champaign, USA.
  • December 2016 :
  • 6th Workshop of the JLESC, Kobe, Japan.
  • October 2016 :
  • Invited talk at University Carlos III of Madrid, Madrid, Spain.
  • June 2016 :
  • 5th Workshop of the JLESC, Lyon, France.
  • December 2015 :
  • 4th Workshop of the JLESC, Bonn, Germany.
  • July 2015 :
  • 3rd Workshop of the JLESC, Barcelona, Spain.
  • February 2015 :
  • HPC knowledge meeting, Barcelone, Spain.
  • November 2014 :
  • 2nd Workshop of the JLESC, Chicago, USA.
  • October 2014 :
  • Invited talk at CEA, Paris, France.
  • September 2014 :
  • Resilience at Exascale Seminar, Dagstuhl, Germany.
  • June 2014 :
  • 1st Workshop of the JLESC, Sophia Antipolis, France.
  • March 2014 :
  • Invited talk at the Tokyo Institute of Technology, Tokyo, Japan.
  • March 2014 :
  • 6th G8 Enabling Climate Simulations Workshop, Kobe, Japan.
  • November 2013 :
  • 10th INRIA-NCSA Workshop of the JLPC, Urbana-Champaign, USA.
  • November 2013 :
  • Emerging Technologies, Supercomputing Conference 2013, Denver, USA
  • April 2013 :
  • Invited talk at the University of Chicago, Chicago, USA.
  • November 2012 :
  • 3rd G8 Enabling Climate Simulations Workshop, Salt Lake City, USA.
  • Mars 2012 :
  • 2nd G8 Enabling Climate Simulations Workshop, Aachen, Germany.
  • November 2011 :
  • 6th INRIA-NCSA Workshop of the JLPC, Urbana-Champaign, USA.
  • June 2011 :
  • 5th INRIA-NCSA Workshop of the JLPC, Grenoble, France.
  • February 2011 :
  • Paris 11 University, LRI Parallel Seminar, France.
  • November 2010 :
  • 4th INRIA-NCSA Workshop of the JLPC, Urbana-Champaign, USA.
  • June 2010 :
  • 3rd INRIA-NCSA Workshop of the JLPC, Bordeaux, France.
  • June 2010 :
  • Paris 11 University, LRI Parallel Seminar, France.
  • February 2010 :
  • Paris 11 University, LRI Parallel Seminar, France.
  • December 2009 :
  • 2nd INRIA-NCSA Workshop of the JLPC, Urbana-Champaign, USA.