
      A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems

      The Journal of Supercomputing
      Springer Nature


          Most cited references (34)


          Distributed snapshots: determining global states of distributed systems


            A survey of rollback-recovery protocols in message-passing systems


              System structure for software fault tolerance


                Author and article information

                Journal: The Journal of Supercomputing (J Supercomput)
                Publisher: Springer Nature
                ISSN: 0920-8542, 1573-0484
                September 2013
                February 12 2013
                September 2013
                Volume: 65
                Issue: 3
                Pages: 1302-1326
                Article
                DOI: 10.1007/s11227-013-0884-0
                © 2013