Checkpoint Systems

Results: 35



#Item
1

Employing Checkpoint to Improve Job Scheduling in Large-Scale Systems Shuangcheng Niu1 , Jidong Zhai1 , Xiaosong Ma2 , Mingliang Liu1 , Yan Zhai1 , Wenguang Chen1 , and Weimin Zheng1 2

Add to Reading List

Source URL: hpc.cs.tsinghua.edu.cn

Language: English - Date: 2017-05-18 10:22:13
    2

    Transparent Checkpoint of Closed Distributed Systems in Emulab Anton Burtsev, Prashanth Radhakrishnan, Mike Hibler, and Jay Lepreau University of Utah, School of Computing

    Add to Reading List

    Source URL: eurosys2009.informatik.uni-erlangen.de

    Language: English - Date: 2009-04-24 00:38:10
      3

      The LAM/MPI Checkpoint/Restart Framework: System-Initiated Checkpointing Sriram Sankaran, Jeffrey M. Squyres, Brian Barrett, Andrew Lumsdaine Open Systems Laboratory, Indiana University ssankara,jsquyres,brbarret,lums @l

      Add to Reading List

      Source URL: crd.lbl.gov

      - Date: 2012-09-05 12:42:01
        4Computing / Computer programming / Software / Fault-tolerant computer systems / Parallel computing / Application checkpointing / Debuggers / Debugging / Checkpoint / VMware / Computer cluster

        Kapil Arya • • http://www.ccs.neu.edu/home/kapil Academics Northeastern University, Boston, MA May 2014

        Add to Reading List

        Source URL: www.ccs.neu.edu

        Language: English - Date: 2015-11-17 02:37:33
        5Computing / Computer architecture / Fault-tolerant computer systems / Software / Concurrent computing / Parallel computing / Application checkpointing / Computer cluster / Linux kernel / Checkpoint / Thread / X86-64

        An Overview of Berkeley Lab Checkpoint/Restart (BLCR) for Linux Clusters Paul Hargrove with Jason Duell and Eric Roman http://ftg.lbl.gov/checkpoint

        Add to Reading List

        Source URL: crd.lbl.gov

        Language: English - Date: 2012-09-05 12:41:12
        6Fault-tolerant computer systems / Application checkpointing / Process identifier / Cloud computing / Thread / Kernel

        DMTCP for Checkpoint-Restart: its Past, Present and Future Gene Cooperman College of Computer and Information Sciences Northeastern University

        Add to Reading List

        Source URL: www.ccs.neu.edu

        Language: English - Date: 2013-11-20 07:54:54
        7Computing / Computer architecture / Data management / Network file systems / Computer storage devices / Lustre / RAID / Clustered file system / File system / Storage area network / Computer data storage / Data corruption

        Zest Checkpoint Storage System for Large Supercomputers Paul Nowoczynski, Nathan Stone, Jared Yanovich, Jason Sommerfield Pittsburgh Supercomputing Center Pittsburgh, PA USA

        Add to Reading List

        Source URL: psc.edu

        Language: English - Date: 2014-04-21 17:04:39
        8Fault-tolerant computer systems / Application checkpointing / Concurrent computing / Parallel computing / Checkpoint / Kernel / Scheduling / Linux kernel / Thread / Computer cluster / Operating system / Job scheduler

        Microsoft PowerPoint - WTTC2008-BKK

        Add to Reading List

        Source URL: crd.lbl.gov

        Language: English - Date: 2012-09-05 12:41:12
        9Checkpoint / Cell cycle checkpoint / Algorithms for Recovery and Isolation Exploiting Semantics / MVAPICH / Fault-tolerant computer systems / Application checkpointing

        Fault-­‐Tolerance  Support  in  MVAPICH2   MVAPICH2  User  Group  (MUG)  MeeDng     by  

        Add to Reading List

        Source URL: mug.mvapich.cse.ohio-state.edu

        Language: English - Date: 2015-11-20 13:17:45
        10Fault-tolerant computer systems / Concurrent computing / Application checkpointing / Parallel computing / Computer cluster / Checkpoint / Thread / Single system image

        DMTCP Transparent Checkpointing for Cluster Computations and the Desktop Jason Ansel1 Kapil Arya2

        Add to Reading List

        Source URL: dmtcp.sourceforge.net

        Language: English - Date: 2009-05-30 11:44:49
        UPDATE