Professional Documents
Culture Documents
I.
Fault
tolerance;
Load
INTRODUCTION
$ env LD_PRELOAD=
executable
BLCR_LIBDIR/libcr_run.so.0
During this restore process slave will not use the same process
id specified into the context file but it will create new process
when restarting the process. If it is require that process id on
slave node is same as the process id on master node then
checkpoint needs to create on Master Node with -- no restorepid option.
Restart of the checkpoint process at slave node is done using
following command.
$ cr_restart context.15005
With above command slave can run the context file as many
times as slave wants to execute it but requires that once it
starts execution using cr_restart, it must completed and only
then after slave can restart the same context file It is possible
to execute any of the above command by writing code snippet
given below into Slave node file which is written using JAVA
programming language.
String SlaveCmd=;
SlaveCmd = " cr_restart Context.01"
Process p1 = Runtime.getRuntime().exec(SlaveCmd);
E. Steps of proposed approach
1) Start master node first and then calculate load of master
node using Load calculator module.
2) If load of master node exceeds some predefined
threshold limit then user selects some processes which
are makes master node highly loaded.
3) Master node check points all those process which are
selected by user using check pointer module.
4) Master node start SLAVE IDENTFYING module to
find total number of slave on LAN to accept migrating
processes.
5) Once total number of slave nodes are found then Master
starts its PROCESS DISTRIBUTION module to
equally distribute all checkpointed processes.
6) Repeat above procedure (Step 1 to 5) periodically.
7) Each of slave node accept process checkpoint file and
restart migrated process using Restarter module and
then send partial computed result to master node.
IV.
RESULTS OF EXPERIMENT
V.
CONCLUSION
FUTURE WORK
[3]
[4]
[5]