Grid Fault Tolerance Service for Quality of Service
 
Hwa Min Lee, Kwang Sik Chung, Sung Ho Jin, Dae Won Lee, Soon Young Jung, and Heon Chang Yu
 
Department of Computer Science Education
Korea University
1, 5-Ka, Anam-dong, Sungbuk-ku, Seoul, Korea
{zelkova, wingtop, ldw1996, jsy, yuhc}@comedu.korea.ac.kr
 
 
Abstract
 
 
This paper proposes fault tolerance service to satisfy quality of service (QoS) requirement in grid computing. Since the failure of resources affects job execution fatally, fault tolerance service is essential in grid computing. And grid services are often expected to meet some minimum levels of QoS for desirable operation. In order to provide fault tolerance service and satisfy QoS requirements, we expand the definition of failure, such as process failure, processor failure, and network failure. And we propose fault detection service and fault management services.