Advanced applications are continuing to generate ever larger amounts of valuable data, but we are in danger of being unable to extract fully the latent knowledge within the data because of insufficient technology. The grid can play a significant role in providing an effective computational support for knowledge discovery applications. This paper describes design principles and a service-oriented software architecture of a novel infrastructure for distributed and highperformance data mining in grid environments. This architecture is designed and...