ResearchSpace

Earth observation scientific workflows in a distributed computing environment

Show simple item record

dc.contributor.author Van Zyl, TL
dc.contributor.author McFerren, G
dc.contributor.author Vahed, Anwar
dc.date.accessioned 2012-01-05T09:33:23Z
dc.date.available 2012-01-05T09:33:23Z
dc.date.issued 2011-09
dc.identifier.citation Van Zyl, TL, McFerren, G and Vahed, A. 2011. Earth observation scientific workflows in a distributed computing environment. Free and Open Source Software for Geospatial (FOSS4G), Denver, Colorado, USA, 12-16 September 2011 en_US
dc.identifier.uri http://hdl.handle.net/10204/5435
dc.description Free and Open Source Software for Geospatial (FOSS4G), Denver, Colorado, USA, 12-16 September 2011 en_US
dc.description.abstract Geospatially Enabled Scientific Workflows offer a promising paradigm to facilitate researchers, in the earth observation domain, with many aspects of the scientific process. One such aspect is that of access to distributed earth observation data and computing resources. Earth observation research often utilises large datasets requiring extensive CPU and memory resources in their processing. These resource intensive processes can be chained; the sequence of processes (and their provenance) makes up a scientific workflow. Despite the exponential growth in capacity of desktop computing, resources available on such devices are often insufficient for the scientific workflow processing tasks at hand. By integrating distributed computing capabilities into a geospatially enabled scientific workflow environment, it is possible to provide researchers with a mechanism to overcome the limitations of the desktop computer. The majority of effort in regard to extending scientific workflows with distributed computing capabilities has focused on the web services approach as exemplified by the OGC's Web Processing Service and by GRID computing. The approach to leveraging distributed computing resources described in this paper uses instead remote objects via RPyC and the dynamic properties of the Python programming language. The Vistrails (http://www.vistrails.org) environment has been extended to allow for geospatial processing through the EO4Vistrails package (http://code.google.com/p/eo4vistrails/). In order to allow these geospatial processes to be seamlessly executed on distributed resources such as cloud computing nodes, the Vistrails environment has been extended with both multi-tasking capabilities and distributed processing capabilities. en_US
dc.language.iso en en_US
dc.relation.ispartofseries Workflow request;7727
dc.subject Earth observation en_US
dc.subject Earth observation data en_US
dc.subject Scientific workflows en_US
dc.subject Free open source software en_US
dc.subject FOSS4G en_US
dc.subject Geospatial en_US
dc.subject Computing environment en_US
dc.title Earth observation scientific workflows in a distributed computing environment en_US
dc.type Conference Presentation en_US
dc.identifier.apacitation Van Zyl, T., McFerren, G., & Vahed, A. (2011). Earth observation scientific workflows in a distributed computing environment. http://hdl.handle.net/10204/5435 en_ZA
dc.identifier.chicagocitation Van Zyl, TL, G McFerren, and Anwar Vahed. "Earth observation scientific workflows in a distributed computing environment." (2011): http://hdl.handle.net/10204/5435 en_ZA
dc.identifier.vancouvercitation Van Zyl T, McFerren G, Vahed A, Earth observation scientific workflows in a distributed computing environment; 2011. http://hdl.handle.net/10204/5435 . en_ZA
dc.identifier.ris TY - Conference Presentation AU - Van Zyl, TL AU - McFerren, G AU - Vahed, Anwar AB - Geospatially Enabled Scientific Workflows offer a promising paradigm to facilitate researchers, in the earth observation domain, with many aspects of the scientific process. One such aspect is that of access to distributed earth observation data and computing resources. Earth observation research often utilises large datasets requiring extensive CPU and memory resources in their processing. These resource intensive processes can be chained; the sequence of processes (and their provenance) makes up a scientific workflow. Despite the exponential growth in capacity of desktop computing, resources available on such devices are often insufficient for the scientific workflow processing tasks at hand. By integrating distributed computing capabilities into a geospatially enabled scientific workflow environment, it is possible to provide researchers with a mechanism to overcome the limitations of the desktop computer. The majority of effort in regard to extending scientific workflows with distributed computing capabilities has focused on the web services approach as exemplified by the OGC's Web Processing Service and by GRID computing. The approach to leveraging distributed computing resources described in this paper uses instead remote objects via RPyC and the dynamic properties of the Python programming language. The Vistrails (http://www.vistrails.org) environment has been extended to allow for geospatial processing through the EO4Vistrails package (http://code.google.com/p/eo4vistrails/). In order to allow these geospatial processes to be seamlessly executed on distributed resources such as cloud computing nodes, the Vistrails environment has been extended with both multi-tasking capabilities and distributed processing capabilities. DA - 2011-09 DB - ResearchSpace DP - CSIR KW - Earth observation KW - Earth observation data KW - Scientific workflows KW - Free open source software KW - FOSS4G KW - Geospatial KW - Computing environment LK - https://researchspace.csir.co.za PY - 2011 T1 - Earth observation scientific workflows in a distributed computing environment TI - Earth observation scientific workflows in a distributed computing environment UR - http://hdl.handle.net/10204/5435 ER - en_ZA


Files in this item

This item appears in the following Collection(s)

Show simple item record