Sei la
ISSN 0103-9741
Monografias em Ciência da Computação n° 03/11
Processamento de dados Semânticos na Cloud: um estudo de caso com o Protein World Database
Carlos Juliano Moura Viana
Sérgio Lifschitz
Antonio Basílio de Miranda
Edward Hermann Haeusler
Departamento de Informática
PONTIFÍCIA UNIVERSIDADE CATÓLICA DO RIO DE JANEIRO
RUA MARQUÊS DE SÃO VICENTE, 225 - CEP 22453-900
RIO DE JANEIRO - BRASIL
Monografias em Ciência da Computação, No. 03/11
Editor: Prof. Carlos José Pereira de Lucena
ISSN: 0103-9741
Abril, 2011
Processamento de dados Semânticos: um estudo de caso com o Protein World Database
Carlos Juliano Moura Viana, Sérgio Lifschitz, Edward Hermann Haeusler e
Antonio Basílio de Miranda cviana@inf.puc-rio.br, sergio@inf.puc-rio.br, herman@inf.puc-rio.br, antonio@fiocruz.br
Abstract. the Semantic Web has not only brought many opportunities, but also many other challenges into the data management problem. For instance, biological researchers make their genome findings publicly available, but as much data on the web; those findings are unrelated, and difficult to integrate with other data. In this manner, semantic web technologies could offer possibilities such as automatically infer relationships, which in turn could help to cure diseases. However, as described by Hey et al., scientific data is being generated at exponentially growing rates which makes its processing even more resource consuming. In an effort to assist researchers, this work proposes to make semantic data processing in a flexible and scalable way in order to enable inference over available genomic data. This paper presents a hybrid cloud architecture used for processing and sharing large amounts of biological data while exploiting the MapReduce programming model, and semantic web technologies to enable inference over generated semantic genomic data and related data.
Keywords: Protein World DB, Biological Databases, Gene, Genome, Data Integration