Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/46975

Registo completo
Campo DCValorIdioma
dc.contributor.authorCosta, Eduarda Alexandra Pintopor
dc.contributor.authorCosta, Carlos Filipe Machado Silvapor
dc.contributor.authorSantos, Maribel Yasminapor
dc.date.accessioned2017-11-02T15:14:20Z-
dc.date.issued2017-09-
dc.identifier.citationCosta, Eduarda, Carlos Costa and Maribel Yasmina Santos, “Efficient Big Data Modelling and Organization for Hadoop Hive-based Data Warehouses”, Proceedings of the European, Mediterranean and Middle Eastern Conference on Information Systems (EMCIS’2017), Coimbra, Portugal, 8-9 September, 2017. DOI: 10.1007/978-3-319-65930-5_1.por
dc.identifier.isbn978-3-319-65929-9-
dc.identifier.issn1865-1348por
dc.identifier.urihttps://hdl.handle.net/1822/46975-
dc.description.abstractThe amount of data has increased exponentially as a consequence of the availability of new data sources and the advances in data collection and storage. This data explosion was accompanied by the popularization of the Big Data term, addressing large volumes of data, with several degrees of complexity, often without structure and organization, which cannot be processed or analyzed using traditional processes or tools. Moving towards Big Data Warehouses (BDWs) brings new problems and implies the adoption of new logical data models and tools to query them. Hive is a DW system for Big Data contexts that organizes the data into tables, partitions and buckets. Several studies have been conducted to understand ways of optimizing its performance in data storage and processing, but few of them explore whether the way data is structured has any influence on how quickly Hive responds to queries. This paper investigates the role of data organization and modelling in the processing times of BDWs implemented in Hive, benchmarking multidimensional star schemas and fully denormalized tables with different Scale Factors (SFs), and analyzing the impact of adequate data partitioning in these two data modelling strategies.por
dc.description.sponsorshipThis work is supported by COMPETE: POCI-01-0145- FEDER- 007043 and FCT – Fundação para a Ciência e Tecnologia within the Project Scope: UID/CEC/00319/2013, and funded by the SusCity project, MITP-TB/CS/0026/2013, and by the Portugal Incentive System for Research and Technological Development, Project in co-promotion no. 002814/2015 (iFACTORY 2015– 2018).por
dc.language.isoengpor
dc.publisherSpringer Verlagpor
dc.relationinfo:eu-repo/grantAgreement/FCT/5876/147280/PTpor
dc.rightsrestrictedAccesspor
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/por
dc.subjectBig Datapor
dc.subjectData Warehousingpor
dc.subjectHivepor
dc.subjectModellingpor
dc.subjectPartitioningpor
dc.titleEfficient Big Data Modelling and Organization for Hadoop Hive-Based Data Warehousespor
dc.typeconferencePaperpor
dc.peerreviewedyespor
dc.relation.publisherversionhttps://link.springer.com/chapter/10.1007/978-3-319-65930-5_1por
oaire.citationStartPage3por
oaire.citationEndPage16por
oaire.citationConferencePlaceCoimbra, Portugalpor
oaire.citationVolume299por
dc.identifier.doi10.1007/978-3-319-65930-5_1por
dc.identifier.eisbn978-3-319-65930-5-
dc.subject.fosEngenharia e Tecnologia::Outras Engenharias e Tecnologiaspor
dc.description.publicationversioninfo:eu-repo/semantics/publishedVersionpor
dc.subject.wosScience & Technologypor
sdum.journalLecture Notes in Business Information Processingpor
sdum.conferencePublicationEuropean, Mediterranean and Middle Eastern Conference on Information Systems (EMCIS’2017)por
sdum.bookTitleInformation Systems. EMCIS 2017. Lecture Notes in Business Information Processingpor
Aparece nas coleções:CAlg - Artigos em livros de atas/Papers in proceedings

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
EMCIS2017_EC_CC_MYS.pdf
Acesso restrito!
949,59 kBAdobe PDFVer/Abrir

Este trabalho está licenciado sob uma Licença Creative Commons Creative Commons

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID