Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/43853
Registo completo
Campo DC | Valor | Idioma |
---|---|---|
dc.contributor.author | Bruno Martinho | por |
dc.contributor.author | Santos, Maribel Yasmina | por |
dc.date.accessioned | 2016-12-23T14:20:49Z | - |
dc.date.issued | 2016-12 | - |
dc.identifier.citation | Martinho, Bruno, and Maribel Yasmina Santos, “An Architecture for Data Warehousing in Big Data Environments”, Proceedings of the 10th. International Conference on Research and Practical Issues of Enterprise Information Systems (Confenis’2016), IFIP International Federation for Information Processing, LNBIP 268, pp. 237–250, 2016, Springer-Verlag, 13-14 December 2016, Viena, Aústria, 2016. (DOI: 10.1007/978-3-319-49944-4_18). | por |
dc.identifier.isbn | 9783319499437 | por |
dc.identifier.issn | 1865-1348 | por |
dc.identifier.uri | https://hdl.handle.net/1822/43853 | - |
dc.description.abstract | Recent advances in Information Technologies facilitate the increasing capacity to collect and store data, being the Big Data term often mentioned. In this context, many challenges need to be addressed, being Data Warehousing one of them. In this sense, the main purpose of this work is to propose an architecture for Data Warehousing in Big Data, taking as input a data source stored in a traditional Data Warehouse, which is transformed into a Data Warehouse in Hive. Before proposing and implementing the architecture, a benchmark was conducted to verify the processing times of Hive and Impala, understanding how these technologies could be integrated in an architecture where Hive plays the role of a Data Warehouse and Impala is the driving force for the analysis and visualization of data. After the proposal of the architecture, it was implemented using tools like the Hadoop ecosystem, Talend and Tableau, and validated using a data set with more than 100 million records, obtaining satisfactory results in terms of processing times. | por |
dc.description.sponsorship | This work has been supported by COMPETE: POCI-01-0145FEDER-007043 and FCT (Fundação para a Ciência e Tecnologia) within the Project Scope: UID/CEC/00319/2013, and by Portugal Incentive System for Research and Technological Development, Project in co-promotion nº 002814/2015 (iFACTORY 2015-2018). Some of the figures in this paper use icons made by Freepik, from www.flaticon.com. | por |
dc.language.iso | eng | por |
dc.publisher | Springer Verlag | por |
dc.relation | info:eu-repo/grantAgreement/FCT/5876/147280/PT | por |
dc.rights | restrictedAccess | por |
dc.subject | Big Data | por |
dc.subject | Data Warehouse | por |
dc.subject | NoSQL | por |
dc.subject | Hadoop | por |
dc.subject | Hive | por |
dc.subject | Impala | por |
dc.title | An architecture for data warehousing in big data environments | por |
dc.type | conferencePaper | por |
dc.peerreviewed | yes | por |
sdum.publicationstatus | info:eu-repo/semantics/publishedVersion | por |
sdum.event.title | The 10th. International Conference on Research and Practical Issues of Enterprise Information Systems (Confenis’2016), IFIP International Federation for Information Processing | - |
oaire.citationStartPage | 237 | por |
oaire.citationEndPage | 250 | por |
oaire.citationConferencePlace | Vienna, Austria | por |
oaire.citationVolume | 268 | por |
dc.identifier.doi | 10.1007/978-3-319-49944-4_18 | por |
dc.subject.fos | Engenharia e Tecnologia::Outras Engenharias e Tecnologias | por |
dc.subject.wos | Science & Technology | por |
sdum.journal | Lecture Notes in Business Information Processing | por |
sdum.conferencePublication | RESEARCH AND PRACTICAL ISSUES OF ENTERPRISE INFORMATION SYSTEMS, 10TH IFIP WG 8.9 WORKING CONFERENCE, CONFENIS 2016 | por |
Aparece nas coleções: |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
ArchitectureBigDataWarehousing_Final.pdf Acesso restrito! | 992,14 kB | Adobe PDF | Ver/Abrir |