Utilize este identificador para referenciar este registo:
https://hdl.handle.net/1822/66785
Título: | Challenging SQL-on-Hadoop performance with Apache Druid |
Autor(es): | Correia, José Costa, Carlos A. P. Santos, Maribel Yasmina |
Palavras-chave: | Big Data Big Data Warehouse SQL-on-Hadoop Druid OLAP |
Data: | 2019 |
Editora: | Springer Verlag |
Revista: | Lecture Notes in Business Information Processing |
Resumo(s): | In Big Data, SQL-on-Hadoop tools usually provide satisfactory performance for processing vast amounts of data, although new emerging tools may be an alternative. This paper evaluates if Apache Druid, an innovative column-oriented data store suited for online analytical processing workloads, is an alternative to some of the well-known SQL-on-Hadoop technologies and its potential in this role. In this evaluation, Druid, Hive and Presto are benchmarked with increasing data volumes. The results point Druid as a strong alternative, achieving better performance than Hive and Presto, and show the potential of integrating Hive and Druid, enhancing the potentialities of both tools. |
Tipo: | Artigo em ata de conferência |
URI: | https://hdl.handle.net/1822/66785 |
ISBN: | 9783030204846 |
DOI: | 10.1007/978-3-030-20485-3_12 |
ISSN: | 1865-1348 |
Versão da editora: | https://link.springer.com/chapter/10.1007%2F978-3-030-20485-3_12 |
Arbitragem científica: | yes |
Acesso: | Acesso aberto |
Aparece nas coleções: |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
BIS_2019_paper_137.pdf | 642,43 kB | Adobe PDF | Ver/Abrir |