Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/66818

TítuloAn automated and distributed machine learning framework for telecommunications risk management
Autor(es)Ferreira, Luís
Pilastri, André
Martins, Carlos
Santos, Pedro
Cortez, Paulo
Palavras-chaveAutomated Machine Learning
Distributed Machine Learning
Risk Management
Supervised Learning
Data2020
EditoraSCITEPRESS
Resumo(s)Automation and scalability are currently two of the main challenges of Machine Learning. This paper proposes an automated and distributed ML framework that automatically trains a supervised learning model and produces predictions independently of the dataset and with minimum human input. The framework was designed for the domain of telecommunications risk management, which often requires supervised learning models that need to be quickly updated by non-ML-experts and trained on vast amounts of data. Thus, the architecture assumes a distributed environment, in order to deal with big data, and Automated Machine Learning (AutoML), to select and tune the ML models. The framework includes several modules: task detection (to detect if classification or regression), data preprocessing, feature selection, model training, and deployment. In this paper, we detail the model training module. In order to select the computational technologies to be used in this module, we first analyzed the capabilities of an initial set of five modern AutoML tools: Auto-Keras, Auto-Sklearn, Auto-Weka, H2O AutoML, and TransmogrifAI. Then, we performed a benchmarking of the only two tools that address distributed ML (H2O AutoML and TransmogrifAI). Several comparison experiments were held using three real-world datasets from the telecommunications domain (churn, event forecasting, and fraud detection), allowing us to measure the computational effort and predictive capability of the AutoML tools.
TipoArtigo em ata de conferência
URIhttps://hdl.handle.net/1822/66818
ISBN9789897583957
DOI10.5220/0008952800990107
Arbitragem científicayes
AcessoAcesso aberto
Aparece nas coleções:CAlg - Artigos em livros de atas/Papers in proceedings

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
irmda_artigo.pdf225,67 kBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID