Utilize este identificador para referenciar este registo: https://hdl.handle.net/1822/81556

Registo completo
Campo DCValorIdioma
dc.contributor.authorCosta, Miguel Ângelo Peixotopor
dc.contributor.authorCosta, Diogo André Veigapor
dc.contributor.authorGomes, Tiago Manuel Ribeiropor
dc.contributor.authorPinto, Sandropor
dc.date.accessioned2023-01-05T14:18:07Z-
dc.date.available2023-01-05T14:18:07Z-
dc.date.issued2022-12-
dc.identifier.citationMiguel Costa, Diogo Costa, Tiago Gomes, and Sandro Pinto. 2022. Shifting Capsule Networks from the Cloud to the Deep Edge. ACM Trans. Intell. Syst. Technol. 13, 6, Article 105 (December 2022), 25 pages. https://doi.org/10.1145/3544562por
dc.identifier.issn2157-6904-
dc.identifier.urihttps://hdl.handle.net/1822/81556-
dc.description.abstractCapsule networks (CapsNets) are an emerging trend in image processing. In contrast to a convolutional neural network, CapsNets are not vulnerable to object deformation, as the relative spatial information of the objects is preserved across the network. However, their complexity is mainly related to the capsule structure and the dynamic routing mechanism, which makes it almost unreasonable to deploy a CapsNet, in its original form, in a resource-constrained device powered by a small microcontroller (MCU). In an era where intelligence is rapidly shifting from the cloud to the edge, this high complexity imposes serious challenges to the adoption of CapsNets at the very edge. To tackle this issue, we present an API for the execution of quantized CapsNets in Arm Cortex-M and RISC-V MCUs. Our software kernels extend the Arm CMSIS-NN and RISC-V PULP-NN to support capsule operations with 8-bit integers as operands. Along with it, we propose a framework to perform post-training quantization of a CapsNet. Results show a reduction in memory footprint of almost 75%, with accuracy loss ranging from 0.07% to 0.18%. In terms of throughput, our Arm Cortex-M API enables the execution of primary capsule and capsule layers with medium-sized kernels in just 119.94 and 90.60 milliseconds (ms), respectively (STM32H755ZIT6U, Cortex-M7 @ 480 MHz). For the GAP-8 SoC (RISC-V RV32IMCXpulp @ 170 MHz), the latency drops to 7.02 and 38.03 ms, respectively.por
dc.description.sponsorshipMiguel Costa is supported by FCT-Fundacao para a Ciencia e Tecnologia (grant SFRH/BD/146780/2019). This work has been also supported by FCT-Fundacao para a Ciencia e Tecnologia within the R&D Units Project Scope: UIDB/00319/2020.por
dc.language.isoengpor
dc.publisherACM Presspor
dc.rightsopenAccesspor
dc.subjectCapsule networkspor
dc.subjectCapsule network quantizationpor
dc.subjectEdgepor
dc.subjectCloudpor
dc.subjectCMSIS-NNpor
dc.subjectPULP-NNpor
dc.titleShifting capsule networks from the cloud to the deep edgepor
dc.typearticlepor
dc.peerreviewedyespor
dc.relation.publisherversionhttps://dl.acm.org/doi/10.1145/3544562por
oaire.citationStartPage1por
oaire.citationEndPage21por
oaire.citationIssue6por
oaire.citationVolume13por
dc.identifier.eissn2157-6912-
dc.identifier.doi10.1145/3544562por
dc.subject.fosEngenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informáticapor
dc.subject.wosScience & Technologypor
sdum.journalACM Transactions on Intelligent Systems and Technologypor
dc.identifier.articlenumber105por
dc.subject.odsIndústria, inovação e infraestruturaspor
Aparece nas coleções:CAlg - Artigos em revistas internacionais / Papers in international journals

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato 
2110.02911.pdfPreprint1,94 MBAdobe PDFVer/Abrir
3544562.pdf
Acesso restrito!
Article1,06 MBAdobe PDFVer/Abrir

Partilhe no FacebookPartilhe no TwitterPartilhe no DeliciousPartilhe no LinkedInPartilhe no DiggAdicionar ao Google BookmarksPartilhe no MySpacePartilhe no Orkut
Exporte no formato BibTex mendeley Exporte no formato Endnote Adicione ao seu ORCID