Repository collection: ART
https://hdl.handle.net/1822/17815
2024-03-28T14:51:51Z

https://hdl.handle.net/1822/73404
2021-12-22T19:58:45Z 2021-06-14T16:42:26Z
Title: Beyond relational databases: preserving the data
Author: Ramalho, José Carlos; Ferreira, Bruno; Faria, Luís; Ferreira, Miguel
Abstract: Relational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool developed to support this methodology, the Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. DBPTK connects to live relational databases and extracts information into formats better suited to long-term preservation. Supported preservation formats include SIARD 2, which was created through cooperation between the Swiss Federal Archives and the E-ARK project and is becoming a standard in the area. DBPTK has a flexible plugin-based architecture, enabling its use for other purposes such as database upgrades and migration between different database systems. Real case scenarios are presented that demonstrate the usefulness, correctness and performance of the tool.
Type: article
2021-06-14T16:42:26Z

https://hdl.handle.net/1822/46093
2020-02-11T14:14:59Z 2017-06-29T14:49:20Z
Title: RODA-in: A generic tool for the mass creation of Submission Information Packages
Author: Ramalho, José Carlos; Pereira, André; Ferreira, Miguel; Faria, Luís
Abstract: RODA-in is an offline tool designed to create thousands of SIPs containing gigabytes of data in an easy-to-use way. This is possible by using aggregation rules, which map files and folders to SIPs, and metadata association rules, which add metadata to the created SIPs. The basic workflow can be defined as a sequence of easy steps in which the user starts by selecting the folders to be archived and then chooses which patterns will be used to transform the data into SIPs. As an optional step, the generated SIPs can be edited, either to enrich them or to fix exceptions to the rule. Lastly, the SIPs can be exported to two different formats: BagIt and E-ARK SIP. In this paper we present and discuss the decisions and ideas behind the implementation of RODA-in, such as which workflow should be used, which aggregation and metadata association options are currently implemented, how the metadata templating system works and which other features can be used to enrich the SIPs.
Type: conferencePaper
2017-06-29T14:49:20Z

https://hdl.handle.net/1822/43479
2017-04-24T14:32:46Z 2016-12-16T09:49:56Z
Title: Database Preservation Toolkit: A relational database conversion and normalization tool
Author: Ferreira, Bruno; Faria, Luís; Ramalho, José Carlos; Ferreira, Miguel
Abstract: The Database Preservation Toolkit is software that automates the migration of a relational database to the second version of the Software Independent Archiving of Relational Databases (SIARD) format. This flexible tool supports the currently most popular Relational Database Management Systems and can also convert a preserved database back to a Database Management System, allowing for some specific usage scenarios in an archival context. Converting databases between different formats while retaining the databases' significant properties poses a number of interesting implementation issues, which are described along with their current solutions.
To complement the conversion software, the Database Visualization Toolkit is introduced as software that allows access to preserved databases, enabling a consumer to quickly search and explore a database without knowing any query language. The viewer is capable of handling big databases and promptly presenting search and filter results over millions of records.
This paper describes the features of both tools and the methods used to pilot them in several European national archives, in the context of the European Archival Records and Knowledge Preservation (E-ARK) project.
Type: conferencePaper
2016-12-16T09:49:56Z

https://hdl.handle.net/1822/35281
2017-09-28T14:09:56Z 2015-05-26T14:43:52Z
Title: Research projects as a driving force for open source development and a fast route to market: RODA, SCAPE and E-ARK - a case study
Author: Silva, Hélder; Ferreira, Miguel; Faria, Luís
Abstract: Research projects, especially in the computer science domain, have consistently provided outputs as open source products or as updates to long-standing open source projects. This occurs because research and open source share an open nature, which enables re-use by the community and spawns new developments in both research and open source products. But when an open source project serves a community and a real-world problem, the impetuosity of research can clash with the inertia of real-world application. Nevertheless, research projects can bring much-needed innovation to open source projects, and open source projects can provide the much-needed route to market that research funders seek for the outputs of the research they fund, ensuring that the budget spent on research actually reaches the community and improves the world.
This paper presents an analysis of this dynamic through a case study of RODA, an open source repository for digital preservation used in memory institutions such as archives, and two research projects: SCAPE, focused on scalable digital preservation services, and E-ARK, focused on standardization of information packages, integration with real-world applications, and database preservation.
The paper further tries to identify good practices for using existing open source projects in research and for ensuring that research outputs are carried into the main versions of open source projects and find their way to the final user.
Type: conferencePaper
2015-05-26T14:43:52Z

https://hdl.handle.net/1822/30904
2017-09-06T16:50:29Z 2014-11-13T18:26:12Z
Title: Scalable decision support for digital preservation
Author: Becker, Christoph; Faria, Luís; Duretec, Kresimir
Abstract: Preservation environments such as repositories need scalable and context-aware preservation planning and monitoring capabilities to ensure continued accessibility of content over time. This article identifies a number of gaps in the systems and mechanisms currently available and presents a new architecture for scalable decision-making and control in such environments.
Type: article
2014-11-13T18:26:12Z