
dc.contributor.author: Strodl, Stephan [en]
dc.contributor.author: Beran, Peter Paul [en]
dc.contributor.author: Rauber, Andreas [en]
dc.coverage.spatial: GR - Κέρκυρα (Corfu) [en]
dc.date.available: 2014-03-24T10:18:44Z
dc.date.issued: 2009
dc.identifier.uri: http://hdl.handle.net/10797/14078 [en]
dc.description: Περιέχει το πλήρες κείμενο (Contains the full text) [el_GR]
dc.description.abstract: Heritage institutions all over the world have started harvesting and preserving resources of the World Wide Web for future generations as part of our cultural heritage. This task is non-trivial because of two complex challenges: (1) crawling the enormous amount of data on the Internet and (2) applying long-term preservation strategies to these data. A lot of effort currently goes into the development of Web crawlers, and there are many years of experience with bit storage of large amounts of data. However, support for the logical preservation of Internet archives is very limited. The continuous development of Web technologies, and especially the rapid change in the wide variety of file formats in use, puts the digital assets in Web archives at risk of becoming inaccessible and unusable in the near future. This paper presents a workflow for applying digital preservation strategies to the content of WARC archives. Migrating the objects within a WARC archive allows the information to be accessed and used in the future. The new WARC format, which is widely used to store Internet crawl results, supports migration of its content. In addition, a set of tools is presented that supports the extraction, migration and injection of objects in WARC files. [en]
dc.language.iso: eng [en]
dc.relation.ispartof: The 9th International Web Archiving Workshop (IWAW 2009) [en]
dc.rights: info:eu-repo/semantics/openAccess [en]
dc.source: 13th European Conference, ECDL 2009 [en]
dc.title: Migrating Content in WARC Files [en]
dc.type: Workshop [en]
dc.subject.uncontrolledterm: Migration [en]
dc.subject.uncontrolledterm: Digital preservation [en]
dc.subject.uncontrolledterm: Web Archive [en]
dc.subject.uncontrolledterm: WARC [en]
dc.subject.JITA: Τεχνικές υπηρεσίες σε βιβλιοθήκες, αρχεία και μουσεία, Ψηφιακή διατήρηση [el_GR]
dc.subject.JITA: Technical services in libraries, archives and museums, Digital preservation [en]
dc.contributor.conferenceorganizer: Laboratory on Digital Libraries and Electronic Publishing, Department of Archives and Library Sciences, Ionian University [en]
dc.identifier.JITA: JH [en]

