ΛΗΚΥΘΟΣ
    • Ελληνικά
    • English
  • English 
    • Ελληνικά
    • English
  • Login
View Item 
  •   DSpace Home
  • Βιβλιοθηκονομία και Επιστήμη της Πληροφόρησης / Library and Information Science
  • Eλληνική BIβλιοθηκονομική BAση (ΕΒΙΒΑ)
  • Παρουσιάσεις και ομιλίες σε συνέδρια, διημερίδες, ημερίδες και σεμινάρια
  • View Item
  •   DSpace Home
  • Βιβλιοθηκονομία και Επιστήμη της Πληροφόρησης / Library and Information Science
  • Eλληνική BIβλιοθηκονομική BAση (ΕΒΙΒΑ)
  • Παρουσιάσεις και ομιλίες σε συνέδρια, διημερίδες, ημερίδες και σεμινάρια
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Migrating Content in WARC Files

Thumbnail
View/Open
diafofa/ecdl133 (458.2Kb)
Date
2009
Author
Strodl, Stephan
Beran, Peter Paul
Rauber, Andreas
Metadata
Show full item record
Abstract
Heritage institutions all over the world started on harvesting and preserving resources of the World Wide Web for future generations as part of our culture heritage. This task tends to be a non-trivial one because of two complex challenges: (1) crawling the enormous data amount located in the Internet and (2) performing long term preservation strategies on these data. Nowadays a lot of effort is made in the development ofWeb crawlers and there exist many years’ experience with bit storage of large data amounts. However the support for the logical preservation of Internet archives is very limited. The continuous development of technologies that are used in the Web and especially the rapid change in using a tremendous variety of different file formats put the digital assets in the Web archives at risk of becoming inaccessible and unusable in the near future. This paper presents a workflow to apply digital preservation strategies on the content of WARC archives. The migration of the objects within a WARC archive allows accessing and using the information in the future. The new WARC format that is widely used to store Internet crawl results supports migration of its content. Moreover a set of tools is presented that supports the extraction, migration and injection of objects in WARC files.
URI
http://hdl.handle.net/10797/14078
Collections
  • Παρουσιάσεις και ομιλίες σε συνέδρια, διημερίδες, ημερίδες και σεμινάρια [2186]

DSpace software copyright © 2002-2015  DuraSpace
Contact Us | Send Feedback
 

 

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

LoginRegister

Statistics

View Usage Statistics

DSpace software copyright © 2002-2015  DuraSpace
Contact Us | Send Feedback