The Croatian Web Archive (HAW) is a collection of content archived from the web for the purpose of archiving and permanently storing online legal deposit copy. Content on the web is a significant part of contemporary Croatian cultural and scientific heritage. It is built on the concepts of selective harvesting, annual harvesting of the .hr domain and thematic harvesting of content of national importance.
Croatian Web Archive (HAW) is part of the Digital Collections of the National and University Library in Zagreb.
Publishers are legally bound to deliver to the National and University Library in Zagreb a legal deposit copy of a resource published online. In turn, the Library is legally bound to collect, process, store and provide access to the resources.
HAW uses the DAMP-based software customized by the University of Zagreb’s Computing Center (SRCE), by which copies are archived in their original format and in a way that they can be used in the future. Also, in carrying out regular national web domain crawls and crawls targeting specific content, it uses a version of the Heritrix free software customised by Srce.
Selective harvesting on the web is a process that involves creating a bibliographic description, permanent storing and making archived publications available. Copies of such publications are also available in the NSK online catalog. Search is available in the “Search HAW” search engine by title, URL and keywords. Advanced search is available here.
HAW harvests publicly available content from the Croatian web space (entire .hr domain) once a year and saves it to the server. Search is available in the “Search .hr domain” search engine by entering a full URL.
HAW periodically harvests selected content from the web relating to current topics or events of national importance. The most relevant subject matter is selected. Browse them here.
Assignment of a permanent URN:NBN identifier for all titles and archived copies begins.
HAW’s metadata available at Europeana.
HAW begins with the annual harvesting of the .hr domain and building thematic collections.
The Digital Archive of Croatian Online Publications is renamed to the Croatian Web Archive. The new name more accurately describes HAW’s purpose and purpose of archived content from the web. Also, it is consistent with the names of the web archives of other national libraries.
The Croatian Web Archive becomes a member of the International Internet Preservation Consortium (IIPC).
NSK, in collaboration with SRCE, is establishing a Digital Archive of Croatian Online Publications – DAMP in order to collect and preserve the online part of Croatian publishing production.
NSK begins cataloging online publications.