News Column

Information technology services

February 15, 2014

Contract award: top-level domain harvesting of "de" for german national library. The German National Library (DNB) has the statutory mandate to collect publications in German networks, archive, and provide. The aim is to achieve together with a service provider, a collection of all web pages with address in the range of the top-level domain "de" (TLD-DE). The technical collection, storage (hosting) and the provision including full-text search is to be performed by the service provider. Due to lack of experience on the scope of an as complete as possible ingathering at first only a crawl to be performed. The Contracting Authority reserves the option to purchase up to two renewal of this contract, by subsequent crawl. The provider must collect a crawling process whose URL contains the TLD DE starting from an initial list of URLs (seeds) as many websites as technically possible or as described below. The entry list is to be found by the vendor optimized (eg in cooperation with DENIC) and with the aim to carry out the acquisition of the affected sites as completely as possible. How this is done should be stated by the manufacturer. The absence of detailed knowledge of the volume of all the sites within the TLD-DE exist, the implementation in a gradual rapprochement can occur. This can be done by a restriction on a storage amount, at which point the crawl is canceled. This can also be done by repeated experimental crawls with different parameters. However, the aim and the supplier to be delivered result has the fullest possible crawl of the TLD-EN be, and an assessment of the extent of Web sites not covered by it. Own hardware and software of the provider must be used for the implementation, the choice of the appropriate infrastructure and programs is left to the provider. The storage of the contents of the crawls must be on the provider~s servers. The provider must index all content of the crawl for a full text search. An access page for the full text search and access the saved Web pages must be provided by the manufacturer. Access to it with a web browser (Internet Explorer, Firefox) may only be carried out by computers in the buildings of the German National Library, which must be guaranteed by the supplier by appropriate technical measures (eg restrictions on IP ranges). All archived websites must be characterized by a banner, which is designed according to the specifications of the German National Library in accessing an archive. However, the actual Web pages should be available internally unchanged. The design of the banner is set by DNB. The search results must be listed according to the user time slices. In addition, the service provider should provide an automatic search interface for full-text search of the collected archive copies. Recommendation by the DNB is a SRU interface with support of the query language CQL response forma Contractor name : INTERNET MEMORY RESEARCH

Contractor address : 45, Ter Rue de la RÉvolution 93100 Montreuil

Agency : Deutsche Nationalbibliothek Adickesallee 1 For the attention of: Frau Stefan 60322 Frankfurt am Main GERMANY Telephone: +49 6915252302 Fax: +49 6915252002 E-mail: Internet address(es): General address of the contracting authority:

country :Germany

For more stories covering the world of technology, please see HispanicBusiness' Tech Channel

Source: TendersInfo (India)

Story Tools