Petr Ilgner, Dan Komosný, Saeed Ur Rehman
Website Hosting Data and Analysis
Číslo: 1/2019
Periodikum: Statistika
Klíčová slova: Internet, web content, hosting, geographical location, Czech Republic, CZ.NIC
Pro získání musíte mít účet v Citace PRO.
Anotace:
We have collected a large dataset – more than 21 000 websites – through web-crawling the public resources of the Czech Internet. The proposed method for website hosting detection along with their geographic location and software were applied on the collected data to extend basic statistical information about the Czech websites published by the national domain registrar CZ.NIC. For analysis, we divided the data into nine categories to show differences between them, for example, between the public and private sector. The procedures used in this paper may also be applied for an extended analysis of websites in other countries, for example, for verification of fulfillment of legal directives to be implemented by public sector.