Have a personal or library account? Click to login
Survey vs Scraped Data: Comparing Time Series Properties of Web and Survey Vacancy Data Cover

Survey vs Scraped Data: Comparing Time Series Properties of Web and Survey Vacancy Data

Open Access
|Sep 2019

Abstract

This paper studies the relationship between a vacancy population obtained from web crawling and vacancies in the economy inferred by a National Statistics Office (NSO) using a traditional method. We compare the time series properties of samples obtained between 2007 and 2014 by Statistics Netherlands and by a web scraping company. We find that the web and NSO vacancy data present similar time series properties, suggesting that both time series are generated by the same underlying phenomenon: the real number of new vacancies in the economy. We conclude that, in our case study, web-sourced data are able to capture aggregate economic activity in the labor market.

Language: English
Published on: Sep 13, 2019
Published by: Sciendo
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2019 Pablo de Pedraza, Stefano Visintin, Kea Tijdens, Gábor Kismihók, published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 License.