Digitalisation, Labour Market

The role of data skills in the modern labour market

7 minute read

By Julia Schmidt (Julia.Schmidt@oecd.org), Graham Pilgrim (Graham.Pilgrim@oecd.org), Annabelle Mourougane (Annabelle.Mourougane@oecd.org), Statistics and Data Directorate, OECD.

Originally written for VOX EU

In recent years, online job advertisement data have gained popularity as an alternative data source relating to labour markets, as they provide timely and granular information. In many cases, they have advantages over existing occupation classifications and are often complementary to official employment or vacancy statistics (Atalay et al. 2022). Recent studies using online job advertisements showed that digital skillsets have evolved over the past decade and can be found at the core of some traditionally non-digital domains (Sostero and Tolan 2022). Online job advertisements have also contributed to an improved understanding of the impact of crises on labour markets, such as the COVID-19 recession in the US and Canada (Soh et al. 2022, Bellatin and Galassi 2022) and the war in Ukraine (Pham et al. 2023).

In a recent paper (Schmidt et al. 2023), we aim to estimate the data intensity of occupations and sectors (i.e. the share of data-related jobs involved in the production of data). First, we put forward a novel methodology applying natural language processing (NLP) to online job advertisements from Lightcast to generate occupation- and industry-level estimates of data intensity. Second, the methodology can be used to advance cross-country comparable results on measuring the value of data assets in the data economy and the evolution of digital skills in the labour market. Third, the NLP algorithm is flexible and can be applied to concepts that are difficult to capture in traditional labour market statistics, such as green and AI-related jobs. The algorithm can also be adapted to over 66 languages, meaning the scope of the analysis could be broadened.

The most data-intensive occupations are linked to data analytics skills

Note: The Lightcast data provide occupation classifications. Data intensity takes values between 0 and 100.
Source: Authors’ calculations based on Lightcast data.

Differences in data intensity across the countries are concentrated in a handful of sectors

Per cent of labour demand, 2020

Note: Sectors are based on the ISIC version 4 classification. Activities of extraterrestrial organisations and activities of households are excluded. Data intensity takes values between 0 and 100.
Source: Authors’ calculations based on Lightcast data.

Professions with a low level of data intensity contribute most to the aggregate data intensity in the UK

Aggregate data intensity, per cent, 2020

Notes: Data intensity takes values between 0 and 100. Low data-intensive occupations: 0<10%, medium data-intensive occupations: 10-50% and high data-intensive occupations > 50%.
Source: Authors’ calculations based on Lightcast data.

Data intensity of an occupation in per cent, contribution to aggregate data intensity in percentage points, 2020

A) United Kingdom

B) Canada

C) United States

Notes: Contributions are computed as the data intensity of occupation classes weighted by their share of employees. Data intensity takes values between 0 and 100. Low data-intensive occupations: 0<10%, medium data-intensive occupations: 10-50% and high data-intensive occupations: > 50%. Contribution to aggregate data intensity is displayed in percentage points.
Source: Authors’ calculation based on Lightcast data.

Acemoglu, D and P Restrepo (2017), “Robots and Jobs: Evidence from US Labor Markets”, NBER Working Paper No. 23285.

Atalay, E, S Sotelo and D Tannenbaum (2022), “The geography of job tasks”, VoxEU.org, 12 November.  

Bellatin, A and G Galassi (2022), “What COVID-19 May Leave Behind: Technology-Related Job Postings in Canada”, Bank of Canada Staff Working Paper 2022/17.

Calvino, F, C Criscuolo, L Marcolin and M Squicciarini (2018), “A taxonomy of digital intensive sectors”, OECD Science, Technology and Industry Working Paper 2018/14.

Pham, T, O Talavera and Z Wu (2023), “Labour markets during wartime: Evidence from online job advertisements”, VoxEU.org, 22 July.

Schmidt, J, G Pilgrim and A Mourougane (2023), “What is the role of data in jobs in the United Kingdom, Canada, and the United States? A natural language processing approach”, OECD Statistics Working Paper 2023/05.

Soh, J, M Oikonomou, C Pizzinelli, I Shibata and M Mendes Tavares (2022), “Did the COVID-19 Recession Increase the Demand for Digital Occupations in the United States? Evidence from Employment and Vacancies Data”, IMF Working Paper 2022/195.

Sostero, M and S Tolan (2022), “Digital skills for all? From computer literacy to AI skills in online job advertisements”, JRC Working Papers Series on Labour Education and Technology.

spaCy (2022), “Language Processing Pipelines“.