Part 3: Future Talent /

About the report

Why is research on future talent important?

Tech Nation Talent is an exploration of tech skills in the UK. We want to understand the current supply of tech talent, but also where the UK’s future talent will come from. This report looks at young people’s perceptions of tech careers to get to grips with this supply, young people’s views on some of the barriers that stand in the way of inclusivity, and opportunities for developing the tech workforce.

We will use this research to inform practical steps for tech companies, founders and CEOs, and young people to help move the UK towards a more inclusive and diverse future tech workforce.

What data is used?

Survey of young people

Over 1,000 young people responded to Tech City UK’s online survey on tech careers in August 2017. Sampling was structured to ensure that the results were as representative as possible of people aged 15-21 across UK regions.

Young people were asked a series of questions about their career plans, preferences and the rationale for their choices. Respondent were able to select more than one response in the survey, hence not all totals sum to 100.

Characteristics of young people in the survey sample

The sampling framework was structured to enable a close mach to country level population estimates, to ensure a close to representative sample of young people was gathered by UK constituent country. Though not entirely commensurable, as a guide, we compare the sample with ONS population estimates for 16 – 21 year old people for 2017.

As can be seen from the two graphs below, the sample is closely aligned to the country level populations of young people across the UK. However, the survey over-samples England – with the proportion of 15 – 21 year old people in this nation at 87% compared to population projection of 84%. For Scotland, Wales and Northern Ireland, the sample is 1% below the national projection based on population estimates.

Location of young people surveyed in Tech City UK survey of Young People (2017)

Proportion of respondents (%) Proportion of respondents (%)
England 87
Scotland 7
Wales 4
Northern Ireland 2

National Population projections of people aged 16 – 21 (2017) by country (Source: ONS Population Projections/ Estimates, Nomis)

Proportion of national population of young people (%) Proportion of national population of young people (%)
England 84
Scotland 8
Wales 5
Northern Ireland 3

Data from Reddit

Reddit is a US based social news aggregation, content rating, and discussion website. Members post content such as questions and replies, links – both to external sites, and other Reddit feeds, and images, which are then voted up or down by other members.

Posts are organised by subject into user-created boards called ‘subreddits’, which cover a variety of topics including news, science, jobs, movies, video games, music, books, fitness, food, and image-sharing. Submissions with more up-votes appear towards the top of their subreddit and, if they receive enough votes, ultimately on the website’s home page. Despite strict rules prohibiting harassment, Reddit’s administrators spend considerable resources on moderating the site.

As of 2017, Reddit had 542 million monthly visitors (234 million unique users), ranking it as the 4th most visited website in US and 8th in the world. According to 2015 data, Reddit experienced 82.54 billion page views, 73.15 million submissions, 725.85 million comments, and 6.89 billion upvotes from its users. Use of Reddit has only increased in a blog from Reddit in late December 2017, the total number of comments had reached over 900 million, and over 12 billion upvotes – making Reddit one of the definitive sources of information directly from people around the world.

In this research, we used a small subset of this global conversation curator’s data – focusing on tech career discussions. We scraped just under 80,000 Reddit responses to around 7,000 questions on careers to listen to how young people were talking about tech careers in the UK, and across the world.

Given that Reddit works by allowing users to pose questions under themed threads, or sub-Reddits, we identified a number of sub-Reddits associated with tech careers, and careers more broadly. Reddit’s mission is to ‘help people discover places where they can be their true selves, and empower their community to flourish’. We use their data accordingly, surfacing hidden conversations that users are having about careers, to listen to their thoughts when it comes to important issues, like the skills they think they need for tech, and their perceptions of tech.

We use the Thomson- Reuters Business Classification (TRBC) to classify career areas into sectors for our analysis using Reddit data. Critically, the TRBC is a global industry classification system – allowing us to capture sectoral activity in a way that is not biased by nationally specific classifications, which is appropriate for Reddit given its international reach and user base.

Responses to questions on sub-Reddits range from very brief answers – such as single word responses, to very lengthy passages of text, some of which are up to 2,000 words in length. As such, we have access to extremely rich text data which is directly reflective of young people’s experiences and perceptions of tech careers. However, this long text data is notoriously difficult to analyse, and distill. This is why we partnered with the Department of Computer Science at the University of Sheffield.

The tools that The University of Sheffield used as a data partner for the report, through their General Architecture for Textual Engineering (GATE), include data collection, semantic analysis, information aggregation, search and visualisation tools, which allow analysts to dig deep into the data and to perform complex queries over large volumes of data. The infrastructure enables users to collect and structure data from Reddit, analyse the posts and make the analysis results available for searching (using an indexing system called Mimir).

Characteristics of Reddit use

Most Reddit users post only 1 or 2 responses to questions – there is a long tail of users who post more than 10, however, there are some prolific users who have posted up to 235 responses to questions.

Likewise, with the number of responses, there is a strong skew in the distribution of questions posed by users. We see that most users tend to post just 1 question (around 5,500 users), and very few users post more than 3 questions. In terms of what this means for Reddit use, it suggests that users have a specific requirement of the platform and Reddit community – they tap into Reddit to seek responses for a single burning question, in the case of the data that we have investigated, around careers, or employment related themes.

Report partners


Hays Digital Recruitment

Finding future tech talent

Finding the right talent is one of the leading challenges that high growth digital companies and their leadership teams are facing today. One of the solutions to the shortage of skills is to encourage more young people and home-grown talent into the industry. In doing so, we can continue the rapid growth of the UK digital economy and maintain the UK’s position as the best place to found and grow a digital business.

The findings of this report indicate that many young people perceive obstacles to pursuing their ambitions, particularly those without university education, 59% of whom believed there were too few opportunities available to them. Our partnership with Tech City UK helps us to identify and overcome challenges like these by creating more opportunities for young people to learn and thrive in this exciting and innovative industry.

How we’re helping young people reach their ambitions

To encourage young people into the sector, we partner with initiatives such as Coderdojo and Teen-Turn in which young people are able to learn to code, build websites and consider digital technology as a career.

For us, recruitment is an ongoing relationship and that’s why we form partnerships with many digital communities and eco-systems, such as StackOverflow, Silicon Republic and Empact Ventures. This allows us to engage with talented professionals at each stage of their career, and find the perfect opportunities for their strengths, skills and ambitions.

By working with Tech City UK, we aim to support the next generation of digital entrepreneurs and leaders, so they can continue the legacy of innovation in the UK.

Data partner

Department of Computer Science, The University of Sheffield

Since 2010, Prof. Bontcheva and her research team in the Department of Computer Science have carried out world-leading research on social media analysis and summarisation, with specific focus on computational methods for detecting and tracking mis- and disinformation online. They also develop the widely used, open source GATE text and social media analysis platform. This includes TwitIE – one of the best performing Twitter named entity recognition systems according to a recent independent evaluation. GATE also includes text and semantic similarity metrics, a wide range of machine learning algorithms, corpus annotation, and evaluation tools – many of which were used to carry out this data-driven analysis of Reddit posts.

The team is based at the Department of Computer Science, University of Sheffield and includes world-class teams in the areas of speech, language, knowledge and information processing, biotechnology, and machine learning for medical informatics. The Department of Computer Science was founded in 1982 and since then has established national and international renown for many aspects of its teaching and research. It was awarded a top Grade 5 in the most recent nationwide Research Assessment Exercise.