HMRC is trawling the internet, including social media and other websites on which people share information, in a bid to find potential evidence of tax fraud that it can feed into its new Connect data warehouse.
Although HMRC is reluctant to go into detail, Mike Hainey, head of analytics at HMRC, has told Computing that since implementing Connect, its "big data" analytics system, the organisation has started feeding in information from the internet to help it better target tax investigations.
"[It's] information that we can obtain that is visible and available legally for HMRC to review," Hainey told Computing. While such information would include the easily identifiable accounts people run on Twitter and Facebook, it would also almost certainly include websites where traders ply their trade, such as RatedPeople.com, and where their customers leave comments.
Indeed, HMRC has always taken data feeds from a variety of sources to support its Enforcement and Compliance organisation. "It's departmental data at one end of the spectrum, commercial data, buying in information around businesses et cetera. We also get information from other government departments and other foreign FISCs [fiscal regimes] through various treaties and arrangements," says Hainey.
He adds: "Also, on occasions, we will bring in information that we may obtain from the internet and bring that into the picture."
The commercial data, he says, is typically information from Companies House about companies and directorships, or from credit reference agencies.
Of course, many organisations trawl social media for all kinds of purposes, often using automated tools. At its most innocent, such trawling represents little more than an extension of the press-cuttings services that have been offered to companies and wealthy celebrities for decades.
More seriously, reputation management companies also trawl social networks for evidence of potentially libellous comments, or misuse of corporate imagery and other copyrighted material.
HMRC's Connect analytics system, which Computing recently covered in a case study feature, won the prize for Best Big Data Project at the UK IT Industry Awards in November 2012.
The new system, which HMRC is still in the process of ramping up following an 18-month trial, helps to unify the siloed data collected under different tax systems, such as National Insurance and VAT. The aim is to build up more rounded pictures of the taxpaying public based on recognisable entities, such as individuals, their families and the circles within which they do business.
Previously, putting together such holistic pictures of taxpayers required several weeks of work just to pull the data from the disparate systems.