OpenAI and other firms are using synthetic data to train AI models

Skirts complaints related to IP abuse, privacy and data access

clock • 3 min read
Advanced AI models now being trained using computer-made 'synthetic' data
Image:

Advanced AI models now being trained using computer-made 'synthetic' data

Major tech firms developing generative AI models are actively exploring a new approach to acquiring the vast amounts of information they need for their advanced models: creating it from scratch using computer-generated data.

Players like Microsoft, OpenAI, and Cohere are employing synthetic data to train their large language models (LLMs), primarily due to the constraints in the availability of human-made data. Micr...

To continue reading this article...

Join Computing

  • Unlimited access to real-time news, analysis and opinion from the technology industry
  • Receive important and breaking news in our daily newsletter
  • Be the first to hear about our events and awards programmes
  • Join live member only interviews with IT leaders at the ‘IT Lounge’; your chance to ask your burning tech questions and have them answered
  • Access to the Computing Delta hub providing market intelligence and research
  • Receive our members-only newsletter with exclusive opinion pieces from senior IT Leaders

Join now

 

Already a Computing member?

Login

You may also like
Liberal Democrat manifesto: What's in it for tech?

Government

The industry gets short shrift in a manifesto for the young

clock 13 June 2024 • 4 min read
SAS: If we can't bridge the AI trust gap, we're going nowhere

Artificial Intelligence

Josefin Rosén, principal trustworthy AI specialist, on the importance of clarity

clock 13 June 2024 • 4 min read
UK lags behind Europe in tech skills despite government investment

Skills

In 2022, the UK was at 38th position globally

clock 13 June 2024 • 2 min read

More on Big Data and Analytics

Blenheim Estate: How tech is protecting 'the finest view in England'

Blenheim Estate: How tech is protecting 'the finest view in England'

Data analysis and a sprawling sensor network are saving money and boosting biodiversity

Tom Allen
clock 12 June 2024 • 5 min read
Belfast to spearhead UK's digital revolution with £37m Digital Twin Centre

Belfast to spearhead UK's digital revolution with £37m Digital Twin Centre

Aim is to foster innovation across engineering sectors

clock 02 May 2024 • 2 min read
Even CERN has to queue for GPUs. Here's how they optimise what they have

Even CERN has to queue for GPUs. Here's how they optimise what they have

'There's a tendency to say that all ML workloads need a GPU, but for inference you probably don't need them'

John Leonard
clock 17 April 2024 • 4 min read