Linkrot: A growing problem

Up to 30 per cent of current hyperlinks don't work

Written by James Middleton

Linkrot has been a growing problem since the conception of the world wide web, and the problem has attracted attention from internet watchers in very high places.

Some believe that the only solution will be the evolution of a 'semantic' internet which allows machines to process and 'understand' data rather than merely display it.

Everyone has experienced linkrot: clicking on a promising hyperlink only to be confronted by 'this page has been moved' or 'that page no longer exists' or the frustrating '404 error'.

And the problem is growing. The last proper survey on linkrot, carried out in 1997 by the University of Georgia, revealed that around six per cent of all links are bad.

This figure was up 50 per cent from the year before and, although there isn't the research to confirm it, if that rate continued then around 25 to 30 per cent of all links on the internet could suffer.

In his Webword.com weblog this week John Rhodes put forward the idea for a self repairing internet. An integrated application would sit on a web server capable of doing reverse lookups on links to pages using referrer logs or even by piggybacking on search engines.

Once broken incoming links were found the software would send repair information to the server at fault. But aside from ensuring secure connections, the big problem with this is the extra traffic that is generated. A site like Yahoo would drown from bandwidth demand.

But the idea isn't as crazy as it seems. Tim Berners-Lee, the father of the world wide web, and director of the World Wide Web Consortium, has been working for years on a semantic web with a small group of developers.

Last year Berners-Lee wrote a feature for Scientific American in which he described the semantic web as "not a separate web but an extension of the current one in which information is given well-defined meaning, better enabling computers and people to work in co-operation ... as machines become much better able to process and 'understand' the data that they merely display at present".

He explained that most of the web's content today is designed for humans to read, not for computer programs to manipulate meaningfully.

"Computers can adeptly parse web pages for layout and routine processing: here a header, there a link to another page. But, in general, computers have no reliable way to process the semantics," said Berners-Lee.

The semantic web intends to make up for this. Berners-Lee thinks that, since its creation, the web has developed into a medium of documents for people, rather than for data and information that can be processed automatically.

And so the aim of the SemanticWeb.org community is to "bridge the gap between the one end of the scale where we have everything from the five-second TV commercial to poetry, and at the other end where we have databases, programs and sensor output".

It promises to be a hot topic at the European and National conferences on artificial intelligence in the summer, where the group will be holding workshops.

Last month SymantecWeb.org released Triple, a language that allows other languages such as XML, based on a Resource Description Framework, to be defined with rules.

Tags:

reader comments

related articles

Websites suffer high decay factor

Half the internet is out of date after three months 30 Apr 2002

 

W3C backs XML-based signatures

Web consortium recommends new standard. 12 Mar 2002

What on earth is? HTML

A look at Hyper Text Markup Language. 04 Feb 2002

A question of semantics

World Wide Web inventor Tim Berners-Lee talks to Liesbeth Evers about his new vision of the internet. 29 Jan 2002

related whitepapers

today's top stories

How to maximise the value of your IT networking investment

A panel of experts discuss networking strategies that deliver real value to business 03 Jul 2009

Habitat gets a web site makeover

The furniture retailer is revamping its online presence to provide a fully transactional web site. CIO Jacques Dekock explains why 02 Jul 2009

Government aims to bolster UK's cyber defences

Is the UK’s first national cyber security strategy up to the task of co-ordinating the country’s response to digital threats? Computing investigates 02 Jul 2009

Focus resources on what really matters

IT has become too caught up in the drive for efficiency, at the expense of business success 02 Jul 2009

From tracks man to tax man

Phil Pavitt, outgoing chief information officer for Transport for London, talks to Rosalie Marshall about the lessons he will take to his new role at HMRC 02 Jul 2009

Advertisement

Newsletter signup

Sign up for our range of FREE newsletters:

More available - click 'submit' to view

Existing User

Newsletter user login:

Advertisement

Jobs

Related jobs

Job of the week

Job alerts

Sign up here

Find your next job

IT Salary Checker

Check salary here

Advertisement

White papers

Search white papers

Top categories

VPN, Extranet and Intranet Solutions

WAN/ LAN Solutions

Network Security

Interoperability-Connectivity

Grid/ Utility Computing

Latest poll

Would you use social networking sites to look for a job?

Would you use social networking sites to look for a job?

Tell us what you think about job hunting through LinkedIn, Facebook, Twitter etc

View poll results

Latest audio and video articles

network cablesVideo

How to maximise the value of your IT networking investment

A panel of experts discuss networking strategies that deliver real value to business 03 Jul 2009

green footprintsVideo

How to manage enterprise energy use - and the role IT can play

A panel of experts explore how firms can get to grips with their carbon footprint and make smarter use of energy 01 Jul 2009

Latest in-depth articles

Phil PavittAnalysis

From tracks man to tax man

Phil Pavitt, outgoing chief information officer for Transport for London, talks to Rosalie Marshall about the lessons he will take to his new role at HMRC 02 Jul 2009

UPS worker making a deliveryAnalysis

Global standardisation delivers benefits at UPS

Delivery giant sees benefits of central IT solution 02 Jul 2009

Advertisement

Primary Navigation