Amazon's new NPS system houses over 25 terabytes of clickstream and other data

Case study: Amazon

Amazon boosts speed and performance of its vast database

Written by Linda More

With large volumes of information created daily, global internet retailer Amazon.com needs infrastructure and systems capable of storing and analysing extremely large volumes of data.

Given the company’s growth and increasing analytical sophistication, Amazon anticipated that inherent limitations in its deep analysis clickstream database system could prevent scaling this part of its information warehouse environment.

It decided to replace its Oracle clickstream database with a Netezza Performance Server (NPS) system that now houses more than 25 terabytes of clickstream and other transactional data.

The result is significantly reduced analysis time, allowing more extensive routine examination to be performed against 15 months' worth of data rather than weekly data sets. More than 25 hours of processing has been reduced to a single hour, while working with five times the original volume of data.

Improved analytical capabilities have already had a significant impact on the business. ‘Our analysts were concerned that a feature on the web site was not functioning properly,’ says Diane Lye, director of data mining and BI content. ‘To verify the hypothesis, we needed to analyse the clickstream data spanning several weeks. Previously it might have been considered too costly even to attempt.’

The performance and speed advantages make it easier to perform new analyses that may have been too cumbersome using the previous system. Amazon can now capture even finer detail associated with each page view, including content and placement details, and is more closely analysing the impact of different web treatments of a single page on customer behaviour. The analysis has the potential to provide new insights that will allow the business to create a more effective web experience for its customers.

The new data warehouse appliance has also eliminated time-consuming administrative and maintenance tasks. Jeff Parker, data warehouse infrastructure manager, says that a small portion of one database administrator’s time is needed for the NPS system, rather than the four full-time administrators that are needed to maintain Amazon’s other data warehouse systems.

‘Administrative workload is minimal with the NPS system,’ he says. ‘On the core data warehouse we find ourselves spending time extending table spaces, creating partitions and rebuilding indexes. With the NPS system we just create the table and it manages the rest for us. We have our largest dataset on the NPS system, and it runs so smoothly, you almost forget that it is there.’
