Data Trends: Petabyte and Beyond

By Fred Hapgood

PAGE 5

Cavers believes that petabyte-level data stores will force IT people to minimize the number of mass copy operations as much as possible. "This is a paradigm shift in the way people think about computing," he says.

The Petabyte Paradigm

Gerry Higgins, senior vice president of Verizon information processing services in New York City, points out that maintaining a petabyte of data raises distribution management issues in hardware as well as software. In the petabyte world, data is usually spread over thousands of disks. "Vendors always want to talk to me about how great their mean-time-between-failure numbers are. I tell them not to bother. All I’m interested in is what happens when there is a failure," Higgins says. "When you deal with so many disks, some are always crashing. I tell them that when you’re a petabyte guy like me, you have to expect failures."

Many observers think the transition to petabyte levels is going to introduce changes even more sweeping than those associated with previous leaps in storage. "Traditionally vendors have built standalone data mining engines and moved the data into them," says Winter. "But are you going to be able to move a petabyte around like that?" Winter foresees radical changes in engine architecture, probably involving breakthroughs in the engineering of parallelization.

"The whole notion of storage takes on a new meaning," says Scot Klimke, vice president and CIO for Network Appliance, a storage services vendor in Sunnyvale, Calif. "It starts to be defined less as simple retention and more as the struggle for information quality."

Perhaps the worst such issue is consistency. A petabyte of data is so big, and the quality of the information it contains is perforce so low, that it is bound to contain and create inconsistent information, which means that any petabyte-level system has to contain ways of detecting and resolving data conflicts.

Another issue is aging: Information quality varies, roughly, with age, but present systems are poorly equipped to track the age of material, especially material within a file. "I have five priorities for this fiscal year," Klimke says. "Two involve data quality."

Klimke argues that as the petabyte revolution picks up steam, the struggle to measure and manage data quality will increasingly define the CIO’s job. While he might or might not be right about this specific point, it’s clear that anyone exploring the petabyte world should bring a good map, watch out for booby traps and carry a rabbit’s foot for luck.

$firstKeyword

Loading...
Data Center MarketSpace
From Chaos to Order-Winning the Information Management Game
Learn how Oracle Application Express delivers an easy, fast, and free way to manage your business information. Learn more »
Optimizing Information Insight
This paper will argue that the key to enabling midsize organizations to make even better business decisions is by simplifying the extraction of specific, actionable information from large volumes of data. Learn more »
Looking for a fast payback?
Learn how you can boost ROI and productivity with a JDE technology refresh. Learn more »
3 Minutes with Free Tool Can Save Thousands!
See how you can improve decision-making while reducing your total cost of ownership through process efficiencies and technology simplification. Learn more »
Informatica 9: What it means for the CIO?
Hear from Informatica's CIO on how Informatica 9 will improve... Learn more »
Lower Costs with New Servers and Consolidation
When it comes to server technology staying the course will cost you. Lower costs and create an efficient datacenter with newer server technology. Learn more »
 
SPONSORED LINKS
 

White Paper: Right-Sizing Your Power Infrastructure

Lower IT Costs with Oracle Database 11g Release 2

New technology that addresses challenges organizations are facing.

Return on Information: Google Enterprise Search pays you back

Cut Costs & Green Your IT Operations with PC Power Management

White Paper: 4 Customer Service Myths

White Paper: Managed Security for a Not-So-Secure World

White Paper: 5 Best Practices for Smartphone Support

Global Research: CIOs Weigh In On Virtualization

5 Key Virtualization Management Challenges

Secure Email and Web-Based Communication from Evolving Attacks

WagerWorks Takes Fraudsters Out of the Game using iovation

Seven Design Requirements for Web 2.0 Threat Protection

Increase UPS efficiency without sacrificing protection.

Learn how advanced forecasting tools can deliver significant business results for global corporations.

Achieving Business Agility with Application Grid

Ready to virtualize tier one applications? Check your virtualization maturity.

Seven Ways ITIL Can Help You in an Economic Downturn

Tips for successful virtualization management.

AT&T Synaptic Storage as a Service. Expand on demand

Trend Micro ranked #1 against real-world malware. Read more.

Webinar: Jump-start your in-house e-discovery with Ringtail QuickCull from FTI Technology

Streamline IT Costs. Boost Performance with WAN Optimization.

Build your 1st app FREE with Force.com

TDWI checklist helps define data readiness for analytics. Download report.

State of the Data Integration Market

Server Consolidation: Leveraging the Benefits of Virtualization

Upgrading to VMware vSphere with vWire

Maximizing website Return on Information with high-quality search

See how AT&T can help protect your network.

Webcast: Unleashing the Power of Customer Data

White Paper: Improve Agility with Operational Responsiveness

White Paper: Legacy Tools: Not Built for the Helpdesk

Taking a Seat at the Executive Table: The Reality of Virtualization

White Paper: Next Generation Remote Infrastructure Management

Keeping Your Members Safe from Online Scams and Predators

The Total Economic Impact of Network Security Intrusion Prevention

Generation Remote Infrastructure Management - Changing the Paradigm

Cloud-Based Email Management: Opinion Shifts In Favor

eBook: How Can You Make Your People Productive Anywhere?

White Paper: Visibility and the New Normal of Mobile Work

Taking the Service Desk to the Next Level

Learn about The Information Technology Infrastructure Library.

Return on Information: Google Enterprise Search pays you back. Get the facts.

VMware. The source for Business Infrastructure Virtualization.

ShoreTel tells businesses to untangle from competitors' complexity and turn to its brilliantly simple UC solution

Top Five CIO Challenges

Read the RSA report: Security for Business Innovation

64-page prescriptive guide to security, compliance, and IT operations.

A Clear View Toward Virtualization

 
 
RESOURCE CENTER