Data Trends: Petabyte and Beyond

By Fred Hapgood

PAGE 3

Searches conducted on large volumes of data naturally generate more errors. At some point, the number of errors so overwhelm the user’s ability to cope that the system essentially becomes useless. The only solution is to rewrite the search programs so that they make fewer errors, and no IT development task is harder to do predictably than boosting the IQ of computer programs. Finally, according to John Parkinson, CTO of Cap Gemini Ernst & Young for the America’s region, even the cost of the core overhead tasks (such as buffer management) typically grow faster than linearly.

One school of thought is that transition to petabyte levels is just not worth it.

Faisal Shah, cofounder and CTO of Chicago-based systems integrator Knightsbridge Solutions, says that data quality naturally drifts down as more space opens up in the corporate attic, in part because you are now saving things you used to throw away. Shah believes that companies will be better off spending their now-restricted IT dollars on trying to extract more intelligence from current data stores rather than piling up haystacks with fewer and fewer needles hidden in them.

Petabyte Solutions

Other observers are betting that new technologies will be able to keep those penalties under control. Like many IT problems, the solutions being explored fall along the spectrum of centralized to distributed.

Ron Davis, senior IT architect of Equifax, the Atlanta-based consumer data company, is working with a centralized management solution from Corworks. Equifax’s business is to buy raw data from state agencies or directory companies, and turn it into information products. Equifax wants to control the data it buys for as long as possible as it never knows what a new product design might call for or when. While the data could, in theory, be left with its suppliers, Davis’s experience is that retention policies and practices vary too widely over Equifax’s 14,000 data sources to make such dependence practical. He believes that at least over the short run, companies near the end of the value chain will have to take on the responsibility of archiving raw data. Shouldering this responsibility has put Equifax on the road to becoming a petabyte company, and it has forced Davis to search for an architecture competent to deal with the petabyte problems of cost, error and time.

Corworks’ basic idea is to beat the time penalties inherent in handling large volumes of data by loading it into electronic memory. This seems counterintuitive, rather like making a quart easier to drink by squeezing it into a pint, but the feat is done by stripping out the structural data (such as converting everything into flat files), compressing the result, and then relying on fast processors to decompress and restore the data structures only as needed. In other words, just-in-time logic.

$firstKeyword

Loading...
Data Center MarketSpace
From Chaos to Order-Winning the Information Management Game
Learn how Oracle Application Express delivers an easy, fast, and free way to manage your business information. Learn more »
Optimizing Information Insight
This paper will argue that the key to enabling midsize organizations to make even better business decisions is by simplifying the extraction of specific, actionable information from large volumes of data. Learn more »
Looking for a fast payback?
Learn how you can boost ROI and productivity with a JDE technology refresh. Learn more »
3 Minutes with Free Tool Can Save Thousands!
See how you can improve decision-making while reducing your total cost of ownership through process efficiencies and technology simplification. Learn more »
Informatica 9: What it means for the CIO?
Hear from Informatica's CIO on how Informatica 9 will improve... Learn more »
Lower Costs with New Servers and Consolidation
When it comes to server technology staying the course will cost you. Lower costs and create an efficient datacenter with newer server technology. Learn more »
 
SPONSORED LINKS
 

White Paper: Right-Sizing Your Power Infrastructure

Lower IT Costs with Oracle Database 11g Release 2

New technology that addresses challenges organizations are facing.

Return on Information: Google Enterprise Search pays you back

Cut Costs & Green Your IT Operations with PC Power Management

White Paper: 4 Customer Service Myths

White Paper: Managed Security for a Not-So-Secure World

White Paper: 5 Best Practices for Smartphone Support

Global Research: CIOs Weigh In On Virtualization

5 Key Virtualization Management Challenges

Secure Email and Web-Based Communication from Evolving Attacks

WagerWorks Takes Fraudsters Out of the Game using iovation

Seven Design Requirements for Web 2.0 Threat Protection

Increase UPS efficiency without sacrificing protection.

Learn how advanced forecasting tools can deliver significant business results for global corporations.

Achieving Business Agility with Application Grid

Ready to virtualize tier one applications? Check your virtualization maturity.

Seven Ways ITIL Can Help You in an Economic Downturn

Tips for successful virtualization management.

AT&T Synaptic Storage as a Service. Expand on demand

Trend Micro ranked #1 against real-world malware. Read more.

Webinar: Jump-start your in-house e-discovery with Ringtail QuickCull from FTI Technology

Streamline IT Costs. Boost Performance with WAN Optimization.

Build your 1st app FREE with Force.com

TDWI checklist helps define data readiness for analytics. Download report.

State of the Data Integration Market

Server Consolidation: Leveraging the Benefits of Virtualization

Upgrading to VMware vSphere with vWire

Maximizing website Return on Information with high-quality search

See how AT&T can help protect your network.

Webcast: Unleashing the Power of Customer Data

White Paper: Improve Agility with Operational Responsiveness

White Paper: Legacy Tools: Not Built for the Helpdesk

Taking a Seat at the Executive Table: The Reality of Virtualization

White Paper: Next Generation Remote Infrastructure Management

Keeping Your Members Safe from Online Scams and Predators

The Total Economic Impact of Network Security Intrusion Prevention

Generation Remote Infrastructure Management - Changing the Paradigm

Cloud-Based Email Management: Opinion Shifts In Favor

eBook: How Can You Make Your People Productive Anywhere?

White Paper: Visibility and the New Normal of Mobile Work

Taking the Service Desk to the Next Level

Learn about The Information Technology Infrastructure Library.

Return on Information: Google Enterprise Search pays you back. Get the facts.

VMware. The source for Business Infrastructure Virtualization.

ShoreTel tells businesses to untangle from competitors' complexity and turn to its brilliantly simple UC solution

Top Five CIO Challenges

Read the RSA report: Security for Business Innovation

64-page prescriptive guide to security, compliance, and IT operations.

A Clear View Toward Virtualization

 
 
RESOURCE CENTER