Amazon S3 Lets Customers Ship Big Data

Amazon's S3 cloud storage service has a new option, called AWS Import/Export, for quickly uploading large amounts of information to its data centers. It uses a well-developed, multimodal content delivery network that can transmit terabytes of data faster than a T-3 leased line.

By Stephen Lawson
Thu, May 21, 2009

IDG News Service — Amazon's S3 cloud storage service has a new option, called AWS Import/Export, for quickly uploading large amounts of information to its data centers. It uses a well-developed, multimodal content delivery network that can transmit terabytes of data faster than a T-3 leased line.

The fact that this network is based on jets, trucks and messengers with walkie-talkies doesn't make it any less useful to enterprises, many of which have been using overnight shipping services for backups for several years, according to 451 Group analyst Henry Baltazar. Just make sure the data's encrypted in case it falls off the back of a truck or otherwise gets lost, he said.

AWS (Amazon Web Services) described the new service in a recent blog posting. AWS Import/Export from Amazon Web Services lets customers send virtually unlimited amounts of data to Amazon when they want to start using S3 for the first time, back up their content offsite, or streamline the Direct Data Interchange process with their partners. All customers will have to do is copy their data to a device, such as an external hard drive, create a manifest file with authentication information and a digital signature, e-mail loading instructions and ship the device. AWS lays out guidelines for the storage devices on an information page at its Web site.

When it arrives, the device will go to an AWS Import/Export station and the data will be loaded onto the customer's S3 data bucket, generally the next business day. Customers will pay US$80 per device handled and $2.49 per hour for the labor involved in loading the data, plus the standard charges for storing that data on S3. The service is available now in beta testing, for importing only, but will be expanded to include exporting in the coming months, Amazon said.

With many enterprise Internet connections, Import/Export will often be faster than online uploads or downloads, according to Amazon. For example, on a T-1 leased line (1.5Mb per second), with 80 percent of that line devoted to the transfer, it would take 82 days to send 1TB of data, Amazon said. As a general rule, S3 customers with only a T-1 should think about using Import/Export for sending 100GB or more of data, the company said.

Even a faster T-3 leased line (just under 45Mb per second) would take three days to send 1TB, so shipping would be a good option for anything above 2TB, Amazon said. A Gigabit Ethernet Internet connection could send 1TB in less than a day, Amazon said. But even if an enterprise is using a metro Ethernet link like that, it's unlikely to have that amount of capacity all the way to Amazon, 451's Baltazar pointed out.

"If you really have to have that data up there fast, it does make sense," Baltazar said. The method isn't new: For example, when banks set up new branches and want to have large amounts of information available on site, they typically ship drives because they don't have days to wait for a transfer. Online backup and disaster-recovery vendors also offer this approach. It's developed in just the past few years as the growth in data, driven by multimedia, has outpaced the acceleration of Internet connections, he said.

What's new is that Amazon, a cloud storage provider that offers more than just backup, is using the technique. The core business model for AWS is providing storage on S3 for applications that run on Amazon's EC2 cloud computing infrastructure.

Learn how your answer to this question compares to your peers by taking this quick poll. See how your peers are dealing with the challenge of ensuring a highly capable server infrastructure as technological shifts impact the application server platform.
With increasing data growth, comes increased need for data security.  The existing DLP model, with a focus on compliance/enforcement is not sufficient as the data discovery and classification capabilities are not granular enough.  Read this paper to find how you can efficiently and accurately manage your risk by rapidly inventorying and classifying your data and then developing remediation workflows that support business needs. 
This paper breaks down attack sources into four categories: external, malicious insiders, accidental insiders, and unknown.
The rapid growth of data and technology is creating challenges for organizations as this digital data is considered to be business communications and must be preserved according the same industry-specific regulations governing the retention and discovery of emails and more traditional forms of electronic communications. This paper examines the role that Data Loss Prevention ("DLP") technology can play in helping organizations address the challenges of locating information in response to electronic discovery.
This research, conducted by the Ponemon Institute, focuses on issues relating to the use of data protection solutions such as endpoint encryption and data loss prevention within the workplace.
This report, by Jon Oltsik from Enterprise Strategy Group, examines the need for a new business-centric approach to DLP in order to align business and security requirements.
As greater numbers of datacenter servers transition from the physical to the virtual world, the components of virtualization success come to the fore. What scores of organizations have discovered is that success is derived from an optimal pairing of the right software platform with the right hardware platform.
Have you been looking to hear about customer's experiences with the new VMware vCenter Site Recovery Manager product? View this webcast to learn about VMware customer, Navicure, and their experiences testing and evaluating the recovery manager, their progress in implementing it in their environment and their advice other customers considering using vCenter.
Many enterprises have discovered that the use of virtualization to support desktop workloads creates a range of significant benefits. These benefits include price efficiencies, improved IT management and greater agility and choice for end users.

This VMware sponsored webcast with IDC will provide both quantitative measurement of the business value -- defined as the expected ROI -- and qualitative analysis associated with the use of VMware View™. IDC will also provide an analysis of the View Composer and ThinApp™ features of VMware View, including the business value of these solutions and an overview of how they work.

Attend this webcast to learn about:
- Challenges and barriers that might impede the adoption of desktop virtualization
- Navigating roadblocks to facilitate a strategic implementation
- Optimizing qualitative and quantitative benefits to IT and your business
VMware recently announced VMware vFabric™ Data Director, a new database deployment and operations platform that enables enterprise IT organizations to offer database as a private cloud service. Built on top of VMware vSphere 5, vFabric Data Director enables IT organizations to ontrol database sprawl through automation and consistent policy enforcement and accelerate application development cycles with self-service database management. Attend this webcast to learn how vFabric Data Director can help you build database-as-a-service in your datacenter.
A simple, cost-effective disaster-recovery solution for virtual environments is high on the agenda for IT organizations as they virtualize more business-critical applications with VMware. VMware vCenter™ Site Recovery Manager-the market-leading disaster-recovery product-ensures the simplest and most reliable disaster protection for all virtualized applications. VMware vCenter Site Recovery Manager provides centralized management of recovery plans, enables nondisruptive testing and automates site-failover processes.
Traditional disaster recovery solutions are often too expensive, complex and unreliable to meet business requirements. As a result, IT departments are hesitant to expand disaster protection beyond their most critical applications, largely because they are uncertain whether the quality of the protection is really worth its cost. VMware vCenter™ Site Recovery Manager 5 is the market-leading disaster recovery product that addresses this situation for organizations of all kinds. It complements VMware vSphere to ensure the simplest and most reliable disaster protection for all virtualized applications.
Newsletter Sign-Up »

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all Newsletters | Privacy Policy
Resource Center