White House Set to Unleash 100,000 Federal Data Sources Via Data.Gov

The U.S. plans to make more than 100,000 data sources available by the end of next week on its data.gov site, in what may be the real start of government's effort to share its vast database with the world.

By Patrick Thibodeau
Thu, June 04, 2009

Computerworld — WASHINGTON -- The U.S. plans to make more than 100,000 data sources available by the end of next week on its data.gov site, in what may be the real start of government's effort to share its vast database with the world.

Data.gov has been open for business for about two weeks but with fewer than 100 data sources available it's now just a teaser of a site.

Data.gov is cataloging data and presenting it in standard formats, such as CVS or XLS, or Keyhole Markup Language (KML) used in Google Earth and XML, among others. In many cases, agencies will develop widgets and other tools make the data accessible and interesting. A simple example is the FBI's Top Ten Wanted widget.

But the real test will be public adoption. Federal CIO Vivek Kundra said the effort to build out data.gov is "a very high priority" because he believes it has the potential of "unlocking the innovation and tapping into the ingenuity" of the private sector as well as Americans generally. Users will also be able to rank data sets on their utility, usefulness, and ease of access.

Over time, the U.S. will continue to expand the data sets, as well as add tools to help users extract and work with government data.

Kundra's hope is that people will take data from multiple sources and develop new insights. "The intersection of true value is generally around multiple disciplines," he said, in a briefing today with reporters.

Kundra said he doesn't know how many sources are available. The U.S. has more than 10,000 systems, some of which contain rivers of data but getting at it may take investments and more processing power to serve up the information, he said.

As the U.S. upgrades systems, a core requirement will be to ensure the new systems are capable of data sharing. But government transparency will be the "default," he said.

The Sunlight Foundation in Washington is running a contest for developers, with some $20,000 in prize money, to build anything from client applications, iPhone applications, Web-based apps, working with federal data. The contest's first criteria: "Does the app help citizens see things that they see before the app existed?"

Sunlight Labs director Clay Johnson said that most of the government is now doing is consolidating data that is already public but is often difficult to find. He said that creating a catalog is no small thing considering that there may be may be forgotten gold mines of data in government systems.

What may be the test for the government over time is whether it is willing to release data that hasn't been easily available, such as financial disclosure forms for Senate appointed administration officials. "Don't just release the data that convenient for you to release, release the data that should be released," said Johnson.

Learn how your answer to this question compares to your peers by taking this quick poll. See how your peers are dealing with the challenge of ensuring a highly capable server infrastructure as technological shifts impact the application server platform.
With increasing data growth, comes increased need for data security.  The existing DLP model, with a focus on compliance/enforcement is not sufficient as the data discovery and classification capabilities are not granular enough.  Read this paper to find how you can efficiently and accurately manage your risk by rapidly inventorying and classifying your data and then developing remediation workflows that support business needs. 
This paper breaks down attack sources into four categories: external, malicious insiders, accidental insiders, and unknown.
The rapid growth of data and technology is creating challenges for organizations as this digital data is considered to be business communications and must be preserved according the same industry-specific regulations governing the retention and discovery of emails and more traditional forms of electronic communications. This paper examines the role that Data Loss Prevention ("DLP") technology can play in helping organizations address the challenges of locating information in response to electronic discovery.
This research, conducted by the Ponemon Institute, focuses on issues relating to the use of data protection solutions such as endpoint encryption and data loss prevention within the workplace.
This report, by Jon Oltsik from Enterprise Strategy Group, examines the need for a new business-centric approach to DLP in order to align business and security requirements.
As greater numbers of datacenter servers transition from the physical to the virtual world, the components of virtualization success come to the fore. What scores of organizations have discovered is that success is derived from an optimal pairing of the right software platform with the right hardware platform.
Have you been looking to hear about customer's experiences with the new VMware vCenter Site Recovery Manager product? View this webcast to learn about VMware customer, Navicure, and their experiences testing and evaluating the recovery manager, their progress in implementing it in their environment and their advice other customers considering using vCenter.
Many enterprises have discovered that the use of virtualization to support desktop workloads creates a range of significant benefits. These benefits include price efficiencies, improved IT management and greater agility and choice for end users.

This VMware sponsored webcast with IDC will provide both quantitative measurement of the business value -- defined as the expected ROI -- and qualitative analysis associated with the use of VMware View™. IDC will also provide an analysis of the View Composer and ThinApp™ features of VMware View, including the business value of these solutions and an overview of how they work.

Attend this webcast to learn about:
- Challenges and barriers that might impede the adoption of desktop virtualization
- Navigating roadblocks to facilitate a strategic implementation
- Optimizing qualitative and quantitative benefits to IT and your business
VMware recently announced VMware vFabric™ Data Director, a new database deployment and operations platform that enables enterprise IT organizations to offer database as a private cloud service. Built on top of VMware vSphere 5, vFabric Data Director enables IT organizations to ontrol database sprawl through automation and consistent policy enforcement and accelerate application development cycles with self-service database management. Attend this webcast to learn how vFabric Data Director can help you build database-as-a-service in your datacenter.
A simple, cost-effective disaster-recovery solution for virtual environments is high on the agenda for IT organizations as they virtualize more business-critical applications with VMware. VMware vCenter™ Site Recovery Manager-the market-leading disaster-recovery product-ensures the simplest and most reliable disaster protection for all virtualized applications. VMware vCenter Site Recovery Manager provides centralized management of recovery plans, enables nondisruptive testing and automates site-failover processes.
Traditional disaster recovery solutions are often too expensive, complex and unreliable to meet business requirements. As a result, IT departments are hesitant to expand disaster protection beyond their most critical applications, largely because they are uncertain whether the quality of the protection is really worth its cost. VMware vCenter™ Site Recovery Manager 5 is the market-leading disaster recovery product that addresses this situation for organizations of all kinds. It complements VMware vSphere to ensure the simplest and most reliable disaster protection for all virtualized applications.
Newsletter Sign-Up »

Receive the latest news test, reviews and trends on your favorite technology topics

Choose a newsletter
  1. View all Newsletters | Privacy Policy
Resource Center