Big data: messy, difficult and valuable

People have been talking about ‘big data’ for years, but the buzz has now intensified as more have begun to understand its potential and look for ways to exploit it for their organisations.

One of the big challenges for CIOs is to make use of the relevant tools, and to create a culture in which others appreciate how they can support the effort.

Big data is a collection of data sets too large and complex for regular database management tools, in volumes of petabytes (1m gigabytes), even exabytes (1bn gigabytes).

It makes it possible to measure human, business or scientific patterns in fine detail, and can provide highly valuable insights to support the development of products and services.

The potential to exploit big data is increasing along with the overall volume.

It can take in streams of data from sources such as digital sensors and cameras, which can track industrial activity and environmental change, and social media, which can provide evidence of people’s attitudes and preferences.

But it is very messy. As the volume of data grows it can be expensive to store, requires multiple servers for processing, and there is no ‘silver bullet’ IT solution.

There are software frameworks that can be used in managing big data, such as MapReduce, which supports developers in writing relevant programmes; but it still requires a lot of work to establish how the data should be split, valued then pulled together.

In addition, the process is aimed at extracting packets of information that have a high value for the organisation, and these are unlikely to align cleanly with the original structure of the dataset, and to come from only a small proportion of the total.

Some experts have pointed out that as more data is used much of it is duplicated – just think about data back-ups or tweets that are retweeted - and this reduces the proportion of extracted information against the total.

It is also necessary to convey the results in terms that make sense to business leaders. Data has to be presented in terms that are clearly relevant to the challenges and opportunities facing an organisation, and this requires the specialists to tell a story that others can understand.

To have any chance of exploiting big data successfully, you need people with the programme writing skills to identify the information amid the mountain of data.

A recent report by the Said Business School of Oxford, Analytics: the real world use of big data, suggests that organisations have been acquiring some of these skills: a worldwide survey of businesses and IT professionals showed that about three-quarters now have big data projects either in development or under way.

1 2 Page 1
Page 1 of 2
7 secrets of successful remote IT teams