9 Open Source Big Data Technologies to Watch

With more and more companies storing more and more data and hoping to leverage it for actionable insights, Big Data is making a big splash these days. Open source technology is at the core of most Big Data initiatives. Here are nine key open source Big Data technologies to keep an eye on.
  • 1 of 9

Apache Hadoop

Apache Hadoop is an open source software framework for data-intensive distributed applications originally created by Doug Cutting to support his work on Nutch, an open source Web search engine. To meet Nutch's multimachine processing requirements, Cutting implemented a MapReduce facility and a distributed file system that together became Hadoop. He named it after his son's toy elephant. Through MapReduce, Hadoop distributes Big Data in pieces over a series of nodes running on commodity hardware. Hadoop is now among the most popular technologies for storing the structured, semi-structured and unstructured data that comprise Big Data. Hadoop is available under the Apache License 2.0.

Return to slideshow directright
Join the discussion
Be the first to comment on this article. Our Commenting Policies