Hadoop

Hadoop news, analysis, how-to, opinion and video. | CIO

credit report
eyeing big data in the cloud

data integration

Splice Machine boosts hybrid relational data platform

The startup adds support for columnar storage, in-memory caching and cost-optimized AWS storage to its hybrid transactional and analytical processing platform.

Splice Machine releases dual-engine RDBMS to open source

The startup hopes to dramatically increase adoption of its dual-engine relational database management system, powered by Apache Hadoop and Apache Spark, by making the technology open source.

data lakes

Why some Data Lakes are built to last

Hadoop-based Data Lakes can be game-changers, but too many are underperforming. Here's a checklist to make your data lake a wild success.

How an online real estate company optimized its Hadoop clusters

Every night, Trulia crunches more than a terabyte of new data and cross-references it with about 2 petabytes of existing data to deliver the most up-to-date real estate information to its users. Here's how it ensures consistent...

Cool new products from big data’s Hadoop World show

There’s a big world of big data tools and services, and many of the leading ones are on display this week at Strata World/Hadoop World in San Jose. From the latest distributions of open source database technology to handy tools for...

AtScale simplifies connecting BI tools to Hadoop

The startup, which specializes in creating OLAP-like virtual cubes on Hadoop, has added a Hybrid Query Service that allows its platform to natively support MDX and SQL queries.

Review: Databricks makes big data dreams come true

Cloud-based Spark machine learning and analytics platform is an excellent, full-featured product for data scientists

Hadoop

Big data gets runtime specification

The ODPi Runtime Specification and test suite is designed to help organizations write Hadoop-based applications once with the confidence that they will run on a variety of Hadoop distributions.

data integration

SAP's HANA Vora bridges divide between enterprise and Hadoop data

The SAP HANA Vora software is designed to allow companies to analyze data stored in Hadoop, enterprise systems and other distributed data sources.

MapR adds in-Hadoop document database

MapR delivers support for containers, security

With the general availability of its Converged Data Platform, MapR Technologies brings Hadoop together with Spark, Web-scale storage, NoSQL and streaming capabilities in a unified cluster designed to support next-generation big data...

Hortonworks and HP Labs join forces to boost Spark

Hortonworks is working with Hewlett Packard Labs to enhance the efficiency and scale of memory for the enterprise and to dramatically improve memory utilization.

Hortonworks release cadence balances innovation with reliable Hadoop core

The Hadoop distribution vendor will update core Apache Hadoop components once a year, while continually updating services that run on top of Hadoop.

How different SQL-on-Hadoop engines satisfy BI workloads

A new benchmark of SQL-on-Hadoop engines Impala, Spark and Hive finds they each have their own strengths and weaknesses when it comes to Business Intelligence (BI) workloads.

Walgreens CIO starts with the customer and works backward

Having already produced a successful mobile application, Walgreens CIO Abhi Dhar is implementing Hadoop to lower the cost of storing and processing the large amounts of data that app is generating.

Q&A

Apache Hadoop turns 10

On the 10-year anniversary of the birth of the Apache Hadoop project, co-creator Doug Cutting reflects on Hadoop's beginnings and where its future lies.

Hadoop

The top 5 Hadoop distributions, according to Forrester

In a new report, Forrester Research’s big data analysts say that adopting Hadoop is “mandatory” for any organization that wishes to do advanced analytics and get actionable insights on its data. But which vendor is best to use?

Tableau partners for BI on Hadoop

Startup AtScale will help Tableau accelerate BI on Hadoop with improvements to performance and enterprise-grade security without the need to move, transform or sample data.

Q&A: Why Syncsort introduced the mainframe to Hadoop

In an interview with IDGE, Josh Rogers and Lonne Jaffe of Syncsort explain how they plan to transform big iron and traditional data warehouse/analytics
