Hadoop

Hadoop news, analysis, how-to, opinion and video. | CIO

3 keys to keep your data lake from becoming a data swamp

Cloudera’s IPO is overshadowed by a rival it won’t mention

The big data poster child is selling itself to investors as a machine learning company, but is really still in the Hadoop business just like Hortonworks, the competitor it would like to ignore.

21 hot programming trends—and 21 going cold

Hot or not? From the web to the motherboard to the training ground, get the scoop on what's in and what's out in app dev

Why Google BigQuery excels at BI on big data concurrency

Should you use Hadoop for your big data business intelligence needs? BigQuery? What's the difference between on-premises Hadoop, in-cloud Hadoop and a serverless model like Google BigQuery's? A new benchmark from AtScale seeks to help...

New Hortonworks release focuses on SQL and EDW

With the release of its HDP 2.6 Hadoop distribution, Hortonworks seeks to help organizations boost the performance of interactive SQL queries and optimize their existing enterprise data warehouse investments.

Jethro automates data engineering tasks for BI on Hadoop

By automating costly and time-consuming data engineering tasks associated with business intelligence on Hadoop, Jethro aims to accelerate BI queries.

Recharge your knowledge of the modern data warehouse

Data warehousing is evolving from centralized repositories to logical data warehouses leveraging data virtualization and distributed processing. Make sure you’re not using old terminology to explain new initiatives.

How Hadoop helps Experian crunch credit reports

Experian is quickly crunching massive amounts of data and making it available to customers thanks to the open source software as well as microservices and API technologies.

Businesses eye cloud for big data deployments

Big data cloud deployments have surged over the past year and business intelligence is now the number 1 big data workload, according to a new survey of big data professionals.

Splice Machine boosts hybrid relational data platform

The startup adds support for columnar storage, in-memory caching and cost-optimized AWS storage to its hybrid transactional and analytical processing platform.

Splice Machine releases dual-engine RDBMS to open source

The startup hopes to dramatically increase adoption of its dual-engine relational database management system, powered by Apache Hadoop and Apache Spark, by making the technology open source.

Why some Data Lakes are built to last

Hadoop-based data lakes can be game-changers, but too many are underperforming. Here's a checklist to make your data lake a wild success.

How an online real estate company optimized its Hadoop clusters

Every night, Trulia crunches more than a terabyte of new data and cross-references it with about 2 petabytes of existing data to deliver the most up-to-date real estate information to its users. Here's how it ensures consistent...

Cool new products from big data’s Hadoop World show

There’s a big world of big data tools and services, and many of the leading ones are on display this week at Strata World/Hadoop World in San Jose. From the latest distributions of open source database technology to handy tools for...

AtScale simplifies connecting BI tools to Hadoop

The startup, which specializes in creating OLAP-like virtual cubes on Hadoop, has added a Hybrid Query Service that allows its platform to natively support MDX and SQL queries.

Review: Databricks makes big data dreams come true

Cloud-based Spark machine learning and analytics platform is an excellent, full-featured product for data scientists

Big data gets runtime specification

The ODPi Runtime Specification and test suite are designed to help organizations write Hadoop-based applications once with the confidence that they will run on a variety of Hadoop distributions.

SAP's HANA Vora bridges divide between enterprise and Hadoop data

The SAP HANA Vora software is designed to allow companies to analyze data stored in Hadoop, enterprise systems and other distributed data sources.

MapR adds in-Hadoop document database

MapR delivers support for containers, security

With the general availability of its Converged Data Platform, MapR Technologies brings Hadoop together with Spark, Web-scale storage, NoSQL and streaming capabilities in a unified cluster designed to support next-generation big data...
