Databricks, the company behind the commercialization of the Apache Spark data-processing framework, is certifying third-party software to run on the platform. Spark is gaining popularity as a faster, easier alternative to… Read more »
Sqrrl co-founder and VP of business development Ely Kahn came on the Structure Show this week to break down the state of cybersecurity and the cutting edge of data analysis within… Read more »
Cloudera is working on an open source project called Oryx that aims bring machine learning to Hadoop in a way that previous attempts such as Apache Mahout could not. Read more »
Apache Spark, an in-memory data-processing framework, is now a top-level Apache project. That’s an important step for Spark’s stability as it increasingly replaces MapReduce in next-generation big data applications. Read more »
It didn’t take long for the Hadoop market to become a juggernaut, and it won’t take long for it to undergo some significant technological changes. Cloudera co-founder and chief strategy officer… Read more »
MapR is continuing along its path to Hadoop glory with new support for the YARN resource manager and a direct integration with the HP Vertica analytic database. In such a competitive… Read more »
Red Hat and Hortonworks are integrating a number of technologies to give joint customers a more seamless experience running their Hadoop workloads on private cloud or virtualized infrastructure. In an upstart… Read more »
Hadoop can stretch necessary network investments by a modest yet meaningful amount. Here are a few ways carriers are already using it to improve their networks and keep costs down. Read more »
Making sense of big data can be hard enough without spending untold hours having to write code or manually clean datasets that simply won’t work with existing BI tools. Trifacta is… Read more »