The social scrapbook and visual discovery site, which processes about a petabyte of data daily, uses the Hadoop-as-a-service startup Qubole to handle Hadoop jobs and Puppet for its data processing configuration. Read more »
As Hadoop moves towards establishing itself as a key data management platform for the enterprise, there is a new set of challenges it must meet to be regarded as a true… Read more »
In the second quarter of 2014, new de facto standards emerged and galvanized, major cloud providers launched new analytics offerings, and mainstream databases began to take on attributes and capabilities of… Read more »
Analytics is all the rage, with Hadoop and big data leading the hype. But the new technologies are not yet mature, a market full of startups is yet to shake out,… Read more »
Facebook’s open source engine for interactive queries on Hadoop is now available as a cloud service thanks to startup Qubole. Facebook claims Presto is 10 times faster than Hive for most queries. Read more »
A team of professors behind the open source Spark and Shark in-memory big data projects has raised $13.9 million to commercialize the products via a company called Databricks. Spark and Shark… Read more »
Hortonworks is making progress on its mission (via a project called Stinger) to speed up SQL-like queries in Hadoop using Apache Hive. New features in the latest version of Hortonworks’ Hadoop… Read more »
10gen has added some new features to its MongoDB connector for Hadoop, including support for Hive and the ability to back up MongoDB files in HDFS. Read more »