Research: Extending Hadoop Towards the Data Lake
Early adopters of the data lake are integrating Hadoop into current workflows and addressing challenges around the cleanliness, validity, and protection of their data. Read more »
There was a time, a little over two years ago, when SQL-on-Hadoop was about cracking open access to Hadoop data for those with SQL skillsets and eliminating the exclusivity of access… Read more »
With virtually every Hadoop distribution vendor offering SQL-on-Hadoop solutions, the key factor in the market is now the integration between Hadoop and data warehouse technology. Read more »
Projects like Apache YARN expand the types of workloads for which Hadoop is a viable and compelling solution, leading practitioners to think more creatively about managing data. Read more »
EMC-VMware spinoff Pivotal is putting its money behind Tachyon, an in-memory distributed file system developed by the same research lab that created Apache Spark. The goal is to improve the company’s… Read more »
New features included in Hadoop’s latest releases go some way towards freeing an increasingly capable data platform from the constraints of its early dependence on one specific technical approach: MapReduce. Read more »