Monday, November 26, 2012 | | | Doug Henschen | | | | Enterprise data warehouse (EDW) tools and techniques are proven, the SQL query language is well known, and there's plenty of expertise available to keep EDWs humming. But the downside for many warehouses built on relational databases is that they're hard to change because you have to start by schema. When new data sources and new questions arise, the schema and related ETL and BI applications require expensive and time-consuming changes.
Enter Hadoop, which lets you store data on a massive scale at low cost (compared with similarly scaled commercial databases). What's more it easily handles variety, complexity and change because you don't have to conform data to a predefined schema. Sounds great, but where do you find qualified people who know how to use Pig, Hive, Scoop and other tools needed to run Hadoop? And how do you get fast answers out of a batch-oriented platform that relies on slow, iterative MapReduce processing?
Scott Gnau of Teradata and Ben Werther of Platfora square off in an EDW vs. Hadoop debate. Chime in with your comments on the future of data warehousing.
Read More.
Doug Henschen Executive Editor, InformationWeek | | | JOIN THE CONVERSATION Posted By Rick Biehl: "Our warehouses are highly generic, with very few fact columns into which we generically load lots of different kinds of data. Queries are driven by the metadata, not the schema design. It takes only a couple of months to implement new data sources, and we never have to change our schema. So it seems to me that our enterprise warehouse already works in a relational form in much the same way as Hadoop is being described here." In reply to: Big Data Debate: End Near For Data Warehousing? View Entire Response | Post Your Own Reply
| | COMMENTARY REPORTS & WHITEPAPERS FEATURED REPORTS Research: 2013 Analytics & Info Management Trends Predictive analysis is faster, more accurate and more accessible than ever before. And now, combined with big data, it's driving a new age of experimentation. The University of Kentucky and Taylor University are among the new users of advanced analytics. Both schools are using predictive modeling to improve student retention, and their work offers lessons for any business trying to minimize customer churn. Insurers UnitedHealth Group and Allstate are old hands at advanced analytics, but that doesn't mean they're resting on past achievements. In addition to these use cases, this report explores how emerging techniques, including in-database analysis, text mining and crowdsourced competitions, are yielding breakthrough results. Download Now (Registration Required)
| | | | | | FEATURED WHITEPAPERS Decision-Driven Data Management: Better Decisions with Better Data This white paper examines three areas of focus when considering the implementation of an operational data quality process. The first is variables that can disrupt data-driven decision making, such as time, big data, pattern recognition, human biases, intuitions and errors. The second is why decision-driven data management is a strategy for enabling your organization to achieve better decisions with better data. And finally, why closing the decision-data feedback loop is the most critical success factor in data-driven decision making.
Download Now (Registration Required) | RESOURCES AND EVENTS Online Marketing Summit San Diego Attend Online Marketing Summit San Diego and gather the insights and strategies you need to make the right online marketing choices to deliver the most value for your business. OMS San Diego offers three days of inspiration, connections, and practical learning. Register using code WETW01 to secure the early bird rate on Conference Passes or a free Expo Pass. It happens February 11-13
Register today!
| | | | | |
No comments:
Post a Comment