Data Architecture: A Primer for the Data Scientist: Big Data, Data Warehouse and Data Vault

Overview

Today, the world is trying to create and educate data scientists because of the phenomenon of Big Data. And everyone is looking deeply into this technology. But no one is looking at the larger architectural picture of how Big Data needs to fit within the existing systems (data warehousing systems). Taking a look at the larger picture into which Big Data fits gives the data scientist the necessary context for how pieces of the puzzle should fit together. Most references on Big Data look at only one tiny part of a ...

See more details below
Sending request ...

Overview

Today, the world is trying to create and educate data scientists because of the phenomenon of Big Data. And everyone is looking deeply into this technology. But no one is looking at the larger architectural picture of how Big Data needs to fit within the existing systems (data warehousing systems). Taking a look at the larger picture into which Big Data fits gives the data scientist the necessary context for how pieces of the puzzle should fit together. Most references on Big Data look at only one tiny part of a much larger whole. Until data gathered can be put into an existing framework or architecture it can't be used to its full potential. Data Architecture a Primer for the Data Scientist addresses the larger architectural picture of how Big Data fits with the existing information infrastructure, an essential topic for the data scientist.

Drawing upon years of practical experience and using numerous examples and an easy to understand framework. William Inmon, Jay Brophy and Daniel Linstedt define the importance of data architecture and how it can be used effectively to harness big data within existing systems. You'll be able to:

  • Turn textual information into a form that can be analyzed by standard tools.
  • Make the connection between analytics and Big Data
  • Understand how Big Data fits within an existing systems environment
  • Conduct analytics on repetitive and non-repetitive data
  • Discusses the value in Big Data that is often overlooked, non-repetitive data, and why there is significant business value in using it
  • Shows how to turn textual information into a form that can be analyzed by standard tools.
  • Explains how Big Data fits within an existing systems environment
  • Presents new opportunities that are afforded by the advent of Big Data
  • Demystifies the murky waters of repetitive and non-repetitive data in Big Data
Read More Show Less

Product Details

  • ISBN-13: 9780128020449
  • Publisher: Elsevier Science
  • Publication date: 12/29/2014
  • Pages: 304

Meet the Author

Dan Linstedt is founder and principle of Empowered Holdings, LLC - a holding company for LearnDataVault.com, and RapidGenDS.com. LearnDataVault.com and a is a world-renowned expert in Data Warehousing and Business Intelligence. He has 20+ years of experience in the IT industry, and has worked with companies like Nike, PepsiCo, Amex, and Visa. His experience extends through data modeling, process design to ETL/ELT performance and tuning. He has a background in SEI/CMMI Level 5, and has contributed architecture efforts to petabyte scale data warehouses offers high quality on-line training and consulting services for Data Vault. He is the inventor and founder of the Data Vault modeling and methodology.
Read More Show Less

Table of Contents

  1. Corporate Data
  2. Big Data
  3. Data Warehouse
  4. Data Vault
  5. Operational Systems
  6. Architecture
  7. Analysis and Visualization of Data
  8. Analytics for Structured Data
  9. Analytics for Unstructured Repetitive Data
  10. Analytics for Unstructured Non-Repetitive Data
  11. Glossary of Terms
Read More Show Less

If you find inappropriate content, please report it to Barnes & Noble
Why is this product inappropriate?
Comments (optional)