Learning Big Data with Amazon Elastic MapReduce

Overview

Amazon Elastic MapReduce (EMR) is a web service used to process and store vast amounts of data, and it is one of the largest Hadoop operators in the world. As businesses generate and collect more data, and as cost-effective cloud-based solutions for distributed computing have arrived, it has become far more feasible to crunch large amounts of data and get deep insights within a short span of time. Using Elastic MapReduce, you will be able to design and create solutions using Apache Hadoop and execute those solutions on Amazon EMR clusters.

This book will get you started with AWS so that you can quickly create your account and explore the services provided, many of which you might be delighted to use. It covers the architectural details of the MapReduce framework, Apache Hadoop, the various job models on EMR, how to manage clusters on EMR, and the command-line tools available with EMR. Each chapter builds on the knowledge of the previous one, leading to the final chapter, where you will learn to solve a real-world use case using Apache Hadoop and EMR. The book will therefore get you up and running with major big data technologies quickly and efficiently.

Product Details

  • ISBN-13: 9781782173434
  • Publisher: Packt Publishing, Limited
  • Publication date: 10/23/2014
  • Pages: 116

Meet the Author

Amarkant Singh is a big data specialist. As one of the initial users of Amazon's Elastic MapReduce, he has used it extensively to build and deploy many big data solutions. He has been working with Apache Hadoop and EMR for almost 4 years now. He is also a certified AWS solutions architect. As an engineer, he has designed and developed enterprise applications of various scales. He is currently leading the product development team at one of the most happening cloud-based enterprises in the Asia-Pacific region. He is also the all-time top user on Stack Overflow for EMR, as of the time of writing.

Vijay Rayapati is the CEO of Minjar, one of the leading providers of cloud and big data solutions on public cloud platforms. Vijay has over 10 years of experience in building business rule engines, data analytics platforms, and real-time analysis systems used by many leading enterprises across the world, including Fortune 500 businesses. He has worked with various technologies, including LISP, .NET, Java, Python, and many NoSQL databases. He re-architected and led the initial development of a very large-scale location intelligence and analytics platform using Hadoop and AWS EMR. He has worked with many ad networks and with e-commerce, financial, and retail companies, helping them design, implement, and scale their data analysis and BI platforms on AWS Cloud. He is passionate about open source software, large-scale systems, and performance engineering. He is active on Twitter at @amnigos, blogs at amigos.com, and his GitHub profile is https://github.com/amnigos.
