If you find yourself at this post, you either know about Big Data or are at the very least interested in it and in how significant it is today. Loosely put, it is a combination of technologies that enable us to work with massive data sets. Work here entails processing, analyzing, transferring, visualizing and querying data, among other things.
Big data skills are in high demand at the moment. Various data gurus estimate that this demand is only going to grow for at least the next 5 years.
Are you looking for a complete list of all that you need to know to call yourself a big data guru?
I have compiled here the books I believe will be extremely useful on your journey into this amazing field.
1. Hadoop: The Definitive Guide, Tom White
Topics Covered:
• Introduction to Hadoop
• HDFS and YARN deep dive
• The traditional MapReduce API for Java
• Hadoop IO - Sqoop and Flume, Avro and SequenceFiles
2. Hadoop Operations, Eric Sammer, O'Reilly
Topics Covered: Hadoop administration
3. Programming Hive, Capriolo, Wampler and Rutherglen, O'Reilly
Topics Covered: Hive basics
4. Programming Hive, Capriolo, Wampler and Rutherglen, O'Reilly
Topics Covered: Hive UDFs Pig
5. HBase: The Definitive Guide, Lars George, O'Reilly
Topics Covered: HBase
6. Learning Spark: Lightning-Fast Big Data Analysis, Karau, Konwinski, Wendell and Zaharia, O'Reilly
Topics Covered: Spark analysis, intro to the Spark ecosystem
Disclaimer: I am in no way associated with the writers, sellers or distributors of these books.