An edition of Big data analytics for beginners (2014)

Big data analytics for beginners

  • 0 Ratings
  • 2 Want to read
  • 0 Currently reading
  • 0 Have read

My Reading Lists:

Create a new list

Check-In

×Close
Add an optional check-in date. Check-in dates are used to track yearly reading goals.
Today

  • 0 Ratings
  • 2 Want to read
  • 0 Currently reading
  • 0 Have read

Buy this book

Last edited by ImportBot
March 17, 2022 | History
An edition of Big data analytics for beginners (2014)

Big data analytics for beginners

  • 0 Ratings
  • 2 Want to read
  • 0 Currently reading
  • 0 Have read

We've had a drastic growth in volume of information in recent years, so it could be a challenge or in another view an opportunity for big companies and high turnover corporations. Although we've had different types of databases and also frameworks for analyzing data, this amount of data could make a indissoluble problem for those traditional systems. In this book we are going to show you a new way for analyzing this amount of data.

Publish Date
Publisher
Crux Tech Limited
Language
English
Pages
178

Buy this book

Previews available in: English

Edition Availability
Cover of: Big data analytics for beginners
Big data analytics for beginners
2014, Crux Tech Limited
in English

Add another edition?

Book Details


Table of Contents

Interoduction: What is Big Data ; Defining structured data ; Defining unstructured data ; Rethinking data management ; Big data capabilities ; Is new technology needed? ; Big data or business intelligence ; The value of big data.
Chapter 1. Hadoop: What is Hadoop? ; Hadoop vs. RDBMS ; Hadoop installation and running ; Cluster Mode installing and running ; Hadoop startup ; Hadoop shutdown.
Chapter 2. MapReduce: Apache Hadoop core components ; Hadoop distributed file system (HDFS) ; MapReduce ; Underneath of MapReduce process ; Hadoop data flow instructor ; Fault tolerance ; Speculative execution ; Key-value pair databases in a Big Data environment ; MapReduce algorithms ; General reducer-side join ; Optimized reducer-side join ; Map-size partition join ; Map-side partition merge join ; TF-IDF and Map-Reduce ; Implementation in Apache PIG ; Different MapReduce languages ; YARN.
Chapter 3. HDFS: A brief history ; Overview of HDFS ; Reasons for downtime ; Use cases ; HDFS formation process ; HDFS architecture ; Configuring HDFS ; Interacting with HDFS.
Chapter 4: HBase: What is HBase? ; Columnar databases ; Bloom filter ; Why and when HBase? ; HBase architecture ; Tables, rows, columns, and cells in HBase ; Master and region server ; Data model operations ; Better performance.
Appendix & references: Sqoop ; Prerequisites ; Usage of Sqoop ; Installing and running Sqoop ; Controlling the Hadoop installation ; Generic and specific arguments ; Connecting to a database server ; Controlling parallelism ; Best practices for selecting Apache Hadoop hardware.

Edition Notes

Book and table of contents contain spelling and grammatical errors.

Published in
[India]

Classifications

Dewey Decimal Class
006.312
Library of Congress
QA76.9.D343 .R33 2014

The Physical Object

Pagination
178 pages
Number of pages
178

ID Numbers

Open Library
OL37762560M
Internet Archive
bigdataanalytics0000rabb
ISBN 10
1495387348
ISBN 13
9781495387340
OCLC/WorldCat
892554067

Community Reviews (0)

Feedback?
No community reviews have been submitted for this work.

History

Download catalog record: RDF / JSON
March 17, 2022 Created by ImportBot import new book