Big Data Analytics with Java.

Learn the basics of analytics on big data using Java, machine learning and other big data toolsAbout This Book* Acquire real-world set of tools for building enterprise level data science applications* Surpasses the barrier of other languages in data science and learn create useful object-oriented co...

Full description

Saved in:
Bibliographic Details
Online Access: Full text (MCPHS users only)
Main Author: Mehta, Rajat
Format: Electronic eBook
Language:English
Published: Birmingham : Packt Publishing, 2017
Subjects:
Local Note:ProQuest Ebook Central

MARC

LEADER 00000cam a2200000uu 4500
001 in00000088086
006 m o d
007 cr |n|---|||||
008 170805s2017 enk o 000 0 eng d
005 20240626182832.2
019 |a 999636124  |a 999643773  |a 1000026927  |a 1003526396 
020 |a 9781787282193 
020 |a 1787282198 
029 1 |a AU@  |b 000067111656 
035 |a (OCoLC)999654609  |z (OCoLC)999636124  |z (OCoLC)999643773  |z (OCoLC)1000026927  |z (OCoLC)1003526396 
035 |a (OCoLC)ocn999654609 
037 |a 1024806  |b MIL 
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d IDEBK  |d YDX  |d OCLCQ  |d MERUC  |d COO  |d OCLCQ  |d WYU  |d OCLCQ  |d LVT  |d CNCEN  |d OCLCO  |d OCLCF  |d OCLCQ  |d OCLCO  |d OCLCQ  |d OCLCO  |d OCLCL  |d OCLCQ 
050 4 |a T55.4-60.8 
082 0 4 |a 005.133  |q OCoLC  |2 23/eng/20240417 
100 1 |a Mehta, Rajat. 
245 1 0 |a Big Data Analytics with Java. 
260 |a Birmingham :  |b Packt Publishing,  |c 2017. 
300 |a 1 online resource (419 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
500 |a Advantages of naïve bayes. 
505 0 |a Cover ; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Customer Feedback; Table of Contents; Preface; Chapter 1: Big Data Analytics with Java ; Why data analytics on big data?; Big data for analytics; Big data -- a bigger pay package for Java developers; Basics of Hadoop -- a Java sub-project; Distributed computing on Hadoop; HDFS concepts; Design and architecture of HDFS; Main components of HDFS; HDFS simple commands; Apache Spark; Concepts; Transformations; Actions; Spark Java API; Spark samples using Java 8; Loading data; Data operations -- cleansing and munging. 
505 8 |a Analyzing data -- count, projection, grouping, aggregation, and max/minActions on RDDs; Paired RDDs; Saving data; Collecting and printing results; Executing Spark programs on Hadoop; Apache Spark sub-projects; Spark machine learning modules; Mahout -- a popular Java ML library; Deeplearning4j -- a deep learning library; Summary; Chapter 2: First Steps in Data Analysis ; Datasets; Data cleaning and munging; Basic analysis of data with Spark SQL; Building SparkConf and context; Dataframe and datasets; Load and parse data; Analyzing data -- the Spark-SQL way. 
505 8 |a Spark SQL for data exploration and analyticsMarket basket analysis -- Apriori algorithm; Implementation of the Apriori algorithm in Apache Spark; Efficient market basket analysis using FP-Growth algorithm; Running FP-Growth on Apache Spark; Summary; Chapter 3: Data Visualization ; Data visualization with Java JFreeChart; Using charts in big data analytics; Time series chart; All India seasonal and annual average temperature series dataset; Simple single Time Series chart; Multiple Time Series on a single chart window; Bar charts; Histograms; When would you use a histogram? 
505 8 |a How to make histograms using JFreeChart?Line charts; Scatter plots; Box plots; Advanced visualization technique; Prefuse; IVTK Graph toolkit; Other libraries; Summary; Chapter 4: Basics of Machine Learning ; What is machine learning?; Real-life examples of machine learning; Type of machine learning; A small sample case study of supervised and unsupervised learning; Steps for machine learning problems; Choosing the machine learning model; What are the feature types that can be extracted from the datasets?; How do you select the best features to train your models? 
505 8 |a How do you run machine learning analytics on big data?Getting and preparing data in Hadoop; Training and storing models on big data; Apache Spark machine learning API; Summary; Chapter 5: Regression on Big Data ; Linear regression; What is simple linear regression?; Where is linear regression used?; Logistic regression; Which mathematical functions does logistic regression use?; Where is logistic regression used?; Predicting heart disease using logistic regression; Summary; Chapter 6: Naive Bayes and Sentiment Analysis ; Conditional probability; Bayes theorem; Naïve bayes algorithm. 
520 8 |a Learn the basics of analytics on big data using Java, machine learning and other big data toolsAbout This Book* Acquire real-world set of tools for building enterprise level data science applications* Surpasses the barrier of other languages in data science and learn create useful object-oriented codes* Extensive use of Java compliant big data tools like apache spark, Hadoop, etc. Who This Book Is ForThis book is for Java developers who are looking to perform data analysis in production environment. Those who wish to implement data analysis in their Big data applications will find this book helpful. What You Will Learn* Start from simple analytic tasks on big data* Get into more complex tasks with predictive analytics on big data using machine learning* Learn real time analytic tasks* Understand the concepts with examples and case studies* Prepare and refine data for analysis* Create charts in order to understand the data* See various real-world datasetsIn DetailThis book covers case studies such as sentiment analysis on a tweet dataset, recommendations on a movielens dataset, customer segmentation on an ecommerce dataset, and graph analysis on actual flights dataset. This book is an end-to-end guide to implement analytics on big data with Java. Java is the de facto language for major big data environments, including Hadoop. This book will teach you how to perform analytics on big data with production-friendly Java. This book basically divided into two sections. The first part is an introduction that will help the readers get acquainted with big data environments, whereas the second part will contain a hardcore discussion on all the concepts in analytics on big data. It will take you from data analysis and data visualization to the core concepts and advantages of machine learning, real-life usage of regression and classification using Naive Bayes, a deep discussion on the concepts of clustering, and a review of simple neural networks on big data using deepLearning4j or plain Java Spark code. This book is a must-have book for Java developers who want to start learning big data analytics and want to use it in the real world. Style and approachThe approach of book is to deliver practical learning modules in manageable content. Each chapter is a self-contained unit of a concept in big data analytics. Book will step by step builds the competency in the area of big data analytics. Examples using real world case studies to give ideas of real applications and how to use the techniques mentioned. The examples and case studies will be shown using both theory and code. 
588 0 |a Print version record. 
590 |a ProQuest Ebook Central  |b Ebook Central College Complete 
650 0 |a Data mining. 
650 2 |a Data Mining 
758 |i has work:  |a Big data analytics with Java (Text)  |1 https://id.oclc.org/worldcat/entity/E39PCGhrfT3JyhD3rdC9MY6GRC  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |a Mehta, Rajat.  |t Big Data Analytics with Java.  |d Birmingham : Packt Publishing, ©2017 
852 |b E-Collections  |h ProQuest 
856 4 0 |u https://ebookcentral.proquest.com/lib/mcphs/detail.action?docID=4933223  |z Full text (MCPHS users only)  |t 0 
938 |a EBL - Ebook Library  |b EBLB  |n EBL4933223 
938 |a ProQuest MyiLibrary Digital eBook Collection  |b IDEB  |n cis38588572 
938 |a YBP Library Services  |b YANK  |n 14736477 
947 |a FLO  |x pq-ebc-base 
999 f f |s c5f2d536-27d7-4038-8827-4ca534e0a2f2  |i 23a41bc4-3caa-4c8d-921b-cbd92e3b9408  |t 0 
952 f f |a Massachusetts College of Pharmacy and Health Sciences  |b Online  |c Online  |d E-Collections  |t 0  |e ProQuest  |h Other scheme 
856 4 0 |t 0  |u https://ebookcentral.proquest.com/lib/mcphs/detail.action?docID=4933223  |y Full text (MCPHS users only)