Apache Spark for Data Science Cookbook.
Online Access: | Full text (MCPHS users only) |
---|---|
Main Author: | Chitturi, Padma Priya |
Format: | Electronic eBook |
Language: | English |
Published: | Packt Publishing, 2016 |
Edition: | 1. |
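Subjects: | Spark (Electronic resource : Apache Software Foundation); Data mining; Information retrieval; Big data |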
Local Note: | ProQuest Ebook Central |
MARC
LEADER | 00000cam a2200000ua 4500 | ||
---|---|---|---|
001 | in00000099307 | ||
006 | m o d | ||
007 | cr |n||||||||| | ||
008 | 170120s2016 xx o 000 0 eng d | ||
005 | 20240626184031.4 | ||
020 | |a 1785288806 |q (ebk) | ||
020 | |a 9781785288807 | ||
020 | |z 1785880101 | ||
029 | 1 | |a AU@ |b 000066230664 | |
029 | 1 | |a CHNEW |b 000949316 | |
029 | 1 | |a CHVBK |b 483154911 | |
035 | |a (OCoLC)967393459 | ||
035 | |a (OCoLC)ocn967393459 | ||
037 | |a 984338 |b MIL | ||
040 | |a IDEBK |b eng |e pn |c IDEBK |d COO |d OCLCQ |d EBLCP |d MERUC |d REB |d CHVBK |d OCLCQ |d OCLCF |d OCLCO |d OCL |d OCLCQ |d LVT |d OCLCQ |d OCLCO |d OCLCQ |d OCLCO | ||
050 | 4 | |a T55 | |
082 | 0 | 4 | |a 006.3 |2 23 |
100 | 1 | |a Chitturi, Padma Priya. | |
245 | 1 | 0 | |a Apache Spark for Data Science Cookbook. |
250 | |a 1. | ||
260 | |b Packt Publishing, |c 2016. | ||
300 | |a 1 online resource (392) | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
505 | 0 | |a Cover; Copyright; Credits; About the Author; About the Reviewer; www.PacktPub.com; Customer Feedback; Table of Contents; Preface; Chapter 1: Big Data Analytics with Spark; Introduction; Initializing SparkContext; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Working with Spark's Python and Scala shells; How to do it ... ; How it works ... ; There's more ... ; See also; Building standalone applications; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Working with the Spark programming model; How to do it ... ; How it works ... ; There's more ... ; See also. | |
505 | 8 | |a Working with pair RDDs; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Persisting RDDs; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Loading and saving data; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Creating broadcast variables and accumulators; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Submitting applications to a cluster; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Working with DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also. | |
505 | 8 | |a Working with Spark Streaming; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Chapter 2: Tricky Statistics with Spark; Introduction; Working with Pandas; Variable identification; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Sampling data; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Summary and descriptive statistics; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Generating frequency tables; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Installing Pandas on Linux. | |
505 | 8 | |a Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Installing Pandas from source; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Using IPython with PySpark; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Creating Pandas DataFrames over Spark; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Splitting, slicing, sorting, filtering, and grouping DataFrames over Spark; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Implementing co-variance and correlation using Pandas; Getting ready. | |
505 | 8 | |a How to do it ... ; How it works ... ; There's more ... ; See also; Concatenating and merging operations over DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Complex operations over DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Sparkling Pandas; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Chapter 3: Data Analysis with Spark; Introduction; Univariate analysis; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Bivariate analysis; Getting ready; How to do it ... ; How it works ... ; There's more ... | |
520 | 8 | |a Annotation |b Over 90 insightful recipes to get lightning-fast analytics with Apache Spark. About This Book: Use Apache Spark for data processing with these hands-on recipes; implement end-to-end, large-scale data analysis better than ever before; work with powerful libraries such as MLlib, SciPy, NumPy, and Pandas to gain insights from your data. Who This Book Is For: This book is for novice and intermediate level data science professionals and data analysts who want to solve data science problems with a distributed computing framework. Basic experience with data science implementation tasks is expected. Data science professionals looking to skill up and gain an edge in the field will find this book helpful. What You Will Learn: Explore the topics of data mining, text mining, natural language processing, information retrieval, and machine learning. Solve real-world analytical problems with large data sets. Address data science challenges with analytical tools on a distributed system like Spark (apt for iterative algorithms), which offers in-memory processing and more flexibility for data analysis at scale. Get hands-on experience with algorithms like classification, regression, and recommendation on real datasets using the Spark MLlib package. Learn about numerical and scientific computing using NumPy and SciPy on Spark. Use Predictive Model Markup Language (PMML) in Spark for statistical data mining models. In Detail: Spark has emerged as the most promising big data analytics engine for data science professionals. The true power and value of Apache Spark lies in its ability to execute data science tasks with speed and accuracy. Spark's selling point is that it combines ETL, batch analytics, real-time stream analysis, machine learning, graph processing, and visualizations. It lets you tackle the complexities that come with raw unstructured data sets with ease. This guide will get you comfortable and confident performing data science tasks with Spark. You will learn about implementations including distributed deep learning, numerical computing, and scalable machine learning. You will be shown effective solutions to problematic concepts in data science using Spark's data science libraries such as MLlib, Pandas, NumPy, SciPy, and more. These simple and efficient recipes will show you how to implement algorithms and optimize your work. Style and approach: This book contains a comprehensive range of recipes designed to help you learn the fundamentals and tackle the difficulties of data science. This book outlines practical steps to produce powerful insights into Big Data through a recipe-based approach. | |
588 | 0 | |a Print version record. | |
590 | |a ProQuest Ebook Central |b Ebook Central Academic Complete | ||
630 | 0 | 0 | |a Spark (Electronic resource : Apache Software Foundation) |
650 | 0 | |a Data mining. | |
650 | 0 | |a Information retrieval. | |
650 | 0 | |a Big data. | |
650 | 2 | |a Data Mining | |
650 | 2 | |a Information Storage and Retrieval | |
650 | 7 | |a information retrieval. |2 aat | |
852 | |b E-Collections |h ProQuest | ||
856 | 4 | 0 | |u https://ebookcentral.proquest.com/lib/mcphs/detail.action?docID=4773721 |z Full text (MCPHS users only) |t 0 |
938 | |a ProQuest Ebook Central |b EBLB |n EBL4773721 | ||
938 | |a ProQuest MyiLibrary Digital eBook Collection |b IDEB |n cis34515041 | ||
947 | |a FLO |x pq-ebc-base | ||
999 | f | f | |s 81503ae3-f18b-4e60-8adc-293eb6aac0b6 |i b422130d-b228-41da-aa66-ac680df1912b |t 0 |
952 | f | f | |a Massachusetts College of Pharmacy and Health Sciences |b Online |c Online |d E-Collections |t 0 |e ProQuest |h Other scheme |
856 | 4 | 0 | |t 0 |u https://ebookcentral.proquest.com/lib/mcphs/detail.action?docID=4773721 |y Full text (MCPHS users only) |