E-book
Author Aven, Jeffrey, author

Title Data analytics with Spark using Python / Jeffrey Aven
Published Boston : Addison-Wesley, [2018]
©2018

Description 1 online resource (1 volume) : illustrations
Series Addison-Wesley data & analytics series
Addison-Wesley data and analytics series.
Summary Spark for Data Professionals introduces and solidifies the concepts behind Spark 2.x, teaching working developers, architects, and data professionals how to build practical Spark solutions. Jeffrey Aven covers all aspects of Spark development, from basic programming to Spark SQL, SparkR, Spark Streaming, messaging, NoSQL, and Hadoop integration. Each chapter presents practical exercises deploying Spark to your local or cloud environment, plus programming exercises for building real applications. Unlike other Spark guides, Spark for Data Professionals explains crucial concepts step by step, assuming no extensive background as an open source developer. It provides a complete foundation for quickly progressing to more advanced data science and machine learning topics. This guide will help you: understand Spark basics that will make you a better programmer and cluster "citizen"; master Spark programming techniques that maximize your productivity; choose the right approach for each problem; make the most of built-in platform constructs, including broadcast variables, accumulators, effective partitioning, caching, and checkpointing; and leverage powerful tools for managing streaming, structured, semi-structured, and unstructured data.
Notes Includes index
Copyright © Addison-Wesley Professional
Online resource; title from title page (Safari, viewed June 8, 2018)
SUBJECT Spark (Electronic resource : Apache Software Foundation) http://id.loc.gov/authorities/names/no2015027445
Spark (Electronic resource : Apache Software Foundation) fast
Subject Electronic data processing -- Distributed processing -- Management.
Big data.
Python (Computer program language)
Form Electronic book
ISBN 9780134844855
0134844858
9780134844879
0134844874