Franklin

Learning Spark / Holden Karau, Andy Konwinski, Patrick Wendell, and Matei Zaharia.

Author/Creator:
Karau, Holden, author.
Edition:
First edition.
Publication:
Beijing ; Sebastopol : O'Reilly, [2015]
Format/Description:
Book
xvi, 256 pages : illustrations ; 24 cm
Subjects:
Spark (Electronic resource : Apache Software Foundation).
Big data.
Data mining -- Computer programs.
Summary:
This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.-- Source other than Library of Congress.
Contents:
Introduction to data analysis with Spark
Downloading Spark and getting started
Programming with RDDs
Working with key/value pairs
Loading and saving your data
Advanced Spark programming
Running on a cluster
Tuning and debugging Spark
Spark SQL
Spark streaming
Machine learning with MLlib.
Notes:
Subtitle on cover: Lightning-fast data analysis.
Includes index.
Local notes:
Acquired for the Penn Libraries with assistance from the Class of 1932 Fund.
Contributor:
Konwinski, Andy, author.
Wendell, Patrick, author.
Zaharia, Matei, author.
Class of 1932 Fund.
ISBN:
1449358624
9781449358624
OCLC:
844872440
Publisher Number:
99964949884
Loading...
Location Notes Your Loan Policy
Description Status Barcode Your Loan Policy