Limit search to available items
Book Cover
E-book
Author Gates, Alan, author

Title Programming Pig / Alan Gates and Daniel Dai
Edition Second edition
Published Sebastopol, CA : O'Reilly Media, Inc., 2016
©2017

Copies

Description 1 online resource : illustrations
Contents 1. What is Pig? -- 2. Installing and running Pig -- 3. Pig's data model -- 4. Introduction to Pig Latin -- 5. Advanced Pig Latin -- 6. Developing and testing Pig Latin scripts -- 7. Making Pig fly -- 8. Embedding Pig -- 9. Writing evaluation and filter functions -- 10. Writing load and store functions -- 11. Pig on Tez -- 12. Pig and other members of the Hadoop community -- 13. Use cases and programming examples
Summary For many organizations, Hadoop is the first step for dealing with massive amounts of data. The next step? Processing and analyzing datasets with the Apache Pig scripting platform. With Pig, you can batch-process data without having to create a full-fledged application, making it easy to experiment with new datasets. Updated with use cases and programming examples, this second edition is the ideal learning tool for new and experienced users alike. You'll find comprehensive coverage on key features such as the Pig Latin scripting language and the Grunt shell. When you need to analyze terabytes of data, this book shows you how to do it efficiently with Pig. Delve into Pig's data model, including scalar and complex data typesWrite Pig Latin scripts to sort, group, join, project, and filter your dataUse Grunt to work with the Hadoop Distributed File System (HDFS)Build complex data processing pipelines with Pig's macros and modularity featuresEmbed Pig Latin in Python for iterative processing and other advanced tasksUse Pig with Apache Tez to build high-performance batch and interactive data processing applicationsCreate your own load and store functions to handle data formats and storage mechanisms
Notes "Dataflow scripting with Hadoop"--Cover
Includes index
Print version record
SUBJECT Apache Pig (Computer file)
Subject Programming languages (Electronic computers) -- Handbooks, manuals, etc
Open source software.
Pig Latin (Computer program language)
COMPUTERS -- Programming Languages -- General.
Open source software
Programming languages (Electronic computers)
Genre/Form handbooks.
Handbooks and manuals
Handbooks and manuals.
Guides et manuels.
Form Electronic book
Author Dai, Daniel, author
ISBN 9781491937068
1491937068
9781491937044
1491937041