دانلود کتاب جامع اس کیو ال در داده های عظیم
SQL on Big Data: Technology, Architecture, and Innovation
Wilmington, Massachusetts, USA
ISBN-13 (pbk): 978-1-4842-2246-1 ISBN-13 (electronic): 978-1-4842-2247-8
Library of Congress Control Number: 2016958437
Copyright © ۲۰۱۶ by Sumit Pal
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part
of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations,
recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission
or information storage and retrieval, electronic adaptation, computer software, or by similar or
dissimilar methodology now known or hereafter developed
Trademarked names, logos, and images may appear in this book. Rather than use a trademark symbol
with every occurrence of a trademarked name, logo, or image, we use the names, logos, and images only
in an editorial fashion and to the benefit of the trademark owner, with no intention of infringement of
The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are
not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject
to proprietary rights.
While the advice and information in this book are believed to be true and accurate at the date of
publication, neither the author nor the editors nor the publisher can accept any legal responsibility for
any errors or omissions that may be made. The publisher makes no warranty, express or implied, with
respect to the material contained herein.
Managing Director: Welmoed Spahr
Acquisitions Editor: Susan McDermott
Developmental Editor: Laura Berendson
Technical Reviewer: Dinesh Lokhande
Editorial Board: Steve Anglin, Pramila Balen, Laura Berendson, Aaron Black, Louise Corrigan,
Jonathan Gennick, Robert Hutchinson, Celestin Suresh John, Nikhil Karkal,
James Markham, Susan McDermott, Matthew Moodie, Natalie Pao, Gwenan Spearing
Coordinating Editor: Rita Fernando
Copy Editor: Michael G. Laraque
Compositor: SPi Global
Indexer: SPi Global
Cover Image: Selected by Freepik
Distributed to the book trade worldwide by Springer Science+Business Media New York,
۲۳۳ Spring Street, 6th Floor, New York, NY 10013. Phone 1-800-SPRINGER, fax (201) 348-4505, e-mail
email@example.com, or visit www.springer.com. Apress Media, LLC is a California LLC and
the sole member (owner) is Springer Science + Business Media Finance Inc (SSBM Finance Inc).
SSBM Finance Inc is a Delaware corporation.
For information on translations, please e-mail firstname.lastname@example.org, or visit www.apress.com.
Apress and friends of ED books may be purchased in bulk for academic, corporate, or promotional
use. eBook versions and licenses are also available for most titles. For more information, reference our
Special Bulk Sales–eBook Licensing web page at www.apress.com/bulk-sales.
Any source code or other supplementary materials referenced by the author in this text are available
to readers at www.apress.com. For detailed information about how to locate your book’s source code,
go to www.apress.com/source-code/.
Printed on acid-free pape
This chapter discusses the history of SQL on big data and why SQL is so essential for
commoditization and adoption of big data in the enterprise. The chapter discusses how
SQL on big data has evolved and where it stands today. It discusses why the current breed
of relational databases cannot live up to the requirements of volume, speed, variability,
and scalability of operations required for data integration and data analytics. As more
and more data is becoming available on big data platforms, business analysts, business
intelligence (BI) tools, and developers all must have access to it, and SQL on big data
provides the best way to solve the access problem. This chapter covers the following:
• Why SQL on big data?
• SQL on big data goals
• SQL on big data landscape—commercial and open source tools
• How to choose an SQL on big data
The world is generating humongous amount of data. Figure 1-1 shows the amount of
data being generated over the Internet every minute. This is just the tip of the iceberg. We
do not know how much more data is generated and traverses the Internet in the deep Web