[FOX-Ebook]Data Engineering with Apache Spark, Delta Lake, and Lakehouse

¥34.99 市场价 ¥899.99
库存
9999
数量
-
+
联系卖家   QQ:316821785   微信:zbook8_com  电话:13111111111   
商品特色:担保交易手动发货商品,工作人员手动发货。

自动发货宝贝:购买后直接到我买到的商品-订单详情-收货信息获取下载链接。
手动发货宝贝:购买后请留言邮箱或联系方式,0-4小时内由工作人员发到您邮箱。
购买后任何问题请联系商家或直接联系本站站务微信或者QQ。
书籍格式:
isbn:
排版:
新旧程度:

-------如果这里没有任何信息,不是真没有,是我们懒!请复制书名上amazon搜索书籍信息。-------

Understand the complexities of modern-day data engineering platforms and explore strategies to deal with them with the help of use case scenarios led by an industry expert in big data

Key Features

  • Become well-versed with the core concepts of Apache Spark and Delta Lake for building data platforms
  • Learn how to ingest, process, and analyze data that can be later used for training machine learning models
  • Understand how to operationalize data models in production using curated data

Book Description

In the world of ever-changing data and schemas, it is important to build data pipelines that can auto-adjust to changes. This book will help you build scalable data platforms that managers, data scientists, and data analysts can rely on.

Starting with an introduction to data engineering, along with its key concepts and architectures, this book will show you how to use Microsoft Azure Cloud services effectively for data engineering. You’ll cover data lake design patterns and the different stages through which the data needs to flow in a typical data lake. Once you’ve explored the main features of Delta Lake to build data lakes with fast performance and governance in mind, you’ll advance to implementing the lambda architecture using Delta Lake. Packed with practical examples and code snippets, this book takes you through real-world examples based on production scenarios faced by the author in his 10 years of experience working with big data. Finally, you’ll cover data lake deployment strategies that play an important role in provisioning the cloud resources and deploying the data pipelines in a repeatable and continuous way.

By the end of this data engineering book, you’ll know how to effectively deal with ever-changing data and create scalable data pipelines to streamline data science, ML, and artificial intelligence (AI) tasks.

What you will learn

  • Discover the challenges you may face in the data engineering world
  • Add ACID transactions to Apache Spark using Delta Lake
  • Understand effective design strategies to build enterprise-grade data lakes
  • Explore architectural and design patterns for building efficient data ingestion pipelines
  • Orchestrate a data pipeline for preprocessing data using Apache Spark and Delta Lake APIs
  • Automate deployment and monitoring of data pipelines in production
  • Get to grips with securing, monitoring, and managing data pipelines models efficiently

Who this book is for

This book is for aspiring data engineers and data analysts who are new to the world of data engineering and are looking for a practical guide to building scalable data platforms. If you already work with PySpark and want to use Delta Lake for data engineering, you’ll find this book useful. Basic knowledge of Python, Spark, and SQL is expected.

Table of Contents

  1. The Story of Data Engineering and Analytics
  2. Discovering Storage and Compute Data Lake Architectures
  3. Data Engineering on Microsoft Azure
  4. Understanding Data Pipelines
  5. Data Collection Stage – The Bronze Layer
  6. Understanding Delta Lake
  7. Data Curation Stage – The Silver Layer
  8. Data Aggregation Stage – The Gold Layer
  9. Deploying and Monitoring Pipelines in Production
  10. Solving Data Engineering Challenges
  11. Infrastructure Provisioning
  12. Continuous Integration and Deployment (CI/CD) of Data Pipelines

暂无评价
暂时没有数据

交易规则

免责声明


1、本站所有分享材料(数据、资料)均为网友上传,如有侵犯您的任何权利,请您第一时间通过微信(zbook8_com) 、QQ(316821785)、 电话(13111111111)联系本站,本站将在24小时内回复您的诉求!谢谢!
2、本站所有商品,除特殊说明外,均为(电子版)Ebook,请购买分享内容前请务必注意。特殊商品有说明实物的,按照说明为准。

发货方式


1、自动:在上方保障服务中标有自动发货的宝贝,拍下后,将会自动收到来自卖家的宝贝获取(下载)链接   [个人中心->我的订单->点击订单 查看详情];
2、手动:未标有自动发货的的宝贝,拍下后,通过QQ或订单中的电话联系对方。

退款说明


1、描述:书籍描述(含标题)与实际不一致的(例:描述PDF,实际为epub、缺页少页、版本不符等);
2、链接:部分图书会给出链接,直接链接到官网或者其他站点,以便于提示,如与给出不符等;
3、发货:手动发货书籍,在卖家未发货前,已申请退款的;
4、其他:如质量方面的硬性常规问题等。
注:经核实符合上述任一,均支持退款,但卖家予以积极解决问题则除外。交易中的商品,卖家无法对描述进行修改!

注意事项


1、在未购买下前,双方在QQ上所商定的内容,亦可成为纠纷评判依据(商定与描述冲突时,商定为准);
2、在宝贝同时有网站演示与图片演示,且站演与图演不一致时,默认按图演作为纠纷评判依据(特别声明或有商定除外);
3、在没有"无任何正当退款依据"的前提下,写有"一旦售出,概不支持退款"等类似的声明,视为无效声明;
4、虽然交易产生纠纷的几率很小,但请尽量保留如聊天记录这样的重要信息,以防产生纠纷时便于网站工作人员介入快速处理。