Books DATABRICKS AND APACHE SPARK IN ACTION A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows By Jeffrey Tromp

KSh 2,199

In stock

+ shipping from KSh 90 to CBD - UON/Globe/Koja/River Road
0 out of 5
(No ratings available)

Delivery & Returns

Pickup Station

Delivery Fees KSh 90
Ready for pickup between 30 December and 02 January if you place your order within the next 16hrs 25mins

Door Delivery

Delivery Fees KSh 200
Ready for delivery between 30 December and 02 January if you place your order within the next 16hrs 25mins

Return Policy

Easy Return, Quick Refund.

Seller Information

QABETE ENTERPRISES

62% Seller Score

64 Followers

Seller Performance

Shipping speed: Excellent

Quality Score: Very Poor

Customer Rating: Good

Product details

"Databricks and Apache Spark in Action: A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows" by Jeffrey Tromp is a hands-on manual for building robust data systems on Databricks' unified platform, which runs on Apache Spark. It explains Spark's distributed computing model as it applies to ETL pipelines, real-time analytics, and ML workflows, with Python, Scala, and SQL examples tailored to cloud environments.

Core Coverage

The book progresses from Spark fundamentals (RDDs, DataFrames, lazy evaluation) to Databricks-specific tools: Delta Lake for ACID transactions, Unity Catalog for governance, and Workflows for orchestration. Practical labs cover ingestion from diverse sources, transformations with Spark SQL and MLlib, and optimization techniques such as caching, partitioning, and Adaptive Query Execution (AQE) for production-scale performance.

Advanced Workflows

Sections detail scalable pipelines built with Structured Streaming, feature stores, and end-to-end MLOps, including model training on massive datasets and deployment via MLflow. The book also addresses common pitfalls such as shuffle spills and executor tuning, helping data engineers build resilient systems.
 

Specifications

Key Features

Suits readers with Python experience and digital marketing analytics needs. The techniques can be applied to SEO data lakes or campaign optimization, for example in Nairobi's tech scene, to support business strategy with big data insights.

What’s in the box

1 BOOK

Specifications

  • SKU: BO086BM504EMMNAFAMZ
  • GTIN Barcode: 09781974936618
  • Weight (kg): 0.1

Customer Feedback

This product has no ratings yet.
