Home Books, Movies and Music Education & Learning ReferenceDATABRICKS AND APACHE SPARK IN ACTION A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows By Jeffrey Tromp

product_image_name-Books-DATABRICKS AND APACHE SPARK IN ACTION A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows By Jeffrey Tromp-1

Share this product

Books DATABRICKS AND APACHE SPARK IN ACTION A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows By Jeffrey Tromp

Name: DATABRICKS AND APACHE SPARK IN ACTION A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows By Jeffrey Tromp
Brand: Books
Price: 2199.00 KES
Availability: InStock

Brand: Books | Similar products from Books

KSh 2,199

In stock

+ shipping from KSh 90 to CBD - UON/Globe/Koja/River Road

0 out of 5

(No ratings available)

Promotions

Easy and safer payments via the JumiaPay App.

Report incorrect product information

Delivery & Returns

Choose your location

Pickup Station

Delivery Fees KSh 90

Ready for pickup between 14 April and 16 April if you place your order within the next 17hrs 21mins

Door Delivery

Delivery Fees KSh 200

Ready for delivery between 14 April and 16 April if you place your order within the next 17hrs 21mins

Return Policy

Easy Return, Quick Refund.Details

Seller Information

QABETE ENTERPRISES

94%Seller Score

76 Followers

Seller Performance

Shipping speed: Excellent

Quality Score: Excellent

Customer Rating: Good

Cancellation Rate: Excellent

Product details

"Databricks and Apache Spark in Action: A Practical Guide to Building Scalable Data Pipelines and Advanced Analytics Workflows" by Jeffrey Tromp serves as a hands-on manual for leveraging Databricks' unified platform atop Apache Spark to engineer robust data systems. It demystifies Spark's distributed computing for ETL pipelines, real-time analytics, and ML workflows, using Python, Scala, and SQL examples tailored to cloud environments.

Core Coverage

The book progresses from Spark fundamentals—RDDs, DataFrames, lazy evaluation—to Databricks-specific tools like Delta Lake for ACID transactions, Unity Catalog for governance, and workflows for orchestration. Practical labs cover ingestion from diverse sources, transformations via Spark SQL/MLlib, and optimization techniques like caching, partitioning, and AQE for production-scale performance.

Advanced Workflows

Sections detail scalable pipelines with Structured Streaming, feature stores, and end-to-end ML ops, including model training on massive datasets and deployment via MLflow. It addresses common pitfalls like shuffle spills and executor tuning, empowering data engineers to build resilient systems.

Specifications

Key Features

Aligns with your Python expertise and digital marketing analytics needs—apply for SEO data lakes or campaign optimization in Nairobi's tech scene, enhancing business strategy via big data insights.