|
|
|||
|
||||
OverviewIn the current digital era, data integration is the backbone of actionable insights and informed decision-making. As organizations grow across cloud and hybrid ecosystems, the ability to design secure, robust, and scalable pipelines has become essential for driving innovation, compliance, and operational efficiency. This book is a practical guide designed to help you bridge the gap between fragmented data sources and a unified, scalable ecosystem. This guide covers the entire data integration lifecycle, beginning with core ETL/ELT concepts and data mesh architectures. You will progress from data profiling and modeling to building production-ready pipelines using AWS Glue, Azure Data Factory, and Apache NiFi. The book explores real-time streaming with Kafka and Kinesis, workflow orchestration via Airflow, and real-world applications in sectors like banking and healthcare. Finally, you will master DataOps essentials, including CI/CD with Terraform, IAM security, PII masking, and Prometheus monitoring, to ensure robust data governance and reliability. By the end of this book, readers will be equipped to design and manage production-grade data pipelines that are resilient, auditable, and future-ready. They will gain practical skills in data pipeline orchestration, solution architecture, DevOps, governance, security, and observability, empowering them to deliver trusted end-to-end data solutions and lead enterprise transformations in the age of data-driven innovation. What you will learn ● Architect data integration solutions for batch and real-time systems. ● Design data pipelines across cloud and hybrid environments. ● Implement CI/CD workflows for automated data pipeline delivery. ● Secure pipelines with IAM, encryption, and secrets management. ● Monitor, log, and handle errors for operational reliability. ● Apply governance frameworks, ensuring compliance and data quality. ● Explore emerging trends like data mesh and AI-driven integration. Who this book is for This book targets data engineers, architects, analysts, and students transitioning to cloud-native practices. Prior experience with basic databases is helpful as you master secure, enterprise-scale pipelines. It also serves researchers and managers needing practical, real-time industry use cases. Table of Contents 1. Introduction to Data Integration 2. Core Concepts, Patterns, and Architectures 3. Planning Integration Projects and Designing Workflows 4. ETL and ELT Development 5. Real-time and Streaming Data Integrations 6. Orchestrating Pipelines 7. DevOps for Data Pipelines 8. Securing Data Pipelines 9. Monitoring, Logging, and Error Handling 10. Data Governance, Quality, and Metadata Management 11. Emerging New Trends in Data Integration Full Product DetailsAuthor: Sayan GuhaPublisher: Bpb Publications Imprint: Bpb Publications Dimensions: Width: 19.10cm , Height: 2.20cm , Length: 23.50cm Weight: 0.730kg ISBN: 9789365897456ISBN 10: 9365897459 Pages: 428 Publication Date: 13 February 2026 Audience: General/trade , General Format: Paperback Publisher's Status: Active Availability: Available To Order We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |
||||