Fundamentals of Data Engineering - Plan and Build Robust Data Systems by Joe Reis and Matt Housley
Fair Use Source: B0B4VH4T37 (FunDtEng 2022)
Book Summary
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available through the framework of the data engineering lifecycle.
Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, storage, and governance that are critical in any data environment regardless of the underlying technology.
This book will help you:
- Get a concise overview of the entire data engineering landscape
- Assess data engineering problems using an end-to-end framework of best practices
- Cut through marketing hype when choosing data technologies, architecture, and processes
- Use the data engineering lifecycle to design and build a robust architecture
- Incorporate data governance and security across the data engineering lifecycle
Who Should Read This Book
Our primary intended audience for this book consists of technical practitioners: mid- to senior-level software engineers, data scientists, or analysts interested in moving into data engineering, or data engineers working in the guts of specific technologies but wanting to develop a more comprehensive perspective. Our secondary target audience consists of data stakeholders who work adjacent to technical practitioners (for example, a data team lead with a technical background overseeing a team of data engineers, or a director of data warehousing wanting to migrate from on-premises technology to a cloud-based solution).
Ideally, you’re curious and want to learn—why else would you be reading this book? You stay current with data technologies and trends by reading books and articles on data warehousing/data lakes, batch and streaming systems, orchestration, modeling, management, analysis, developments in cloud technologies, etc. This book will help you weave what you’ve read into a complete picture of data engineering across technologies and paradigms.
About the Authors
Joe Reis is a business-minded data nerd who's worked in the data industry for 20 years, with responsibilities ranging from statistical modeling, forecasting, and machine learning to data engineering, data architecture, and almost everything else in between. Joe is the CEO and cofounder of Ternary Data, a data engineering and architecture consulting firm based in Salt Lake City, Utah. In addition, he volunteers with several technology groups and teaches at the University of Utah. In his spare time, Joe likes to rock climb, produce electronic music, and take his kids on crazy adventures.
Matt Housley is a data engineering consultant and cloud specialist. After some early programming experience with Logo, Basic, and 6502 assembly, he completed a PhD in mathematics at the University of Utah. Matt then began working in data science, eventually specializing in cloud-based data engineering. He cofounded Ternary Data with Joe Reis, where he leverages his teaching experience to train future data engineers and advise teams on robust data architecture. Matt and Joe also pontificate on all things data on The Monday Morning Data Chat.
Product Details
- Publication date: July 26, 2022
- Paperback: 446 pages
- Time to Complete: 13h 35m
© 1994 - 2024 Cloud Monk Losang Jinpa or Fair Use.