Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

469 Episodes

Unlocking The Potential Of Streaming Data Applications Without The Operational Headache At Grainite - E368

Summary The promise of streaming data is that it allows you to react to new information as it happens, rather than introducing latency by batching records together. The peril is that building a robust and scalable streaming architecture is always more complicated and error-prone than you think it's going to be. After experiencing this unfortunate…

Summary The promise of streaming data is that it allows you to react to new information as it…

25 March 2023 | 01:13:34


Aligning Data Security With Business Productivity To Deploy Analytics Safely And At Speed - E367

Summary As with all aspects of technology, security is a critical element of data applications, and the different controls can be at cross purposes with productivity. In this episode Yoav Cohen from Satori shares his experiences as a practitioner in the space of data security and how to align with the needs of engineers and business users. He also…

Summary As with all aspects of technology, security is a critical element of data applications, and…

19 March 2023 | 00:51:38


Use Your Data Warehouse To Power Your Product Analytics With NetSpring - E366

Summary With the rise of the web and digital business came the need to understand how customers are interacting with the products and services that are being sold. Product analytics has grown into its own category and brought with it several services with generational differences in how they approach the problem. NetSpring is a warehouse-native…

Summary With the rise of the web and digital business came the need to understand how customers are…

10 March 2023 | 00:49:22


Exploring The Nuances Of Building An Intentional Data Culture - E365

Summary The ecosystem for data professionals has matured to the point that there are a large and growing number of distinct roles. With the scope and importance of data steadily increasing it is important for organizations to ensure that everyone is aligned and operating in a positive environment. To help facilitate the nascent conversation about…

Summary The ecosystem for data professionals has matured to the point that there are a large and…

06 March 2023 | 00:45:45


Building A Data Mesh Platform At PayPal - E364

Summary There has been a lot of discussion about the practical application of data mesh and how to implement it in an organization. Jean-Georges Perrin was tasked with designing a new data platform implementation at PayPal and wound up building a data mesh. In this episode he shares that journey and the combination of technical and organizational…

Summary There has been a lot of discussion about the practical application of data mesh and how to…

27 February 2023 | 00:46:54


The View Below The Waterline Of Apache Iceberg And How It Fits In Your Data Lakehouse - E363

Summary Cloud data warehouses have unlocked a massive amount of innovation and investment in data applications, but they are still inherently limiting. Because of their complete ownership of your data they constrain the possibilities of what data you can store and how it can be used. Projects like Apache Iceberg provide a viable alternative in the…

Summary Cloud data warehouses have unlocked a massive amount of innovation and investment in data…

19 February 2023 | 00:55:07


Let The Whole Team Participate In Data With The Quilt Versioned Data Hub - E362

Summary Data is a team sport, but it's often difficult for everyone on the team to participate. For a long time the mantra of data tools has been "by developers, for developers", which automatically excludes a large portion of the business members who play a crucial role in the success of any data project. Quilt Data was created as an answer to…

Summary Data is a team sport, but it's often difficult for everyone on the team to participate. For…

11 February 2023 | 00:52:02


Reflecting On The Past 6 Years Of Data Engineering - E361

Summary This podcast started almost exactly six years ago, and the technology landscape was much different than it is now. In that time there have been a number of generational shifts in how data engineering is done. In this episode I reflect on some of the major themes and take a brief look forward at some of the upcoming…

Summary This podcast started almost exactly six years ago, and the technology landscape was much…

06 February 2023 | 00:32:21


Let Your Business Intelligence Platform Build The Models Automatically With Omni Analytics - E360

Summary Business intelligence has gone through many generational shifts, but each generation has largely maintained the same workflow. Data analysts create reports that are used by the business to understand and direct the business, but the process is very labor and time intensive. The team at Omni have taken a new approach by automatically…

Summary Business intelligence has gone through many generational shifts, but each generation has…

30 January 2023 | 00:50:44


Safely Test Your Applications And Analytics With Production Quality Data Using Tonic AI - E359

Summary The most interesting and challenging bugs always happen in production, but recreating them is a constant challenge due to differences in the data that you are working with. Building your own scripts to replicate data from production is time consuming and error-prone. Tonic is a platform designed to solve the problem of having reliable,…

Summary The most interesting and challenging bugs always happen in production, but recreating them…

22 January 2023 | 00:45:40