Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

454 Episodes

Speeding Up The Time To Insight For Supply Chains And Logistics With The Pathway Database That Thinks - E334

Summary

Logistics and supply chains are under increased stress and scrutiny in recent years. In order to stay ahead of customer demands, businesses need to be able to react quickly and intelligently to changes, which requires fast and accurate insights into their operations. Pathway is a streaming…

Summary

Logistics and supply chains are under…

16 October 2022 | 01:02:36


Making The Open Data Lakehouse Affordable Without The Overhead At Iomete - E332

Summary

The core of any data platform is the centralized storage and processing layer. For many that is a data warehouse, but in order to support a diverse and constantly changing set of uses and technologies the data lakehouse is a paradigm that offers a useful balance of scale and cost, with…

Summary

The core of any data platform is the…

10 October 2022 | 00:55:24


Investing In Understanding The Customer Journey At American Express - E331

Summary

For any business that wants to stay in operation, the most important thing they can do is understand their customers. American Express has invested substantial time and effort in their Customer 360 product to achieve that understanding. In this episode Purvi Shah, the VP of Enterprise Big…

Summary

For any business that wants to stay in…

10 October 2022 | 00:40:43


Gain Visibility And Insight Into Your Supply Chains Through Operational Analytics Powered By Roambee - E330

Summary

The global economy is dependent on complex and dynamic networks of supply chains powered by sophisticated logistics. This requires a significant amount of data to track shipments and operational characteristics of materials and goods. Roambee is a platform that collects, integrates, and…

Summary

The global economy is dependent on…

03 October 2022 | 01:00:04


Make Data Lineage A Ubiquitous Part Of Your Work By Simplifying Its Implementation With Alvin - E329

Summary

Data lineage is something that has grown from a convenient feature to a critical need as data systems have grown in scale, complexity, and centrality to business. Alvin is a platform that aims to provide a low effort solution for data lineage capabilities focused on simplifying the work of…

Summary

Data lineage is something that has grown…

03 October 2022 | 00:56:16


Power Your Real-Time Analytics Without The Headache Using Fivetran's Change Data Capture Integrations - E328

Summary

Data integration from source systems to their downstream destinations is the foundational step for any data product. With the increasing expecation for information to be instantly accessible, it drives the need for reliable change data capture. The team at Fivetran have recently introduced…

Summary

Data integration from source systems to…

26 September 2022 | 00:49:37


Build A Common Understanding Of Your Data Reliability Rules With Soda Core and Soda Checks Language - E327

Summary

Regardless of how data is being used, it is critical that the information is trusted. The practice of data reliability engineering has gained momentum recently to address that question. To help support the efforts of data teams the folks at Soda Data created the Soda Checks Language and the…

Summary

Regardless of how data is being used, it…

26 September 2022 | 00:41:02


Operational Analytics To Increase Efficiency For Multi-Location Businesses With OpsAnalitica - E325

Summary

In order to improve efficiency in any business you must first know what is contributing to wasted effort or missed opportunities. When your business operates across multiple locations it becomes even more challenging and important to gain insights into how work is being done. In this…

Summary

In order to improve efficiency in any…

19 September 2022 | 01:32:03


Building A Shared Understanding Of Data Assets In A Business Through A Single Pane Of Glass With Workstream - E326

Summary

There is a constant tension in business data between growing siloes, and breaking them down. Even when a tool is designed to integrate information as a guard against data isolation, it can easily become a silo of its own, where you have to make a point of using it to seek out information.…

Summary

There is a constant tension in business…

19 September 2022 | 00:54:52


Build Confidence In Your Data Platform With Schema Compatibility Reports That Span Systems And Domains Using Schemata - E324

Summary

Data engineering systems are complex and interconnected with myriad and often opaque chains of dependencies. As they scale, the problems of visibility and dependency management can increase at an exponential rate. In order to turn this into a tractable problem one approach is to define and…

Summary

Data engineering systems are complex and…

12 September 2022 | 00:59:40