Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

444 Episodes

The View From The Lakehouse Of Architectural Patterns For Your Data Platform - E303

Summary

The ecosystem for data tools has been going through rapid and constant evolution over the past several years. These technological shifts have brought about corresponding changes in data and platform architectures for managing data and analytical workflows. In this episode Colleen Tartow…

Summary

The ecosystem for data tools has been…

03 July 2022 | 00:58:44


Bring Geospatial Analytics Across Disparate Datasets Into Your Toolkit With The Unfolded Platform - E301

Summary

The proliferation of sensors and GPS devices has dramatically increased the number of applications for spatial data, and the need for scalable geospatial analytics. In order to reduce the friction involved in aggregating disparate data sets that share geographic similarities the Unfolded…

Summary

The proliferation of sensors and GPS…

27 June 2022 | 01:07:01


Strategies And Tactics For A Successful Master Data Management Implementation - E302

Summary

The most complicated part of data engineering is the effort involved in making the raw data fit into the narrative of the business. Master Data Management (MDM) is the process of building consensus around what the information actually means in the context of the business and then shaping…

Summary

The most complicated part of data…

27 June 2022 | 01:09:08


Combining The Simplicity Of Spreadsheets With The Power Of Modern Data Infrastructure At Canvas - E300

Summary

Data analysis is a valuable exercise that is often out of reach of non-technical users as a result of the complexity of data systems. In order to lower the barrier to entry Ryan Buick created the Canvas application with a spreadsheet oriented workflow that is understandable to a wide…

Summary

Data analysis is a valuable exercise that…

19 June 2022 | 00:42:58


Level Up Your Data Platform With Active Metadata - E299

Summary

Metadata is the lifeblood of your data platform, providing information about what is happening in your systems. A variety of platforms have been developed to capture and analyze that information to great effect, but they are inherently limited in their utility due to their nature as storage…

Summary

Metadata is the lifeblood of your data…

19 June 2022 | 00:52:36


Discover And De-Clutter Your Unstructured Data With Aparavi - E298

Summary

Unstructured data takes many forms in an organization. From a data engineering perspective that often means things like JSON files, audio or video recordings, images, etc. Another category of unstructured data that every business deals with is PDFs, Word documents, workstation backups, and…

Summary

Unstructured data takes many forms in an…

13 June 2022 | 00:49:12


Hire And Scale Your Data Team With Intention - E297

Summary

Building a well rounded and effective data team is an iterative process, and the first hire can set the stage for future success or failure. Trupti Natu has been the first data hire multiple times and gone through the process of building teams across the different stages of growth. In this…

Summary

Building a well rounded and effective…

13 June 2022 | 01:00:54


Simplify Data Security For Sensitive Information With The Skyflow Data Privacy Vault - E296

Summary

The best way to make sure that you don’t leak sensitive data is to never have it in the first place. The team at Skyflow decided that the second best way is to build a storage system dedicated to securely managing your sensitive information and making it easy to integrate with your…

Summary

The best way to make sure that you…

06 June 2022 | 00:54:05


Bringing The Modern Data Stack To Everyone With Y42 - E295

Summary

Cloud services have made highly scalable and performant data platforms economical and manageable for data teams. However, they are still challenging to work with and manage for anyone who isn’t in a technical role. Hung Dang understood the need to make data more accessible to the…

Summary

Cloud services have made highly scalable…

06 June 2022 | 00:59:02


Data Cloud Cost Optimization With Bluesky Data - E293

Summary

The latest generation of data warehouse platforms have brought unprecedented operational simplicity and effectively infinite scale. Along with those benefits, they have also introduced a new consumption model that can lead to incredibly expensive bills at the end of the month. In order to…

Summary

The latest generation of data warehouse…

30 May 2022 | 01:03:25