Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

469 Episodes

Discover And De-Clutter Your Unstructured Data With Aparavi - E298

Summary

Unstructured data takes many forms in an organization. From a data engineering perspective that often means things like JSON files, audio or video recordings, images, etc. Another category of unstructured data that every business deals with is PDFs, Word documents, workstation backups, and…

Summary

Unstructured data takes many forms in an…

13 June 2022 | 00:49:12


Hire And Scale Your Data Team With Intention - E297

Summary

Building a well rounded and effective data team is an iterative process, and the first hire can set the stage for future success or failure. Trupti Natu has been the first data hire multiple times and gone through the process of building teams across the different stages of growth. In this…

Summary

Building a well rounded and effective…

13 June 2022 | 01:00:54


Simplify Data Security For Sensitive Information With The Skyflow Data Privacy Vault - E296

Summary

The best way to make sure that you don’t leak sensitive data is to never have it in the first place. The team at Skyflow decided that the second best way is to build a storage system dedicated to securely managing your sensitive information and making it easy to integrate with your…

Summary

The best way to make sure that you…

06 June 2022 | 00:54:05


Bringing The Modern Data Stack To Everyone With Y42 - E295

Summary

Cloud services have made highly scalable and performant data platforms economical and manageable for data teams. However, they are still challenging to work with and manage for anyone who isn’t in a technical role. Hung Dang understood the need to make data more accessible to the…

Summary

Cloud services have made highly scalable…

06 June 2022 | 00:59:02


Data Cloud Cost Optimization With Bluesky Data - E293

Summary

The latest generation of data warehouse platforms have brought unprecedented operational simplicity and effectively infinite scale. Along with those benefits, they have also introduced a new consumption model that can lead to incredibly expensive bills at the end of the month. In order to…

Summary

The latest generation of data warehouse…

30 May 2022 | 01:03:25


A Multipurpose Database For Transactions And Analytics To Simplify Your Data Architecture With Singlestore - E294

Summary

A large fraction of data engineering work involves moving data from one storage location to another in order to support different access and query patterns. Singlestore aims to cut down on the number of database engines that you need to run so that you can reduce the amount of copying that…

Summary

A large fraction of data engineering work…

30 May 2022 | 00:41:22


Unlocking The Value Of Data Across The Organization Through User Friendly Data Tools With Prophecy - E292

Summary

The interfaces and design cues that a tool offers can have a massive impact on who is able to use it and the tasks that they are able to perform. With an eye to making data workflows more accessible to everyone in an organization Raj Bains and his team at Prophecy designed a powerful and…

Summary

The interfaces and design cues that a…

23 May 2022 | 01:10:56


Cloud Native Data Orchestration For Machine Learning And Data Engineering With Flyte - E291

Summary

Machine learning has become a meaningful target for data applications, bringing with it an increase in the complexity of orchestrating the entire data flow. Flyte is a project that was started at Lyft to address their internal needs for machine learning and integrated closely with…

Summary

Machine learning has become a meaningful…

23 May 2022 | 01:07:08


Designing And Deploying IoT Analytics For Industrial Applications At Vopak - E289

Summary

Industrial applications are one of the primary adopters of Internet of Things (IoT) technologies, with business critical operations being informed by data collected across a fleet of sensors. Vopak is a business that manages storage and distribution of a variety of liquids that are critical…

Summary

Industrial applications are one of the…

16 May 2022 | 00:47:55


Insights And Advice On Building A Data Lake Platform From Someone Who Learned The Hard Way - E290

Summary

Designing a data platform is a complex and iterative undertaking which requires accounting for many conflicting needs. Designing a platform that relies on a data lake as its central architectural tenet adds additional layers of difficulty. Srivatsan Sridharan has had the opportunity to…

Summary

Designing a data platform is a complex…

16 May 2022 | 00:58:11