Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

454 Episodes

Tame The Entropy In Your Data Stack And Prevent Failures With Sifflet - E343

Summary

The problems that are easiest to fix are the ones that you prevent from happening in the first place. Sifflet is a platform that brings your entire data stack into focus to improve the reliability of your data assets and empower collaboration across your teams. In this episode CEO and…

Summary

The problems that are easiest to fix are…

21 November 2022 | 00:46:47


Taking A Look Under The Hood At CreditKarma's Data Platform - E341

Summary

CreditKarma builds data products that help consumers take advantage of their credit and financial capabilities. To make that possible they need a reliable data platform that empowers all of the organization’s stakeholders. In this episode Vishnu Venkataraman shares the journey that he…

Summary

CreditKarma builds data products that…

14 November 2022 | 00:52:03


Build Data Products Without A Data Team Using AgileData - E342

Summary

Building data products is an undertaking that has historically required substantial investments of time and talent. With the rise in cloud platforms and self-serve data technologies the barrier of entry is dropping. Shane Gibson co-founded AgileData to make analytics accessible to companies…

Summary

Building data products is an undertaking…

14 November 2022 | 01:12:30


Build Better Data Products By Creating Data, Not Consuming It - E339

Summary

A lot of the work that goes into data engineering is trying to make sense of the "data exhaust" from other applications and services. There is an undeniable amount of value and utility in that information, but it also introduces significant cost and time requirements. In this…

Summary

A lot of the work that goes into data…

07 November 2022 | 01:05:20


Clean Up Your Data Using Scalable Entity Resolution And Data Mastering With Zingg - E340

Summary

Despite the best efforts of data engineers, data is as messy as the real world. Entity resolution and fuzzy matching are powerful utilities for cleaning up data from disconnected sources, but it has typically required custom development and training machine learning models. Sonal Goyal…

Summary

Despite the best efforts of data…

07 November 2022 | 00:46:47


Expanding The Reach of Business Intelligence Through Ubiquitous Embedded Analytics With Sisense - E338

Summary

Business intelligence has grown beyond its initial manifestation as dashboards and reports. In its current incarnation it has become a ubiquitous need for analytics and opportunities to answer questions with data. In this episode Amir Orad discusses the Sisense platform and how it…

Summary

Business intelligence has grown beyond…

31 October 2022 | 00:54:00


Analytics Engineering Without The Friction Of Complex Pipeline Development With Optimus and dbt - E337

Summary

One of the most impactful technologies for data analytics in recent years has been dbt. It’s hard to have a conversation about data engineering or analysis without mentioning it. Despite its widespread adoption there are still rough edges in its workflow that cause friction for data…

Summary

One of the most impactful technologies…

30 October 2022 | 00:40:10


How To Bring Agile Practices To Your Data Projects - E336

Summary

Agile methodologies have been adopted by a majority of teams for building software applications. Applying those same practices to data can prove challenging due to the number of systems that need to be included to implement a complete feature. In this episode Shane Gibson shares practical…

Summary

Agile methodologies have been adopted by…

23 October 2022 | 01:12:18


Going From Transactional To Analytical And Self-managed To Cloud On One Database With MariaDB - E335

Summary

The database market has seen unprecedented activity in recent years, with new options addressing a variety of needs being introduced on a nearly constant basis. Despite that, there are a handful of databases that continue to be adopted due to their proven reliability and robust features.…

Summary

The database market has seen…

23 October 2022 | 00:52:04


An Exploration Of The Open Data Lakehouse And Dremio's Contribution To The Ecosystem - E333

Summary

The "data lakehouse" architecture balances the scalability and flexibility of data lakes with the ease of use and transaction support of data warehouses. Dremio is one of the companies leading the development of products and services that support the open lakehouse. In this…

Summary

The "data lakehouse"…

16 October 2022 | 00:50:44