Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

459 Episodes

Practical First Steps In Data Governance For Long Term Success - E428

Summary Modern businesses aspire to be data driven, and technologists enjoy working through the challenge of building data systems to support that goal. Data governance is the binding force between these two parts of the organization. Nicola Askham found her way into data governance by accident, and stayed because of the benefit that she was able…

Summary Modern businesses aspire to be data driven, and technologists enjoy working through the…

02 June 2024 | 01:00:41


Data Migration Strategies For Large Scale Systems - E427

Summary Any software system that survives long enough will require some form of migration or evolution. When that system is responsible for the data layer the process becomes more challenging. Sriram Panyam has been involved in several projects that required migration of large volumes of data in high traffic environments. In this episode he shares…

Summary Any software system that survives long enough will require some form of migration or…

27 May 2024 | 01:00:00


Zenlytic Is Building You A Better Coworker With AI Agents - E426

Summary The purpose of business intelligence systems is to allow anyone in the business to access and decode data to help them make informed decisions. Unfortunately this often turns into an exercise in frustration for everyone involved due to complex workflows and hard-to-understand dashboards. The team at Zenlytic have leaned on the promise of…

Summary The purpose of business intelligence systems is to allow anyone in the business to access…

19 May 2024 | 00:54:19


Release Management For Data Platform Services And Logic - E425

Summary Building a data platform is a substrantial engineering endeavor. Once it is running, the next challenge is figuring out how to address release management for all of the different component parts. The services and systems need to be kept up to date, but so does the code that controls their behavior. In this episode your host Tobias Macey…

Summary Building a data platform is a substrantial engineering endeavor. Once it is running, the…

12 May 2024 | 00:20:09


Barking Up The Wrong GPTree: Building Better AI With A Cognitive Approach - E424

Summary Artificial intelligence has dominated the headlines for several months due to the successes of large language models. This has prompted numerous debates about the possibility of, and timeline for, artificial general intelligence (AGI). Peter Voss has dedicated decades of his life to the pursuit of truly intelligent software through the…

Summary Artificial intelligence has dominated the headlines for several months due to the successes…

05 May 2024 | 00:54:17


Build Your Second Brain One Piece At A Time - E423

Summary Generative AI promises to accelerate the productivity of human collaborators. Currently the primary way of working with these tools is through a conversational prompt, which is often cumbersome and unwieldy. In order to simplify the integration of AI capabilities into developer workflows Tsavo Knott helped create Pieces, a powerful…

Summary Generative AI promises to accelerate the productivity of human collaborators. Currently the…

28 April 2024 | 00:50:10


Making Email Better With AI At Shortwave - E422

Summary Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee started work on Shortwave he was focused on making email more productive. When AI started gaining adoption he realized that he had even more potential for a transformative experience. In this episode he shares the technical challenges that he and his…

Summary Generative AI has rapidly transformed everything in the technology sector. When Andrew Lee…

21 April 2024 | 00:53:43


Designing A Non-Relational Database Engine - E421

Summary Databases come in a variety of formats for different use cases. The default association with the term "database" is relational engines, but non-relational engines are also used quite widely. In this episode Oren Eini, CEO and creator of RavenDB, explores the nuances of relational vs. non-relational engines, and the strategies for designing…

Summary Databases come in a variety of formats for different use cases. The default association with…

14 April 2024 | 01:16:02


Establish A Single Source Of Truth For Your Data Consumers With A Semantic Layer - E420

Summary Maintaining a single source of truth for your data is the biggest challenge in data engineering. Different roles and tasks in the business need their own ways to access and analyze the data in the organization. In order to enable this use case, while maintaining a single point of access, the semantic layer has evolved as a technological…

Summary Maintaining a single source of truth for your data is the biggest challenge in data…

07 April 2024 | 00:56:23


Adding Anomaly Detection And Observability To Your dbt Projects Is Elementary - E419

Summary Working with data is a complicated process, with numerous chances for something to go wrong. Identifying and accounting for those errors is a critical piece of building trust in the organization that your data is accurate and up to date. While there are numerous products available to provide that visibility, they all have different…

Summary Working with data is a complicated process, with numerous chances for something to go wrong.…

31 March 2024 | 00:50:44