Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.



469 Episodes

Harnessing Generative AI For Creating Educational Content With Illumidesk - E388

Summary Generative AI has unlocked a massive opportunity for content creation. There is also an unfulfilled need for experts to be able to share their knowledge and build communities. Illumidesk was built to take advantage of this intersection. In this episode Greg Werner explains how they are using generative AI as an assistive tool for creating…


20 August 2023 | 00:54:52


Unpacking The Seven Principles Of Modern Data Pipelines - E387

Summary Data pipelines are the core of every data product, ML model, and business intelligence dashboard. If you're not careful you will end up spending all of your time on maintenance and fire-fighting. The folks at Rivery distilled the seven principles of modern data pipelines that will help you stay out of trouble and be productive with your…


14 August 2023 | 00:47:03


Quantifying The Return On Investment For Your Data Team - E386

Summary As businesses increasingly invest in technology and talent focused on data engineering and analytics, they want to know whether they are benefiting. So how do you calculate the return on investment for data? In this episode Barr Moses and Anna Filippova explore that question and provide useful exercises to start answering that in your…


06 August 2023 | 01:01:53


Strategies For A Successful Data Platform Migration - E385

Summary All software systems are in a constant state of evolution. This makes it impossible to select a truly future-proof technology stack for your data platform, making an eventual migration inevitable. In this episode Gleb Mezhanskiy and Rob Goretsky share their experiences leading various data platform migrations, and the hard-won lessons that…


31 July 2023 | 01:09:53


Build Real Time Applications With Operational Simplicity Using Dozer - E384

Summary Real-time data processing has steadily been gaining adoption due to advances in the accessibility of the technologies involved. Despite that, it is still a complex set of capabilities. To bring streaming data in reach of application engineers Matteo Pelati helped to create Dozer. In this episode he explains how investing in high performance…


24 July 2023 | 00:40:43


Datapreneurs - How Todays Business Leaders Are Using Data To Define The Future - E383

Summary Data has been one of the most substantial drivers of business and economic value for the past few decades. Bob Muglia has had a front-row seat to many of the major shifts driven by technology over his career. In his recent book "Datapreneurs" he reflects on the people and businesses that he has known and worked with and how they relied on…


17 July 2023 | 00:54:45


Reduce Friction In Your Business Analytics Through Entity Centric Data Modeling - E382

Summary For business analytics the way that you model the data in your warehouse has a lasting impact on what types of questions can be answered quickly and easily. The major strategies in use today were created decades ago when the software and hardware for warehouse databases were far more constrained. In this episode Maxime Beauchemin of Airflow…


09 July 2023 | 01:12:55


How Data Engineering Teams Power Machine Learning With Feature Platforms - E381

Summary Feature engineering is a crucial aspect of the machine learning workflow. To make that possible, there are a number of technical and procedural capabilities that must be in place first. In this episode Razi Raziuddin shares how data engineering teams can support the machine learning workflow through the development and support of systems…


03 July 2023 | 01:03:30


Seamless SQL And Python Transformations For Data Engineers And Analysts With SQLMesh - E380

Summary Data transformation is a key activity for all of the organizational roles that interact with data. Because of its importance and outsized impact on what is possible for downstream data consumers it is critical that everyone is able to collaborate seamlessly. SQLMesh was designed as a unifying tool that is simple to work with but powerful…


25 June 2023 | 00:50:19


How Column-Aware Development Tooling Yields Better Data Models - E379

Summary Architectural decisions are all based on certain constraints and a desire to optimize for different outcomes. In data systems one of the core architectural exercises is data modeling, which can have significant impacts on what is and is not possible for downstream use cases. By incorporating column-level lineage in the data modeling process…


18 June 2023 | 00:46:20