Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

459 Episodes

Scaling Analysis of Connected Data And Modeling Complex Relationships With The TigerGraph Graph Database - E288

Summary

Many of the events, ideas, and objects that we try to represent through data have a high degree of connectivity in the real world. These connections are best represented and analyzed as graphs to provide efficient and accurate analysis of their relationships. TigerGraph is a leading…

Summary

Many of the events, ideas, and objects…

09 May 2022 | 00:39:56


Exploring The Insights And Impact Of Dan Delorey's Distinguished Career In Data - E287

Summary

Dan Delorey helped to build the core technologies of Google’s cloud data services for many years before embarking on his latest adventure as the VP of Data at SoFi. From being an early engineer on the Dremel project, to helping launch and manage BigQuery, on to helping enterprises…

Summary

Dan Delorey helped to build the core…

09 May 2022 | 01:00:51


Leading The Charge For The ELT Data Integration Pattern For Cloud Data Warehouses At Matillion - E286

Summary

The predominant pattern for data integration in the cloud has become extract, load, and then transform or ELT. Matillion was an early innovator of that approach and in this episode CTO Ed Thompson explains how they have evolved the platform to keep pace with the rapidly changing ecosystem.…

Summary

The predominant pattern for data…

02 May 2022 | 00:53:20


Evolving And Scaling The Data Platform at Yotpo - E285

Summary

Building a data platform is an iterative and evolutionary process that requires collaboration with internal stakeholders to ensure that their needs are being met. Yotpo has been on a journey to evolve and scale their data platform to continue serving the needs of their organization as it…

Summary

Building a data platform is an iterative…

02 May 2022 | 01:04:11


Operational Analytics At Speed With Minimal Busy Work Using Incorta - E284

Summary

A huge amount of effort goes into modeling and shaping data to make it available for analytical purposes. This is often due to the need to simplify the final queries so that they are performant for visualization or limited exploration. In order to cut down the level of effort involved in…

Summary

A huge amount of effort goes into…

24 April 2022 | 01:11:16


Gain Visibility Into Your Entire Machine Learning System Using Data Logging With WhyLogs - E283

Summary

There are very few tools which are equally useful for data engineers, data scientists, and machine learning engineers. WhyLogs is a powerful library for flexibly instrumenting all of your data systems to understand the entire lifecycle of your data from source to productionized model. In…

Summary

There are very few tools which are…

24 April 2022 | 00:59:04


Connecting To The Next Frontier Of Computing With Quantum Networks - E282

Summary

The next paradigm shift in computing is coming in the form of quantum technologies. Quantum procesors have gained significant attention for their speed and computational power. The next frontier is in quantum networking for highly secure communications and the ability to distribute across…

Summary

The next paradigm shift in computing is…

18 April 2022 | 00:40:23


What Does It Really Mean To Do MLOps And What Is The Data Engineer's Role? - E281

Summary

Putting machine learning models into production and keeping them there requires investing in well-managed systems to manage the full lifecycle of data cleaning, training, deployment and monitoring. This requires a repeatable and evolvable set of processes to keep it functional. The term…

Summary

Putting machine learning models into…

16 April 2022 | 01:15:53


DataOps As A Service For Your Data Integration Workflows With Rivery - E280

Summary

Data engineering is a practice that is multi-faceted and requires integration with a large number of systems. This often means working across multiple tools to get the job done which can introduce significant cost to productivity due to the number of context switches. Rivery is a platform…

Summary

Data engineering is a practice that is…

11 April 2022 | 00:58:04


Synthetic Data As A Service For Simplifying Privacy Engineering With Gretel - E279

Summary

Any time that you are storing data about people there are a number of privacy and security considerations that come with it. Privacy engineering is a growing field in data management that focuses on how to protect attributes of personal data so that the containing datasets can be shared…

Summary

Any time that you are storing data about…

10 April 2022 | 00:48:32