Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Gain Visibility Into Your Entire Machine Learning System Using Data Logging With WhyLogs - E283

Summary

There are very few tools which are equally useful for data engineers, data scientists, and machine learning engineers. WhyLogs is a powerful library for flexibly instrumenting all of your data systems to understand the entire lifecycle of your data from source to productionized model. In…

24 April 2022 | 00:59:04

Connecting To The Next Frontier Of Computing With Quantum Networks - E282

Summary

The next paradigm shift in computing is coming in the form of quantum technologies. Quantum processors have gained significant attention for their speed and computational power. The next frontier is in quantum networking for highly secure communications and the ability to distribute across…

18 April 2022 | 00:40:23

What Does It Really Mean To Do MLOps And What Is The Data Engineer's Role? - E281

Summary

Putting machine learning models into production and keeping them there requires investing in well-managed systems to manage the full lifecycle of data cleaning, training, deployment and monitoring. This requires a repeatable and evolvable set of processes to keep it functional. The term…

16 April 2022 | 01:15:53

DataOps As A Service For Your Data Integration Workflows With Rivery - E280

Summary

Data engineering is a practice that is multi-faceted and requires integration with a large number of systems. This often means working across multiple tools to get the job done which can introduce significant cost to productivity due to the number of context switches. Rivery is a platform…

11 April 2022 | 00:58:04

Synthetic Data As A Service For Simplifying Privacy Engineering With Gretel - E279

Summary

Any time that you are storing data about people there are a number of privacy and security considerations that come with it. Privacy engineering is a growing field in data management that focuses on how to protect attributes of personal data so that the containing datasets can be shared…

10 April 2022 | 00:48:32

Accelerate Development Of Enterprise Analytics With The Coalesce Visual Workflow Builder - E278

Summary

The flexibility of software-oriented data workflows is useful for fulfilling complex requirements, but for simple and repetitious use cases it adds significant complexity. Coalesce is a platform designed to reduce repetitive work for common workflows by adopting a visual pipeline builder to…

03 April 2022 | 00:42:46

Repeatable Patterns For Designing Data Platforms And When To Customize Them - E277

Summary

Building a data platform for your organization is a challenging undertaking. Building multiple data platforms for other organizations as a service without burning out is another thing entirely. In this episode Brandon Beidel from Red Ventures shares his experiences as a data product manager…

03 April 2022 | 00:47:02

Eliminate The Bottlenecks In Your Key/Value Storage With SpeeDB - E276

Summary

At the foundational layer many databases and data processing engines rely on key/value storage for managing the layout of information on the disk. RocksDB is one of the most popular choices for this component and has been incorporated into popular systems such as ksqlDB. As these systems…

27 March 2022 | 00:46:53

Building A Data Governance Bridge Between Cloud And Datacenters For The Enterprise At Privacera - E275

Summary

Data governance is a practice that requires a high degree of flexibility and collaboration at the organizational and technical levels. The growing prominence of cloud and hybrid environments in data management adds additional stress to an already complex endeavor. Privacera is an enterprise…

27 March 2022 | 01:02:35

Exploring Incident Management Strategies For Data Teams - E274

Summary

Data assets and the pipelines that create them have become critical production infrastructure for companies. This adds a requirement for reliability and management of up-time similar to application infrastructure. In this episode Francisco Alberini and Mei Tao share their insights on what…

20 March 2022 | 00:57:26