Data Engineering Podcast


This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Support the show!

Rewind 10 seconds
1X
Skip 30 seconds ahead
0:00/0:00

Listen in your favorite app:



More options

Here are shows you might like

See show recommendations
AI Engineering Podcast
Tobias Macey
The Python Podcast.__init__
Tobias Macey

469 Episodes

Bringing Business Analytics To End Users With GoodData - E138

Summary

The majority of analytics platforms are focused on use internal to an organization by business stakeholders. As the availability of data increases and overall literacy in how to interpret it and take action improves there is a growing need to bring business intelligence use cases to a…

Summary

The majority of analytics platforms are…

23 June 2020 | 00:52:24


Accelerate Your Machine Learning With The StreamSQL Feature Store - E137

Summary

Machine learning is a process driven by iteration and experimentation which requires fast and easy access to relevant features of the data being processed. In order to reduce friction in the process of developing and delivering models there has been a recent trend toward building a…

Summary

Machine learning is a process driven by…

15 June 2020 | 00:46:13


Data Management Trends From An Investor Perspective - E136

Summary

The landscape of data management and processing is rapidly changing and evolving. There are certain foundational elements that have remained steady, but as the industry matures new trends emerge and gain prominence. In this episode Astasia Myers of Redpoint Ventures shares her perspective…

Summary

The landscape of data management and…

08 June 2020 | 00:54:59


Building A Data Lake For The Database Administrator At Upsolver - E135

Summary

Data lakes offer a great deal of flexibility and the potential for reduced cost for your analytics, but they also introduce a great deal of complexity. What used to be entirely managed by the database engine is now a composition of multiple systems that need to be properly configured to…

Summary

Data lakes offer a great deal of…

02 June 2020 | 00:56:17


Mapping The Customer Journey For B2B Companies At Dreamdata - E134

Summary

Gaining a complete view of the customer journey is especially difficult in B2B companies. This is due to the number of different individuals involved and the myriad ways that they interface with the business. Dreamdata integrates data from the multitude of platforms that are used by these…

Summary

Gaining a complete view of the customer…

25 May 2020 | 00:47:00


Power Up Your PostgreSQL Analytics With Swarm64 - E133

Summary

The PostgreSQL database is massively popular due to its flexibility and extensive ecosystem of extensions, but it is still not the first choice for high performance analytics. Swarm64 aims to change that by adding support for advanced hardware capabilities like FPGAs and optimized usage of…

Summary

The PostgreSQL database is massively…

18 May 2020 | 00:52:44


StreamNative Brings Streaming Data To The Cloud Native Landscape With Pulsar - E132

Summary

There have been several generations of platforms for managing streaming data, each with their own strengths and weaknesses, and different areas of focus. Pulsar is one of the recent entrants which has quickly gained adoption and an impressive set of capabilities. In this episode Sijie Guo…

Summary

There have been several generations of…

11 May 2020 | 00:55:19


Enterprise Data Operations And Orchestration At Infoworks - E131

Summary

Data management is hard at any scale, but working in the context of an enterprise organization adds even greater complexity. Infoworks is a platform built to provide a unified set of tooling for managing the full lifecycle of data in large businesses. By reducing the barrier to entry with a…

Summary

Data management is hard at any scale, but…

04 May 2020 | 00:45:54


Taming Complexity In Your Data Driven Organization With DataOps - E130

Summary

Data is a critical element to every role in an organization, which is also what makes managing it so challenging. With so many different opinions about which pieces of information are most important, how it needs to be accessed, and what to do with it, many data projects are doomed to…

Summary

Data is a critical element to every role…

28 April 2020 | 01:01:49


Building Real Time Applications On Streaming Data With Eventador - E129

Summary

Modern applications frequently require access to real-time data, but building and maintaining the systems that make that possible is a complex and time consuming endeavor. Eventador is a managed platform designed to let you focus on using the data that you collect, without worrying about…

Summary

Modern applications frequently require…

20 April 2020 | 00:50:31