Data Engineering Podcast
Episode Archive
423 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Accelerate Development Of Enterprise Analytics With The Coalesce Visual Workflow Builder
April 3rd, 2022 | 42 mins 45 secs
An interview with Satish Jayanthi about how the Coalesce platform powers enterprise analytics and accelerates their time to insight for workflows in the data warehouse.
-
Repeatable Patterns For Designing Data Platforms And When To Customize Them
April 3rd, 2022 | 47 mins 2 secs
An interview with Brandon Beidel about his experiences at Red Ventures designing and supporting analytical data platforms for his customers and how he and his team have established a set of useful patterns to make that work scalable.
-
Eliminate The Bottlenecks In Your Key/Value Storage With SpeeDB
March 27th, 2022 | 46 mins 52 secs
An interview with Adi Gelvan about how he and his team re-engineered the RocksDB key/value storage engine for accelerated performance on high volume and high throughput workloads.
-
Building A Data Governance Bridge Between Cloud And Datacenters For The Enterprise At Privacera
March 27th, 2022 | 1 hr 2 mins
An interview with Balaji Ganesan about how Privacera levels up the open source Apache Ranger project to bridge data governance from on-premises datacenters to the cloud without compromise.
-
Exploring Incident Management Strategies For Data Teams
March 20th, 2022 | 57 mins 25 secs
An interview with Mei Tao and Francisco Alberini about their experiences working with data teams as they adopt data observability and incident management strategies and how to start introducing those practices into your own work.
-
Accelerate Your Embedded Analytics With Apache Pinot
March 20th, 2022 | 1 hr 12 mins
An interview with Kishore Gopalakrishna and Xiang Fu about how the Apache Pinot storage engine is designed to support low latency, high concurrency, and fast updates for powering end-user facing embedded analytics in your applications.
-
Accelerating Adoption Of The Modern Data Stack At 5X Data
March 13th, 2022 | 53 mins 51 secs
An interview with Tarush Aggarwal about his work at 5X Data to help organizations adopt the modern data stack to advance their analytical capabilities and accelerate their business.
-
Taking A Multidimensional Approach To Data Observability At Acceldata
March 13th, 2022 | 1 hr 3 mins
An interview with Tristan Spaulding, head of product at Acceldata, about the goals and challenges of data observability and how they are looking to differentiate themselves through a multidimensional approach to the problem (and what that means).
-
Move Your Database To The Data And Speed Up Your Analytics With DuckDB
March 5th, 2022 | 1 hr 17 mins
An interview with Hannes Mühleisen about the DuckDB engine for in-process OLAP queries that lets you use the power of SQL and the flexibility of programming languages side by side.
-
Developer Friendly Application Persistence That Is Fast And Scalable With HarperDB
March 5th, 2022 | 49 mins 33 secs
An interview with Stephen Goldberg, CEO of HarperDB, about how he and his team are building a fast, scalable, and developer friendly database engine that supports edge, cloud, and datacenter environments.
-
Manage Your Unstructured Data Assets Across Cloud And Hybrid Environments With Komprise
February 27th, 2022 | 54 mins 46 secs
An interview with Krishna Subramanian about how Komprise is addressing the challenge of managing unstructured data assets across operating environments without losing your sanity.
-
Reflections On Designing A Data Platform From Scratch
February 27th, 2022 | 40 mins 21 secs
A monologue by Tobias Macey, the host of the show, about the design considerations involved in building a data platform and how the lessons learned from running the Data Engineering Podcast are influencing the choices made.
-
Understanding The Immune System With Data At ImmunAI
February 20th, 2022 | 43 mins 7 secs
An interview with Guy Yachdav about the work that he and his team are doing at ImmunAI to help researchers and scientists understand the immune system through data and machine learning.
-
Build Your Python Data Processing Your Way And Run It Anywhere With Fugue
February 20th, 2022 | 1 hr 1 min
An interview with Kevin Kho about the open source Fugue framework for abstracting away the execution engine for your Python data workflows so you can write them once and run them anywhere.
-
Bring Your Code To Your Streaming And Static Data Without Effort With The Deephaven Real Time Query Engine
February 13th, 2022 | 1 hr 2 mins
An interview with Pete Goddard about the impressive engineering that he and his team have put into the Deephaven real time query engine for effortlessly working across streaming and static data in your preferred language.
-
Build Your Own End To End Customer Data Platform With Rudderstack
February 13th, 2022 | 47 mins 34 secs
An interview with Soumyadeb Mitra about the unique requirements for information processing in a customer data platform and how the open source Rudderstack platform allows you to customize it to meet your needs.