About
Episodes
Host
Contact
Search
Advertise
Be Our Guest
The Machine Learning Podcast
Subscribe

Data Engineering Podcast

Episode Archive

RSS
Apple Podcasts
Amazon Music
Google Podcasts
Spotify
Stitcher
TuneIn

Episode Archive

423 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.

Accelerate Development Of Enterprise Analytics With The Coalesce Visual Workflow Builder

April 3rd, 2022 | 42 mins 45 secs

An interview with Satish Jayanthi about how the Coalesce platform powers enterprise analytics and accelerates their time to insight for workflows in the data warehouse.
Repeatable Patterns For Designing Data Platforms And When To Customize Them

April 3rd, 2022 | 47 mins 2 secs

An interview with Brandon Beidel about his experiences at Red Ventures designing and supporting analytical data platforms for his customers and how he and his team have established a set of useful patterns to make it scalable.
Eliminate The Bottlenecks In Your Key/Value Storage With SpeeDB

March 27th, 2022 | 46 mins 52 secs

An interview with Adi Gelvan about how he and his team re-engineered the RocksDB key/value storage engine for accelerated performance on high volume and high throughput workloads.
Building A Data Governance Bridge Between Cloud And Datacenters For The Enterprise At Privacera

March 27th, 2022 | 1 hr 2 mins

An interview with Balaji Ganesan about how Privacera levels up the open source Apache Ranger project to bridge data governance from on premise datacenters to the cloud without compromise.
Exploring Incident Management Strategies For Data Teams

March 20th, 2022 | 57 mins 25 secs

An interview with Mei Tao and Francisco Alberini about their experiences working with data teams as they adopt data observability and incident management strategies and how to start introducing those practices into your own work.
Accelerate Your Embedded Analytics With Apache Pinot

March 20th, 2022 | 1 hr 12 mins

An interview with Kishore Gopalakrishna and Xiang Fu about how the Apache Pinot storage engine is designed to support low latency, high concurrency, and fast updates for powering end-user facing embedded analytics in your applications.
Accelerating Adoption Of The Modern Data Stack At 5X Data

March 13th, 2022 | 53 mins 51 secs

An interview with Tarush Aggarwal about his work at 5X Data to help organizations adopt the modern data stack to advance their analytical capabilities and accelerate their business.
Taking A Multidimensional Approach To Data Observability At Acceldata

March 13th, 2022 | 1 hr 3 mins

An interview with Tristan Spaulding, head of product at Acceldata, about the goals and challenges of data observability and how they are looking to differentiate themselves through a multidimensional approach to the problem (and what that means).
Move Your Database To The Data And Speed Up Your Analytics With DuckDB

March 5th, 2022 | 1 hr 17 mins

An interview with Hannes Mühleisen about the DuckDB engine for in-process OLAP queries that lets you use the power of SQL and the flexibility of programming languages side by side.
Developer Friendly Application Persistence That Is Fast And Scalable With HarperDB

March 5th, 2022 | 49 mins 33 secs

An interview with Stephen Goldberg, CEO of HarperDB, about how he and his team are building a fast, scalable, and developer friendly database engine that supports edge, cloud, and datacenter environments.
Manage Your Unstructured Data Assets Across Cloud And Hybrid Environments With Komprise

February 27th, 2022 | 54 mins 46 secs

An interview with Krishna Subramanian about how Komprise is addressing the challenge of managing unstructured data assets across operating environments without losing your sanity.
Reflections On Designing A Data Platform From Scratch

February 27th, 2022 | 40 mins 21 secs

A monologue by Tobias Macey, the host of the show, about the design considerations involved in building a data platform and how the lessons learned from running the Data Engineering Podcast are influencing the choices made.
Understanding The Immune System With Data At ImmunAI

February 20th, 2022 | 43 mins 7 secs

An interview with Guy Yachdav about the work that he and his team are doing at ImmunAI to help researchers and scientists understand the immune system through data and machine learning.
Build Your Python Data Processing Your Way And Run It Anywhere With Fugue

February 20th, 2022 | 1 hr 1 min

An interview with Kevin Kho about the open source Fugue framework for abstracting away the execution engine for your Python data workflows so you can write it once and run it anywhere.
Bring Your Code To Your Streaming And Static Data Without Effort With The Deephaven Real Time Query Engine

February 13th, 2022 | 1 hr 2 mins

An interview with Pete Goddard about the impressive engineering that he and his team have put into the Deephaven real time query engine for effortlessly working across streaming and static data in your preferred language.
Build Your Own End To End Customer Data Platform With Rudderstack

February 13th, 2022 | 47 mins 34 secs

An interview with Soumyadeb Mitra about the unique requirements for information processing in a customer data platform and how the open source Rudderstack platform allows you to customize it to meet your needs.

← Previous
1
2
…
9
10
11
…
26
27
Next →

Data Engineering Podcast is © 2024 by Boundless Notions, LLC.

About
Episodes
Host
Contact
Search
Advertise
Be Our Guest
The Machine Learning Podcast
Subscribe