Data Engineering Podcast
Episode Archive
423 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Building Auditable Spark Pipelines At Capital One
December 12th, 2021 | 42 mins 9 secs
An interview with Capital One engineer Gokul Prabagaren about his work on building Spark workflows with a data enrichment approach to provide auditable data transformations.
-
Deliver Personal Experiences In Your Applications With The Unomi Open Source Customer Data Platform
December 11th, 2021 | 57 mins 33 secs
An interview about the open source Unomi framework for building a customer data platform and how it can provide a personalized experience to your audience.
-
Data Driven Hiring For Data Professionals With Alooba
December 4th, 2021 | 50 mins 2 secs
An interview with Alooba founder Tim Freestone about the challenges of interviewing data professionals and how he is working to provide a more detailed view of candidates' abilities through high-quality skills assessments.
-
Experimentation and A/B Testing For Modern Data Teams With Eppo
December 4th, 2021 | 58 mins
An interview with Eppo founder Chetan Sharma about the challenges of designing, running, and analyzing product experiments and the work that he is doing to make experimentation more accessible to organizations of every size.
-
Creating A Unified Experience For The Modern Data Stack At Mozart Data
November 27th, 2021 | 58 mins 31 secs
An interview with Peter Fishman and Dan Silberman about how they are working to reduce the effort involved in setting up and integrating the various components of the modern data stack at Mozart Data.
-
Doing DataOps For External Data Sources As A Service at Demyst
November 27th, 2021 | 59 mins 16 secs
An interview with Demyst founder Mark Hookey about the use cases for external data sources and how they have built a DataOps platform to provide third-party data sets as a service.
-
Laying The Foundation Of Your Data Platform For The Era Of Big Complexity With Dagster
November 20th, 2021 | 1 hr 5 mins
An interview with Nick Schrock about how the Dagster framework is focusing on taming the complexity of data workflows, the introduction of Dagster Cloud for reducing the operational burden, and his philosophy on the boundaries for commercial and open source features going forward.
-
Exploring Processing Patterns For Streaming Data Integration In Your Data Lake
November 20th, 2021 | 52 mins 53 secs
An interview with Ori Rafael about the benefits of streaming data integration for data lake analytics and how to design your pipelines when migrating from a batch-oriented mindset.
-
Data Quality Starts At The Source
November 14th, 2021 | 58 mins 54 secs
An interview with Michael Harper about the benefits of being proactive about data quality efforts and building expectations and metrics into every stage of your pipelines, from source to destination.
-
Eliminate Friction In Your Data Platform Through Unified Metadata Using OpenMetadata
November 10th, 2021 | 1 hr 6 mins
An interview about the OpenMetadata project and how it can provide a universal metadata layer for your whole data environment through common schema definitions and a simple architecture.
-
Business Intelligence Beyond The Dashboard With ClicData
November 6th, 2021 | 1 hr 2 mins
An interview with Telmo Silva about all of the layers involved in a full-featured business intelligence system and how he created ClicData to make them available to organizations of every size.
-
Exploring The Evolution And Adoption of Customer Data Platforms and Reverse ETL
November 4th, 2021 | 1 hr 2 mins
An interview with Tejas Manohar of Hightouch and Rachel Bradley-Haas of Big-Time Data about how the growth of customer data platforms led to the introduction of reverse ETL systems and how you can use them together to improve your customer experience.
-
Removing The Barrier To Exploratory Analytics with Activity Schema and Narrator
October 29th, 2021 | 1 hr 8 mins
An interview with Ahmed Elsamadisi about how the Narrator platform uses the activity schema to make self-service exploratory analytics a seamless experience.
-
Streaming Data Pipelines Made SQL With Decodable
October 28th, 2021 | 1 hr 9 mins
An interview with Eric Sammer about the difficulty of working with streaming engines at a low level of abstraction and how he and his team at Decodable are working to make development of streaming data pipelines as straightforward as writing SQL.
-
Data Exploration For Business Users Powered By Analytics Engineering With Lightdash
October 22nd, 2021 | 1 hr 6 mins
An interview with Oliver Laslett about the open source Lightdash framework for business intelligence and how it builds on the work that your analytics engineers are doing with dbt.
-
Completing The Feedback Loop Of Data Through Operational Analytics With Census
October 20th, 2021 | 1 hr 9 mins
An interview with Boris Jabes of Census about the growing trend of operational analytics, how it allows data teams to complete the feedback loop for data value, and how the Census platform is architected to make it easy to implement.