Data Engineering Podcast
Weekly deep dives on data management with the engineers and entrepreneurs who are shaping the industry
About the show
This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.
Episodes
-
Simplifying Continuous Data Processing Using Stream Native Storage In Pravega with Tom Kaitchuck - Episode 63
December 31st, 2018 | 44 mins 42 secs
Stream-Native Storage For Unbounded Data With Pravega (Interview)
-
Continuously Query Your Time-Series Data Using PipelineDB with Derek Nelson and Usman Masood - Episode 62
December 23rd, 2018 | 1 hr 3 mins
Real-Time Analysis Of Time-Series Data In PostgreSQL With PipelineDB (Interview)
-
Advice On Scaling Your Data Pipeline Alongside Your Business with Christian Heinzmann - Episode 61
December 16th, 2018 | 39 mins 22 secs
The Evolution Of ETL As A Function Of Business Growth (Interview)
-
Putting Apache Spark Into Action with Jean Georges Perrin - Episode 60
December 9th, 2018 | 50 mins 31 secs
Tackling Apache Spark From The Data Engineer's Perspective (Interview)
-
Apache Zookeeper As A Building Block For Distributed Systems with Patrick Hunt - Episode 59
December 2nd, 2018 | 54 mins 25 secs
Building Distributed Systems On Top Of Apache Zookeeper (Interview)
-
Set Up Your Own Data-as-a-Service Platform On Dremio with Tomer Shiran - Episode 58
November 25th, 2018 | 39 mins 18 secs
Building The Dremio Open Source Data-as-a-Service Platform (Interview)
-
Stateful, Distributed Stream Processing on Flink with Fabian Hueske - Episode 57
November 18th, 2018 | 48 mins 1 sec
Scalable and Stateful Streaming Data With Apache Flink (Interview)
-
How Upsolver Is Building A Data Lake Platform In The Cloud with Yoni Iny - Episode 56
November 11th, 2018 | 51 mins 50 secs
Building A Data Lake Platform In The Cloud At Upsolver (Interview)
-
Self Service Business Intelligence And Data Sharing Using Looker with Daniel Mintz - Episode 55
November 4th, 2018 | 58 mins 4 secs
Easy And Powerful Self Service Business Intelligence With Looker (Interview)
-
Using Notebooks As The Unifying Layer For Data Roles At Netflix with Matthew Seal - Episode 54
October 28th, 2018 | 40 mins 54 secs
How Netflix Is Using Jupyter Notebooks In Production (Interview)
-
Of Checklists, Ethics, and Data with Emily Miller and Peter Bull (Cross Post from Podcast.__init__) - Episode 53
October 21st, 2018 | 45 mins 32 secs
Of Checklists, Ethics, and Data (Interview)
-
Improving The Performance Of Cloud-Native Big Data At Netflix Using The Iceberg Table Format with Ryan Blue - Episode 52
October 14th, 2018 | 53 mins 45 secs
Iceberg: Improving The Utility Of Cloud-Native Big Data At Netflix (Interview)
-
Combining Transactional And Analytical Workloads On MemSQL with Nikita Shamgunov - Episode 51
October 9th, 2018 | 56 mins 54 secs
Fast, Scalable, and Flexible Data For Applications And Analytics On MemSQL (Interview)
-
Building A Knowledge Graph From Public Data At Enigma With Chris Groskopf - Episode 50
September 30th, 2018 | 52 mins 52 secs
The Data Engineering Behind A Real-World Knowledge Graph (Interview)
-
A Primer On Enterprise Data Curation with Todd Walter - Episode 49
September 23rd, 2018 | 49 mins 35 secs
Big Data Curation Strategies (Interview)
-
Take Control Of Your Web Analytics Using Snowplow With Alexander Dean - Episode 48
September 16th, 2018 | 47 mins 48 secs
Taking Ownership Of Your Web Analytics With Snowplow (Interview)