Data Engineering Podcast
Episode Archive
Episode Archive
424 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Completing The Feedback Loop Of Data Through Operational Analytics With Census
October 20th, 2021 | 1 hr 9 mins
An interview with Boris Jabes of Census about the growing trend of operational analytics, how it allows data teams to complete the feedback loop for data value, and how the Census platform is architected to make it easy to implement.
-
Bringing The Power Of The DataHub Real-Time Metadata Graph To Everyone At Acryl Data
October 15th, 2021 | 1 hr 8 mins
An interview with the founders of Acryl Data about their work to bring DataHub to every organization for more powerful data discovery, data quality management, and data observability.
-
How And Why To Become Data Driven As A Business
October 13th, 2021 | 1 hr 1 min
An interview with Randy Bean of New Venture Partners about his recent book "Fail Fast, Learn Faster", and why it is more important than ever for businesses to be data driven at every level.
-
Make Your Business Metrics Reusable With Open Source Headless BI Using Metriql
October 8th, 2021 | 43 mins 37 secs
An interview with Burak Kabakcı about the open source headless BI system Metriql and how it provides a central system for defining and using key business metrics.
-
Adding Support For Distributed Transactions To The Redpanda Streaming Engine
October 5th, 2021 | 45 mins 58 secs
An interview with Denis Rystsov about how he designed and implemented support for distributed transactions in the Redpanda streaming engine.
-
Building Real-Time Data Platforms For Large Volumes Of Information With Aerospike
October 2nd, 2021 | 1 hr 7 mins
An interview about how the Aerospike database engine provides a foundation for building real-time data platforms that work at terabyte to petabyte scale.
-
Delivering Your Personal Data Cloud With Prifina
September 29th, 2021 | 1 hr 12 mins
A conversation about how the team at Prifina are building a platform that puts users in control of their own data and lets developers build easy to use experiences that are powered by that rich information.
-
Digging Into Data Reliability Engineering
September 25th, 2021 | 58 mins 7 secs
A conversation about the parallels between data reliability engineering and site reliability engineering, how they differ, and steps that you can take to start adopting data reliability engineering practices in your own teams.
-
Massively Parallel Data Processing In Python Without The Effort Using Bodo
September 24th, 2021 | 1 hr 4 mins
An interview about how Bodo converts standard Python code to native MPI automatically for massive speed ups in data processing workloads
-
Declarative Machine Learning Without The Operational Overhead Using Continual
September 19th, 2021 | 1 hr 11 mins
An interview with Tristan Zajonc about his work at Continual to make declarative machine learning workflows possible and seamless by building on top of the data warehouse, and how it reduces the time and cost of putting machine learning into production.
-
An Exploration Of The Data Engineering Requirements For Bioinformatics
September 19th, 2021 | 55 mins 9 secs
An interview with Jillian Rowe about the data engineering and data infrastructure needs that exist in the field of bioinformatics.
-
Setting The Stage For The Next Chapter Of The Cassandra Database
September 12th, 2021 | 59 mins 28 secs
An interview with Ben Bromhead about the technical and community efforts that went into the latest release of the Cassandra database and the foundation that it has laid for the future of the project.
-
A View From The Round Table Of Gartner's Cool Vendors
September 8th, 2021 | 1 hr 4 mins
An interview with the leaders of the companies identified as Gartner's "Cool Vendors" in data for 2021 about the challenges faced by companies and data professionals and the work that they are doing to address those difficulties.
-
Designing And Building Data Platforms As A Product
September 3rd, 2021 | 1 hr
A panel discussion about what constitutes a data platform, how to think about designing one from scratch, and ways that you can evolve your existing data infrastructure into a cohesive experience for all of your stakeholders.
-
Presto Powered Cloud Data Lakes At Speed Made Easy With Ahana
September 1st, 2021 | 1 hr 30 secs
An interview with Dipti Borkar about how she and her team at Ahana are cutting out the complexity so that you can get your cloud data lake up and running in no time with Presto powering your low latency SQL analytics.
-
Do Away With Data Integration Through A Dataware Architecture With Cinchy
August 27th, 2021 | 51 mins 26 secs
An interview with Cinchy CEO Dan DeMers about the benefits of building your systems with a dataware architecture to eliminate the need for ongoing data integration