Data Engineering Podcast
Episode Archive
Episode Archive
423 episodes of Data Engineering Podcast since the first episode, which aired on January 7th, 2017.
-
Lessons Learned From The Pipeline Data Engineering Academy
June 25th, 2021 | 1 hr 11 mins
An interview with the co-founders of the Pipeline Data Engineering Academy about the lessons that they learned along with their first cohort of students.
-
Make Database Performance Optimization A Playful Experience With OtterTune
June 22nd, 2021 | 58 mins 28 secs
An interview with Andy Pavlo about his work on OtterTune to automatically tune your database configuration for better performance.
-
Bring Order To The Chaos Of Your Unstructured Data Assets With Unstruk
June 17th, 2021 | 40 mins 47 secs
An interview about the Unstruk Data platform and how it automatically extracts metadata from unstructured data in order to build a searchable graph of information about your assets.
-
Accelerating ML Training And Delivery With In-Database Machine Learning
June 14th, 2021 | 1 hr 5 mins
An interview about the benefits of in-database machine learning for building and serving your models, and how Vertica is integrating those capabilities into their product.
-
Taking A Tour Of The Google Cloud Platform For Data And Analytics
June 11th, 2021 | 53 mins 16 secs
An interview about the data and analytics services available on the Google Cloud Platform and how they can be combined to simplify your workflows.
-
Make Sure Your Records Are Reliable With The BookKeeper Distributed Storage Layer
June 8th, 2021 | 42 mins 1 sec
An interview about the BookKeeper project for fast and reliable distributed storage that scales up and down with your workloads and how it is being used for systems like Pulsar
-
Build Your Analytics With A Collaborative And Expressive SQL IDE Using Querybook
June 3rd, 2021 | 52 mins 35 secs
An interview about the Querybook SQL IDE for big data analytics and how you can use it to build more expressive and maintainable analytics.
-
Making Data Pipelines Self-Serve For Everyone With Shipyard
June 1st, 2021 | 51 mins 22 secs
An interview about how the Shipyard platform is designed to make data pipelines more accessible by everyone in the business with a graphical approach to wiring together reusable processing steps.
-
Paving The Road For Fast Analytics On Distributed Clouds With The Yellowbrick Data Warehouse
May 27th, 2021 | 52 mins 40 secs
An interview with Yellowbrick's CTO about the engineering behind their data warehouse engine that was built for speed and deployment across distributed clouds.
-
Easily Build Advanced Similarity Search With The Pinecone Vector Database
May 25th, 2021 | 46 mins 47 secs
An interview with Edo Liberty about the Pinecone vector database and how it makes it easy to build a similarity search service.
-
A Holistic Approach To Data Governance Through Self Reflection At Collibra
May 20th, 2021 | 55 mins 52 secs
An interview with Stijn Christiaens about his experience building Collibra to address the complexities of data governance in the enterprise, and what he has learned from using his own product to run the business.
-
Unlocking The Power of Data Lineage In Your Platform with OpenLineage
May 18th, 2021 | 57 mins 38 secs
An interview with Julien Le Dem about the OpenLineage specification and the opportunity that it offers for simplifying the tracking and analysis of data lineage across your data platform.
-
Building Your Data Warehouse On Top Of PostgreSQL
May 13th, 2021 | 1 hr 15 mins
An interview about how you can build your data warehouse on top of PostgreSQL for flexibility and full control over your data.
-
Making Analytical APIs Fast With Tinybird
May 10th, 2021 | 54 mins 23 secs
A conversation about how Tinybird invested in Clickhouse to power analytical APIs that are fast to build and operate.
-
Making Spark Cloud Native At Data Mechanics
May 6th, 2021 | 40 mins 15 secs
A conversation about how the team at Data Mechanics is bringing Apache Spark into the cloud native world and the positive impact that has on your development experience.
-
The Grand Vision And Present Reality of DataOps
May 3rd, 2021 | 57 mins 8 secs
A conversation about the grand vision and current realities of DataOps and how you can start on the journey toward more maintainable and reliable data systems.