Tobias Macey
Host of Data Engineering Podcast
Tobias Macey is a dedicated engineer with experience spanning many years and even more domains. He currently manages and leads the Technical Operations team at MIT Open Learning, where he designs and builds cloud infrastructure to power online access to education for the global MIT community. He also owns and operates Boundless Notions, LLC, where he offers design, review, and implementation advice on data infrastructure and cloud automation.
In addition to the Data Engineering Podcast, he hosts Podcast.__init__, where he explores the universe of ways that the Python language is being used. By applying his experience in building and scaling data infrastructure and processing workflows, he helps the audience explore and understand the challenges inherent to data management.
Tobias Macey has hosted 425 Episodes.
-
Data Infrastructure Automation For Private SaaS At Snowplow
February 17th, 2020 | 49 mins 1 sec
An interview with the Snowplow Analytics tech lead about how they manage data infrastructure for streaming events across multiple clouds
-
Data Modeling That Evolves With Your Business Using Data Vault
February 9th, 2020 | 1 hr 6 mins
An interview about the data vault method of data modeling and how it simplifies integrating the evolving data sources that you are dealing with in your enterprise data warehouse
-
The Benefits And Challenges Of Building A Data Trust
February 3rd, 2020 | 56 mins 52 secs
An interview about the BrightHive platform for building data trusts and the complexities that are inherent in sharing data across organizations
-
Pay Down Technical Debt In Your Data Pipeline With Great Expectations
January 26th, 2020 | 46 mins 31 secs
An interview about how the Great Expectations framework helps you add meaningful tests and validation to your data pipeline to drive down technical debt
-
Replatforming Production Dataflows
January 20th, 2020 | 39 mins
An interview about how Mayvenn replatformed their production dataflows using Ascend and improved their ability to deliver meaningful analytics to their business
-
Planet Scale SQL For The New Generation Of Applications With YugabyteDB
January 13th, 2020 | 1 hr 1 min
An interview about YugabyteDB and how it was architected to power the new generation of planet scale applications
-
Change Data Capture For All Of Your Databases With Debezium
January 5th, 2020 | 53 mins 1 sec
An interview about how the Debezium framework simplifies implementing change data capture for all of your database engines
-
Building The DataDog Platform For Processing Timeseries Data At Massive Scale
December 30th, 2019 | 45 mins 54 secs
An interview with a DataDog engineer about how they build reliable and highly available systems for processing timeseries data in real time and at massive scale
-
Building The Materialize Engine For Interactive Streaming Analytics In SQL
December 22nd, 2019 | 48 mins 7 secs
An episode about building Materialize for interactive analytics on continuously updated streams of data
-
Solving Data Lineage Tracking And Data Discovery At WeWork
December 16th, 2019 | 1 hr 1 min
An interview about how the Marquez platform for metadata management powers data lineage tracking, data discovery, and health reporting at WeWork
-
SnowflakeDB: The Data Warehouse Built For The Cloud
December 8th, 2019 | 58 mins 56 secs
An interview about how SnowflakeDB was built to provide a performant and flexible data platform for the cloud era
-
Organizing And Empowering Data Engineers At Citadel
December 2nd, 2019 | 45 mins 50 secs
An interview about building a successful data team and managing their career growth to power a successful financial business
-
Building A Real Time Event Data Warehouse For Sentry
November 26th, 2019 | 1 hr 1 min
An interview about how Sentry used ClickHouse to build an event data warehouse and pay down their architecture debt
-
Escaping Analysis Paralysis For Your Data Platform With Data Virtualization
November 18th, 2019 | 55 mins 42 secs
An interview about data virtualization and data engineering automation with AtScale and the value of abstractions for your data platform architecture
-
Designing For Data Protection
November 11th, 2019 | 51 mins 23 secs
An interview about data protection regulations and how they can influence the design of your data platform
-
Automating Your Production Dataflows On Spark
November 4th, 2019 | 48 mins 50 secs
An interview about how the Ascend platform provides autonomous data orchestration to simplify your production dataflows