Tobias Macey
Host of Data Engineering Podcast
Tobias Macey is a dedicated engineer with experience spanning many years and even more domains. He currently manages and leads the Technical Operations team at MIT Open Learning where he designs and builds cloud infrastructure to power online access to education for the global MIT community. He also owns and operates Boundless Notions, LLC where he offers design, review, and implementation advice on data infrastructure and cloud automation.
In addition to the Data Engineering Podcast, he hosts Podcast.__init__ where he explores the universe of ways that the Python language is being used. By applying his experience in building and scaling data infrastructure and processing workflows, he helps the audience explore and understand the challenges inherent to data management.
Tobias Macey has hosted 423 Episodes.
-
Speed Up Your Analytics With The Alluxio Distributed Storage System
February 18th, 2019 | 59 mins 44 secs
An interview about the Alluxio distributed virtual in-memory file system
-
Machine Learning In The Enterprise
February 11th, 2019 | 48 mins 18 secs
An interview about how to build, launch, and maintain machine learning products
-
Cleaning And Curating Open Data For Archaeology
February 3rd, 2019 | 1 hr 55 secs
An Interview About Building An Open Data Platform For Archaeologists
-
Managing Database Access Control For Teams With strongDM
January 28th, 2019 | 42 mins 17 secs
An Interview About strongDM's Approach To Managing Access To Multiple Databases
-
Building Enterprise Big Data Systems At LEGO
January 21st, 2019 | 48 mins 3 secs
An Interview With The Founding Members Of The LEGO Big Data Team
-
TimescaleDB: The Timeseries Database Built For SQL And Scale - Episode 65
January 13th, 2019 | 41 mins 25 secs
Checking In On The Time Series Database Market With TimescaleDB (Interview)
-
Performing Fast Data Analytics Using Apache Kudu - Episode 64
January 6th, 2019 | 50 mins 46 secs
Bringing Fast Data To The Hadoop Ecosystem With Kudu (Interview)
-
Simplifying Continuous Data Processing Using Stream Native Storage In Pravega with Tom Kaitchuck - Episode 63
December 31st, 2018 | 44 mins 42 secs
Stream-Native Storage For Unbounded Data With Pravega (Interview)
-
Continuously Query Your Time-Series Data Using PipelineDB with Derek Nelson and Usman Masood - Episode 62
December 23rd, 2018 | 1 hr 3 mins
Real-Time Analysis Of Time-Series Data In PostgreSQL With PipelineDB (Interview)
-
Advice On Scaling Your Data Pipeline Alongside Your Business with Christian Heinzmann - Episode 61
December 16th, 2018 | 39 mins 22 secs
The Evolution Of ETL As A Function Of Business Growth (Interview)
-
Putting Apache Spark Into Action with Jean Georges Perrin - Episode 60
December 9th, 2018 | 50 mins 31 secs
Tackling Apache Spark From The Data Engineer's Perspective (Interview)
-
Apache Zookeeper As A Building Block For Distributed Systems with Patrick Hunt - Episode 59
December 2nd, 2018 | 54 mins 25 secs
Building Distributed Systems On Top Of Apache Zookeeper (Interview)
-
Set Up Your Own Data-as-a-Service Platform On Dremio with Tomer Shiran - Episode 58
November 25th, 2018 | 39 mins 18 secs
Building The Dremio Open Source Data-as-a-Service Platform (Interview)
-
Stateful, Distributed Stream Processing on Flink with Fabian Hueske - Episode 57
November 18th, 2018 | 48 mins 1 sec
Scalable and Stateful Streaming Data With Apache Flink (Interview)
-
How Upsolver Is Building A Data Lake Platform In The Cloud with Yoni Iny - Episode 56
November 11th, 2018 | 51 mins 50 secs
Building A Data Lake Platform In The Cloud At Upsolver (Interview)
-
Self Service Business Intelligence And Data Sharing Using Looker with Daniel Mintz - Episode 55
November 4th, 2018 | 58 mins 4 secs
Easy And Powerful Self Service Business Intelligence With Looker (Interview)