discussions on software development
#45 Michal Klos, Localytics and the World of Big Data
January 25, 2016
00:39:23
28.38 MB
Downloads: 0
Summary
Michal Klos of Localytics tells me about their big data stack and where he thinks the industry is going.
Details
Who he is, what he does; overview of the world of big data, history, batch processing, stream processing and micro batching; databases, Apache Spark, separating storage and compute; where he thinks the industry is going in the next five years, more about Spark, data lakes, query federation, Presto; how to get started with a big data project, picking technologies, doing a test; most big data projects fail, you should start small, get cross team involvement; how to scale to petabytes, start small with short expected lifespan; technologies Localytics uses, blog, they are hiring.