Agenda subject to change.
Kick off your summit with a panel of educators discussing how and why they teach data science at scale. Learn…
Wednesday Morning Keynote at 2020 Data + AI Summit Europe
This session is a continuation of “Automated Production Ready ML at Scale” from the last Spark AI Summit Europe. In this…
For Roularta, a news & media publishing company, it is of great importance to understand reader behavior and what…
The Data Lake paradigm is often considered the scalable successor of the more curated Data Warehouse approach when it comes…
We are the recommendation team practicing Data Engineering, Machine Learning, and Software Engineering at “hepsiburada.com”, which is…
The SQL tab in the Spark UI provides a lot of information for analysing your Spark queries, ranging from the…
One of the most significant benefits provided by Databricks Delta is the ability to use z-ordering and dynamic file pruning…
Implementing an efficient Spark application with the goal of maximal performance often requires knowledge that goes beyond the official documentation. Understanding…
Privacy engineering is an emerging discipline within the software and data engineering domains aiming to provide methodologies, tools, and techniques…
Note: This is a replay of a highly rated session from the June Spark + AI Summit. Enjoy! Machine learning…
In this talk, we will cover how to extract entities from text using both rule-based and deep learning techniques. We…
Spark SQL works very well with structured row-based data. The vectorized reader and writer for Parquet/ORC can make I/O much faster.…
Data & ML projects bring many new complexities beyond the traditional software development lifecycle. Unlike software projects, after they were…
For fast food recommendation use cases, user behavior sequences and context features (such as time, weather, and location) are both…
In this talk, I will show the range of data engineering challenges in acquiring accurate COVID-19 case data from hundreds…
The Weather Company (TWC) collects weather data across the globe at the rate of 34 million records per hour, and…
Note: This is a replay of a highly rated session from the June Spark + AI Summit. Enjoy! At Microsoft,…
Feature Stores for machine learning (ML) are a new class of data platform for the organization, governance, and sharing of…
SHAP for recommendation systems: How to use existing Machine Learning models as a recommendation system. We introduce a game-theoretic approach…
If you have questions, or would like information on sponsoring a Data + AI Summit, please contact organizers@spark-summit.org.
Apache, Apache Spark, Spark, and the Spark logo are trademarks of the Apache Software Foundation. The Apache Software Foundation has no affiliation with and does not endorse the materials provided at this event.