Lehr- und Forschungseinheit für Datenbanksysteme

Breadcrumb Navigation


Big Data Management and Analytics (WS 2017/18)


  • Course registration in UNIWORX is now open (here)


  • Umfang: 3+2 hours weekly (equals 6 ECTS)
  • Lecture: Prof. Dr. Matthias Schubert
  • Assistants: Julian Busch, Daniyal Kazempour
  • Required: Lecture "Database Systems I" or equivalent
  • Beneficial: Lecture "Knowledge Discovery in Databases I" or equivalent
  • Audience: The lecture is directed towards Bachelor students (5th term) and Master students in Mediainformatics, Bioinformatics, and Informatics

Time and Locations

All times are c.t. (cum tempore)

Component When Where Starts at
Lecture Tue, 13.00 - 16.00 h Room S 004 (Schellingstr. 3) 17.10.2017
Tutorial 1 Wed, 14.00 - 16.00 h Room D Z007 (HGB) 25.10.2017
Tutorial 2 Wed, 16.00 - 18.00 h Room D Z007 (HGB) 25.10.2017
Tutorial 3 Thu, 16.00 - 18.00 h Room B 185 (Edmund-Rumpler-Str. 13) 26.10.2017
Tutorial 4 Thu, 14.00 - 16.00 h Room B 185 (Edmund-Rumpler-Str. 13) 26.10.2017


In almost all areas of business, industry, science, and everybody's life, the amount of available data that contains value and knowledge is immense and fast growing. However, turning data into information, information into knowledge, and knowledge into value is challenging.To extract the knowledge, the data needs to be stored, managed, and analyzed. Thereby, we not only have to cope with increasing amount of data, but also with increasing velocity, i.e., data streamed in high rates, with heterogeneous data sources and also more and more have to take data quality and reliability of data and information into account. These properties referring to the four V's (Volume, Velocity, Variety, and Veracity) are the key properties of "Big Data". Big Data grows faster than our ability to process the data, so we need new architectures, algorithms and approaches for managing, processing, and analyzing Big Data that goes beyond traditional concepts for knowledge discovery and data mining.

This course introduces Big Data, challenges associated with Big Data, and basic concepts for Big Data Management and Big Data Analytics which are important components in the new and popular field Data Science.


Course Schedule

17.10.17 Lecture 1: Data Science: The Big Picture ---
24.10.17 Lecture 2: NoSQL Databases 25.10.17

Tutorial 1


31.10.17 no lecture ( 500th reformation day) 01.11.17
no tutorials for this week
07.11.17 Lecture 3: Batch Systems 08.11.17

Tutorial 2

movie dataset

blob dataset

mouse dataset


14.11.17 Lecture 4: : Apache Spark 15.11.17

 Tutorial 3

Slides for Tutorial 3

21.11.17 Lecture 5: Stream Processing 22.11.17

 Tutorial 4



28.11.17 Lecture 5: Apache Flink 29.11.17
05.12.17 Lecture 6: Stream Analytics 06.12.17
12.12.17 tbA 13.12.17
19.12.17 tbA 20.12.17