Experiment Design for Data Science

Submitted by webmaster on Fri, 10/22/2021 - 14:48
Course No: 188992
Course Type: VU
Term: 2021W
Weekly Hours: 2.0
Lecturer: Peter Knees, Allan Hanbury, Alexander Schindler
Language: English
Objective: 

This course gives an introduction to data science. The emphasis is on strategies for the design of experiments, considering both workflow paradigms and aspects of reproducibility and traceability of solutions. The course also covers the lifecycle of data, from acquisition through processing and analysis to long-term provision and reuse. Students are also introduced to the complex legal and ethical aspects of working with data.
 
 

Content: 

The following topics are covered in the lectures:

  • Introduction to Data Science
  • Data and the data lifecycle
  • Conceptual experiment design
  • Workflow paradigms
  • Data management, reproducibility and traceability
  • Experiment error analysis and statistical testing
  • Advanced experiment design

In addition, students complete two exercises.
 
The effort breakdown is:

  • 9 two-hour lectures: 18h
  • Exercise 1: 10h
  • Exercise 2 (incl. presentation): 30h
  • Exam preparation: 16h
  • Exam: 1h
  • Sum: 75h
 
 

Information: 

Dates (all online via Zoom, Thu, 14:15-15:45)
First Meeting / Introduction to data science: Thu, Oct 14, 14:15-15:45
Zoom Meeting: https://tuwien.zoom.us/j/93329349139?pwd=ZHFaY2d5WG9rZDBkc3NFbmx0Skp2QT09
Meeting ID: 933 2934 9139, Password: 4cZkN8Kc
BLOCK Data Science

  • Introduction to data science - data science process -Hanbury
  • Data and the data lifecycle, ethical and legal aspects -Hanbury

BLOCK Conceptual Experiment Design

  • Planning and execution of experiments, hypotheses, data collection -Knees
  • ML basics, Evaluation -Knees
  • Workflow paradigms and environments -Schindler, Knees
  • Experiment Error Analysis and Statistical Testing 1 -Knees
  • Experiment Error Analysis and Statistical Testing 2 -Knees
  • Exercise 1 (individual): Design an experimental workflow for a given dataset 

BLOCK Reproducibility and traceability

  • Reproducibility and traceability 1 -Rauber
  • Reproducibility and traceability 2 -Rauber
  • Exercise 2 (in groups): Reproduce experimental results from a paper

January 2022 Intermediate Group Presentations of Exercise 2
January 2022 Written Exam (online)
March 2022 Exam repeat

Notes: 
Examination: 

2 exercises, exam

Recommendation: