Course detail

Introduction to Data Processing

FSI-SZD-A Acad. year: 2025/2026 Winter semester

The course is focused on basic data handling: introduction to databases and its effective design for data manipulation; elementary concepts from statistics – linear regression, machine learning; and visualization, geographical data included. The course is oriented on practical aspects, all main concepts are implmented in programming language python.

Language of instruction

English

Number of ECTS credits

6

Entry knowledge

Foundations of programming.

Foundations of descriptive statistics, probability theory and mathematical statistics.

Rules for evaluation and completion of the course

Students will have to finish two minor projects during the semestr to proceed to the final examination. First is focused on databases, the second one on data presentation (interactive dashboard). The final project should involve more advanced concepts from data analysis. Students will work independently on a topic, which will be discussed (and approved) with the teacher in advance. The final exam and evaluation is based on the individual discussion of that project, which can receive 0 – 100 points.

Evaluation by points: excellent (90 – 100 points), very good (80 – 89 points), good (70 – 79 points), satisfactory (60 – 69 points), sufficient (50 – 59 points), failed (0 – 49 points).

Participation in the exercises is compulsory. During the semester two abstentions are tolerated. Replacement of missed lessons (if there are more of them) is dealt with individually.

Aims

Introduction to concepts and tools for data manipulation. The following main topics will be taught and implemented

  • databases (quering, indexing,..)
  • visualization
  • basic statistics
  • regression analysis and machine learning
  • geographical data

The study programmes with the given course

Programme N-LAN-A: Logistics Analytics, Master's, compulsory

Type of course unit

 

Lecture

26 hours, optionally

Syllabus


  1. Introduction to databases

  2. Basic queries and simple commands

  3. Larger instances and database indexing (computational aspects vs. database size)

  4. Project 1: Own Database Project

  5. Descriptive statistics and basic statistical methods

  6. Visualization: introduction to various libraries, different types of graphs

  7. Advanced visualizations and dashboards

  8. GIS + Python: map data and visualizations

  9. Analyses on maps

  10. Project 2: Own Dashboard

  11. Linear regression and logistic regression: basic econometrics

  12. Linear regression II; machine learning: neural networks

  13. Machine learning: boosted trees

Computer-assisted exercise

26 hours, compulsory

Syllabus


  1. Installation of python, sqlite, simple example

  2. Basic queries and simple commands

  3. Larger instances and database indexing (computational aspects vs. database size)

  4. Project 1: Own Database Project

  5. Descriptive statistics and basic statistical methods

  6. Visualization: introduction to various libraries, different types of graphs

  7. Advanced visualizations and dashboards

  8. GIS + Python: map data and visualizations

  9. Analyses on maps

  10. Project 2: Own Dashboard

  11. Linear regression and logistic regression: basic econometrics

  12. Linear regression II; machine learning: neural networks

  13. Machine learning: boosted trees