Course detail
Introduction to Data Processing
FSI-SZD-A Acad. year: 2025/2026 Winter semester
The course is focused on basic data handling: introduction to databases and its effective design for data manipulation; elementary concepts from statistics – linear regression, machine learning; and visualization, geographical data included. The course is oriented on practical aspects, all main concepts are implmented in programming language python.
Language of instruction
English
Number of ECTS credits
6
Supervisor
Department
Entry knowledge
Foundations of programming.
Foundations of descriptive statistics, probability theory and mathematical statistics.
Rules for evaluation and completion of the course
Students will have to finish two minor projects during the semestr to proceed to the final examination. First is focused on databases, the second one on data presentation (interactive dashboard). The final project should involve more advanced concepts from data analysis. Students will work independently on a topic, which will be discussed (and approved) with the teacher in advance. The final exam and evaluation is based on the individual discussion of that project, which can receive 0 – 100 points.
Evaluation by points: excellent (90 – 100 points), very good (80 – 89 points), good (70 – 79 points), satisfactory (60 – 69 points), sufficient (50 – 59 points), failed (0 – 49 points).
Participation in the exercises is compulsory. During the semester two abstentions are tolerated. Replacement of missed lessons (if there are more of them) is dealt with individually.
Aims
Introduction to concepts and tools for data manipulation. The following main topics will be taught and implemented
- databases (quering, indexing,..)
- visualization
- basic statistics
- regression analysis and machine learning
- geographical data
The study programmes with the given course
Programme N-LAN-A: Logistics Analytics, Master's, compulsory
Type of course unit
Lecture
26 hours, optionally
Syllabus
- Introduction to databases
- Basic queries and simple commands
- Larger instances and database indexing (computational aspects vs. database size)
- Project 1: Own Database Project
- Descriptive statistics and basic statistical methods
- Visualization: introduction to various libraries, different types of graphs
- Advanced visualizations and dashboards
- GIS + Python: map data and visualizations
- Analyses on maps
- Project 2: Own Dashboard
- Linear regression and logistic regression: basic econometrics
- Linear regression II; machine learning: neural networks
- Machine learning: boosted trees
Computer-assisted exercise
26 hours, compulsory
Syllabus
- Installation of python, sqlite, simple example
- Basic queries and simple commands
- Larger instances and database indexing (computational aspects vs. database size)
- Project 1: Own Database Project
- Descriptive statistics and basic statistical methods
- Visualization: introduction to various libraries, different types of graphs
- Advanced visualizations and dashboards
- GIS + Python: map data and visualizations
- Analyses on maps
- Project 2: Own Dashboard
- Linear regression and logistic regression: basic econometrics
- Linear regression II; machine learning: neural networks
- Machine learning: boosted trees