Detail publikace

Automation-Driven Dataset Preparation for Continuous Czech Sign Language Recognition

ŠNAJDER, J. KREJSA, J.

Anglický název

Automation-Driven Dataset Preparation for Continuous Czech Sign Language Recognition

Typ

článek ve sborníku mimo WoS a Scopus

Jazyk

en

Originální abstrakt

This paper presents an automation-driven solution for preparing a continuous Czech Sign Language dataset, addressing the lack of resources in this area. Manual processing of daily sign language news recordings would be extremely time-consuming, as the videos vary in quality, use different overlays, and have no captions. To streamline this process, we use the Structural Similarity Index Measure (SSIM) to compare key frames and extract relevant parts of the recording, such as weather forecast segments. Automatic speech recognition (ASR) then processes the accompanying audio and generates textual transcriptions of the spoken content. The outcome is the highly automated preparation pipeline and the dataset containing 4699 annotated videos of weather forecast news in Czech Sign Language providing a foundation for future research in sign language recognition.

Anglický abstrakt

This paper presents an automation-driven solution for preparing a continuous Czech Sign Language dataset, addressing the lack of resources in this area. Manual processing of daily sign language news recordings would be extremely time-consuming, as the videos vary in quality, use different overlays, and have no captions. To streamline this process, we use the Structural Similarity Index Measure (SSIM) to compare key frames and extract relevant parts of the recording, such as weather forecast segments. Automatic speech recognition (ASR) then processes the accompanying audio and generates textual transcriptions of the spoken content. The outcome is the highly automated preparation pipeline and the dataset containing 4699 annotated videos of weather forecast news in Czech Sign Language providing a foundation for future research in sign language recognition.

Klíčová slova anglicky

sign language, continuous, dataset, recognition, translation

Vydáno

04.12.2024

Nakladatel

Institute of Electrical and Electronics Engineers Inc.

Místo

Brno

ISBN

979-8-3503-9489-4

Kniha

2024 21st International Conference on Mechatronics - Mechatronika (ME)

Strany od–do

52–56

Počet stran

5

BIBTEX


@inproceedings{BUT196505,
  author="Jan {Šnajder} and Jiří {Krejsa},
  title="Automation-Driven Dataset Preparation for Continuous Czech Sign Language Recognition",
  booktitle="2024 21st International Conference on Mechatronics - Mechatronika (ME)",
  year="2024",
  month="December",
  pages="52--56",
  publisher="Institute of Electrical and Electronics Engineers Inc.",
  address="Brno",
  isbn="979-8-3503-9489-4"
}