Analysis of Audio Interruptions in Speech Recognition Processes for Applications in Electrical Power Systems

Authors: Victor Hideki Yoshizumi, Sofia Moreira de Andrade Lopes, Danilo Hernane Spatti, Rogério Andrade Flauzino, Ivan Nunes da Silva, Ivan Gídaro Ricci and Alexandre Gerber Choupina Latorre

Conference: SBAI-SBSE-2023

Tags: Speech Processing, Speech Recognition, Speech-To-Text, Transmission Systems and Whisper Model

Abstract:

Speech recognition as part of automatic decision-making systems has advanced in recent years and is now an established practice in several engineering sectors. In power systems in particular, speech-to-text recognition can significantly improve the quality of operating processes that rely on audio communication, since it makes it possible to transcribe the communications and to support future audits. A growing number of tools have been proposed to automate speech recognition, but they still have analysis limitations; for example, they do not allow redundancy in the transcription process. This work proposes a methodology for analyzing audio in separate channels, based on recordings of calls between electrical system operators, with the aim of increasing the robustness of speech-to-text recognition processes.