Media Mining Indexer and the CAVA Framework

submitted to Interspeech 2019, September 15-19 2019, Graz, Austria

In January 2021, SAIL LABS Technology GmbH was acquired by the sensor specialist HENSOLDT and became HENSOLDT Analytics.

In today’s attention-driven news economy, rapid changes of topics and events go hand in hand with rapid changes of vocabulary and of language use. ASR systems aimed at transcribing contents pertaining to this fluid media landscape need to keep upto-date in a continuous and dynamic manner. Static models, potentially created a long time ago, are hopelessly outdated within a short period of time. The frequent changes in vocabulary and wording need to be reflected in the models employed for optimal performance of transcription if one does not want to risk falling behind. In this demonstration paper we present the audio processing capabilities of the SAIL LABS Media Mining Indexer, and the CAVA Framework allowing semi-automatic and periodic updates of the ASR vocabulary and language model from relevant and new data.

This article was presented at Interspeech conference on 2019. To access the full article, please fill in the form below.

HENSOLDT Analytics

HENSOLDT Analytics is a global leading provider of Open Source Intelligence (OSINT) systems and Natural Language Processing technologies, such as Automatic Speech Recognition, which are key elements for media monitoring and analysis.

Media Mining Indexer and the CAVA Framework

Privacy Policy

HENSOLDT Analytics

News & Research

Company

Contact

Contact us to discover our solutions

Webdemo

Request access to our demo system

Podcast

Lisen to tour last intelligence episodes now

Careers & Jobs

Join us in shaping the future