You are currently viewing Retrieving Video Segments Based on Combined Text

Retrieving Video Segments Based on Combined Text

submitted to NAB 2003 (Las Vegas, USA, 2003)

In January 2021, SAIL LABS Technology GmbH was acquired by the sensor specialist HENSOLDT and became HENSOLDT Analytics.

This paper describes a multimedia, multilingual and multimodal research system (CIMWOS) supporting content-based indexing, archiving, retrieval and on-demand delivery of audiovisual content. CIMWOS  (Combined IMage and WOrd Spotting) incorporates an extensive set of multimedia technologies by seamless integration of three major subsystems – text, speech and image processing – producing a rich collection of XML metadata annotations following the MPEG-7 standard. These XML annotations are further merged and loaded into the CIMWOS Multimedia Database. Additionally, they can be dynamically transformed for interchanging semantic-based information into RDF and Topic Maps documents via XSL stylesheets. The CIMWOS Retrieval Engine is based on a weighted boolean model with intelligent indexing components. An ergonomic and user-friendly web-based interface allows users to efficiently retrieve video segments by a combination  of media description, content metadata and natural language text. The database is a large collection of broadcast news and documentaries in three languages (English, Greek, and French), while the open architecture allows for more languages to be incorporated in the future.

To access the full article, please fill in the form below

    Your name and e-mail are going to be used in order to send you only the research file and not any additional commercial material. You can change your mind at any time by clicking the unsubscribe in the footer of the email that you receive from us, or by contacting Please find out about your rights and choices and how we use your information in our Privacy Policy.