Digital Nirvana Updates MetadataIQ Metadata-Automation Tool For Avid Ecosystem

Digital Nirvana has announced an upgrade to MetadataIQ, its SaaS-based tool that automatically generates speech-to-text and video intelligence metadata, increasing the efficiency of production, preproduction, and live content creation services for Avid PAM/MAM users.

The new version, which will be previewed at the 2022 NAB Show, makes beta-tested video intelligence capabilities commercially available and integrates directly with Avid MediaCentral.

MetadataIQ 4.0 relies on machine learning and high-performance AI capabilities in the cloud (speech to text, facial recognition, object identification, content classification, etc.) to create highly accurate metadata more quickly and less expensively than traditional methods. According to Digital Nirvana, MetadataIQ is the only tool that both generates speech-to-text transcripts automatically in real time, on incoming feeds or on stored content, and then parses each transcript by time and indexes it back to the media in the Avid environment. The company says no other such product integrates with Avid today.
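As a rough illustration of what "parsing a transcript by time" can look like, the sketch below groups timed words into fixed windows and labels each window with a timecode. It is a minimal sketch under stated assumptions, not Digital Nirvana's implementation or any Avid API: the `Word` type, the 10-second window, the 25 fps frame rate, and the function names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds from the start of the media (assumed input shape)


def to_timecode(seconds: float, fps: int = 25) -> str:
    """Format seconds as a non-drop-frame HH:MM:SS:FF timecode."""
    total_frames = int(round(seconds * fps))
    frames = total_frames % fps
    total_seconds = total_frames // fps
    hh = total_seconds // 3600
    mm = (total_seconds % 3600) // 60
    ss = total_seconds % 60
    return f"{hh:02d}:{mm:02d}:{ss:02d}:{frames:02d}"


def segment_by_time(words, window: float = 10.0):
    """Group words into consecutive time windows keyed by window start."""
    segments = {}
    for w in words:
        bucket = int(w.start // window) * window  # start of the window holding this word
        segments.setdefault(bucket, []).append(w.text)
    # Each entry pairs a timecode with the text spoken in that window.
    return [(to_timecode(start), " ".join(texts))
            for start, texts in sorted(segments.items())]


words = [Word("good", 0.5), Word("evening", 1.0), Word("tonight", 12.3)]
for timecode, text in segment_by_time(words):
    print(timecode, text)
```

Each resulting pair could then be attached to the media as a time-based marker, which is what would let an editor jump from a search hit to the matching point in the footage.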

Since Digital Nirvana introduced MetadataIQ about a year ago, the primary use case has been generating speech-to-text transcripts in real time as large volumes of live streams are ingested, then sending each transcript into the Avid Interplay PAM system with time stamps. The application’s ability to combine real-time transcript generation with real-time indexing in Avid means producers and editors can quickly find relevant media assets for their news stories, accelerating the entire production process.

In the new version, MetadataIQ’s transcription and other video intelligence capabilities will move out of the proof-of-concept stage and become commercially available, a step the company attributes to the success of its beta testing.

Also, instead of sending metadata only to Avid Interplay on-prem implementations, MetadataIQ 4.0 will integrate with Avid’s cloud-based MediaCentral hub, where editors access multiple Avid applications to do their work. Thanks to cloud integration, instead of being able to search only one type of metadata at once as they’ve been doing in Avid Interplay, editors will be able to combine searches in MediaCentral based on multiple forms of metadata. For example, if MetadataIQ generates metadata using OCR, facial recognition, and speech to text, when an editor enters search terms, MediaCentral will search all three of those types of metadata simultaneously. This means editors will get more precise results even faster.
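The difference between searching one metadata type and searching several at once can be sketched in a few lines. This is an illustrative sketch only, assuming metadata is stored as simple text markers; the `Marker` type, the `kind` labels, and the `search` function are hypothetical and do not represent the MediaCentral search API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Marker:
    asset_id: str
    kind: str      # assumed labels: "ocr", "face", or "speech"
    timecode: str
    text: str


def search(markers, query, kinds=None):
    """Return markers whose text contains every query term.

    kinds=None searches all metadata types at once; passing a set such as
    {"speech"} restricts the search to a single type.
    """
    terms = query.lower().split()
    hits = []
    for m in markers:
        if kinds is not None and m.kind not in kinds:
            continue
        haystack = m.text.lower()
        if all(term in haystack for term in terms):
            hits.append(m)
    return hits


markers = [
    Marker("clip-1", "speech", "00:00:10:00", "the mayor announced a new budget"),
    Marker("clip-1", "ocr", "00:00:12:00", "MAYOR SPEAKS AT CITY HALL"),
    Marker("clip-2", "face", "00:01:05:00", "Mayor Jane Doe"),
    Marker("clip-3", "speech", "00:02:00:00", "weather forecast for the weekend"),
]

combined = search(markers, "mayor")                    # all three metadata types
speech_only = search(markers, "mayor", kinds={"speech"})
print(len(combined), len(speech_only))
```

In this toy data set the combined search surfaces clips where the term appears on screen or as a recognized face, which a speech-only search would miss.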
