As the television business has become more global, and evolving consumer devices spawn the need for ever more formats, there has been an explosion of the number of versions that are needed for an item of content. The need to provide tens to hundreds of language versions provides added complications, with localized versions often being created at dispersed dubbing and captioning facilities. The Interoperable Media Format (IMF) has been developed as the solution to the sensible processing of motion pictures and episodic shows. In the linked e-book, Rohde & Schwarz explain IMF and introduce Clipster as a platform for IMF workflows.
In the past content versioning has been fulfilled through a mix of videotapes: HDCAM-SR, HD-D5 and Betacam, along with 8-tracks audio tapes. File-based operations promise a more manageable solution, but it quickly became evident that the complexity of managing a few hundred versions needed some serious standardization. The early days of MXF were remembered as a standard that was too flexible but stands as a good starting point.
The motion picture industry understood how the standards had to be nailed down, yet retaining the flexibility for new codecs, new video formats. They needed a master file that could wrap all the variants of a program to create a single master. The many versions can then be created from that single master.
Broadcasters and vendors also saw the advantages of IMF, and together the three parties have worked with the SMPTE to standardize IMF (SMPTE ST-2067 group).
World-wide distribution not only entails more audio tracks—a lot more—but captions and subtitles in the local languages. This also applies to title and credit sequences.
There are also the many edits that may be needed: TV censor edits to meet local legislation, airline versions, different segmentation for different markets. A single IMF container can wrap all the necessary assets to create any number of versions.
A specific version of content from an IMF file is defined using a Composition PlayList (CPL). This defines the audio and metadata files to use,l and defines any edits or substitutions to the original video track.
A separate Output Profile List (OPL) then defines the formats—codecs, aspect ratio, audio formats—as required for the target consumer device.
In an IMF workflow, a master video track is stored, and any version, like foreign languages titles, can be stored as stubs. Similarly, the different language audio tracks and captions are stored as linked files. A specific CPL draws together all the necessary assets to assemble the required version.
This file-based approach has many advantages. Consider a program released for the US market in English and Spanish. The program is later sold to Brazil. The Portuguese soundtrack, title and credit asset are added to the IMF container, and a new CPL created for the Brazilian version. Everything is synchronized by time code. There is a single global master that adapts to the requirements of international distribution.
The end-result of the long road of standardization is a truly interoperable standard that enables, rather than constrains, versioning. IMF is a master for the any market, any device world.
This e-book, sponsored by Rohde & Schwarz, examines the latest changes to the standards and sheds light on its application in a range of scenarios from cinema to broadcast TV and OTT applications.
You might also like...
In the last article in this series, we looked at how PTP V2.1 has improved security. In this part, we investigate how robustness and monitoring is further improved to provide resilient and accurate network timing.
NDI (Network Device Interface) is a free protocol for Video over IP, developed by NewTek. The key word is “free.”
NAB have announced the show scheduled for October 2021 has been cancelled.
Violent weather storms are wreaking havoc on the East Coast of the U.S. and radio and TV stations there are struggling to get the life-saving news out. In the past two months alone storms have knocked out TV antenna…
Timing accuracy has been a fundamental component of broadcast infrastructures for as long as we’ve transmitted television pictures and sound. The time invariant nature of frame sampling still requires us to provide timing references with sub microsecond accuracy.