MESA And SMPTE Develop First Human- And Machine-Readable Language Metadata Table

In collaboration with MESA, SMPTE is bringing the media industry’s first human- and machine-readable Language Metadata Table (LMT) into its public review process, an early step toward SMPTE standardization

Published as a SMPTE Public Committee Draft (CD), the vetted and approved list of language codes will be readily available for public comment, implementation, and validation.

The LMT register is intended to give media companies, content owners, video service providers, and others a controlled vocabulary and standardized set of codes for accurately and consistently identifying spoken and written language, in turn supporting more efficient interchange of media worldwide. Reflecting many thousands of permutations, LMT codes support numerous applications including audio, written and timed text (closed captions and subtitles), accessibility, licensing, content localization, and international distribution.

“The Language Metadata Table was started at WarnerMedia in 2017 to normalize language codes within the organization, and IETF BCP 47 was selected due to its flexibility,” said Yonah Levenson, LMT chair at MESA, the LMT sponsor. “As interest in LMT as the M&E industry's language code solution increased, SMPTE recognized the value of the LMT and came on board as the technical partner/advisor.”

Work on the LMT register has begun in SMPTE Technology Committees (TCs), which will produce a SMPTE Public CD in the first half of 2021. The Public CD process allows SMPTE to put the LMT register into the public domain quickly and then start the work of gathering feedback and making improvements to both the register and guidelines for its independent management by multiple stakeholders.

“If you buy and sell media, you understand that a common vocabulary for language tagging is sorely needed,” said SMPTE Standards VP Bruce Devlin. “The LMT register accounts for all languages as well as dialects and scripts. As we see the register through the Public CD process, our hope is that the LMT register will become a canonical resource that serves the needs of all media organizations and ecosystems. Accessing this data will be as simple as clicking on a link or using an API to grab required codes.”

SMPTE TCs are reviewing the prototype LMT register to determine if the structure of the dictionary is correct and if the process for updating that dictionary is correct. After this step is complete, the dictionary and update process will enter a public review period, during which people and organizations can try out the register and use a dedicated GitHub repository at github.com/smpte to provide real-world feedback that will inform iterative improvement of the register. The Society will leverage the SMPTE Knowledge Network, which is built on a flexible Microsoft Teams environment with integrated apps including the Microsoft 365 suite and GitHub, to bring agility and efficiency to the Public CD process.

“My hope is that ultimately we will have a structure document in SMPTE that defines the LMT, presents the data itself in both human- and machine-readable form, and provides a new administrative guideline that describes how we’ll manage controlled vocabularies and ontologies for third parties. I encourage any organization or individual with a stake in the internationalization of content to join the appropriate SMPTE TC and contribute their requirements and expertise,” added Devlin.

You might also like...

Essential Guide: Flexible IP Monitoring

Video, audio and metadata monitoring in the IP domain requires different parameter checking than is typically available from the mainstream monitoring tools found in IT. The contents of the data payload are less predictable and packet distribution more tightly defined…

Is Remote Operation Underrated?

A recent Lawo remote activities case study notes, “It should be obvious by now that remote operation has been seriously underrated. For some, it allows to save substantial amounts of money, while others will appreciate the time gained from not…

Timing: Part 1 - Sidereal Or Solar?

The subjects of timing, synchronizing and broadcasting are inseparable and in this new series John Watkinson will look at the fundamentals of timing, areas in which fundamental progress was made, how we got where we are and where we might…

The Sponsors Perspective: PTP In LANs & WANs - An Essential Component In IP Broadcast Infrastructure

PTP - as a precise network timing technology has been available for nearly two decades. It is already widely used in Telecommunication networks, Finance and Trading platforms, substation automation networks and many more industries. Every industry has its own demands…

Computer Security: Part 4 - Making Hardware Secure

The history of computing has been dominated by the von Neumann computer architecture, also known as the Princeton architecture, after the university of that name, in which one common memory stores the operating system, user programs and variables the programs…