Standards: Containers - Metadata Is Important!
Metadata is the invisible infrastructure that makes your content searchable, manageable and monetizable. Yet it’s routinely neglected until far too late in the production cycle. We examine what metadata really is, why it matters at every stage of the content lifecycle, and how poor metadata design can lead to lost revenue.
Meta What?
Metadata is data about data. The content is described as essence data. Metadata describes any properties of that data which are not directly involved in the management of the essence. Here are some properties of a simple audio file with an indication of whether they are essence or metadata.
| Property Value | Data Type |
|---|---|
| Sample rate. | Essence |
| Sample size. | Essence |
| File name. | Meta |
| Modification date. | Meta |
| Creator application. | Meta |
| Rights owner. | Meta |
| Duration. | Essence |
| Codec type. | Essence |
| Time code of loudest passage. | Meta |
| Loudest passage duration. | Meta |
Metadata is the glue that connects all of the media assets to each other. Here is where it affects your systems and processes:
- Workflow process pipeline steering.
- Content searching in the library.
- Creative process support.
- End-user playback experience.
- Container management.
- Revenue generation.
- Sharing and exchange with other organizations.
- Deployment.
- Scheduling.
Metadata is vital to managing your collection of assets in an archive or librarian system. It describes the relationships between different media assets so they can be managed as sub-sets owned by a particular program.
Metadata steers the production processes and broadcast delivery. Each stage is enabled by specific kinds of metadata.
During playback, metadata driven signals trigger remote actions to synchronously present supporting assets alongside the viewer in the client player.
Structured metadata is often converted into XML or JSON formats for transmission via a web server. Unpacking those formats is well supported in web browsers with JavaScript.
Metadata Quality
Reliability and performance are impacted by the quality of your supporting metadata.
Media asset collections are curated with a content management system. This typically describes the workflow from ingest to asset storage ready for playout. Although that is the core part of the production process, the asset exists well outside of that context and the metadata system should reflect the entire life cycle.
Poor quality or missing metadata will compromise any activity that depends on it and persuading operational teams to enter metadata has always been problematic.
Useful metadata can be created at the conceptual stage of a project and more insights can be gathered from the consumer’s playback device to analyze their level of engagement with the content and any advertising that you include with it. The user experience can also be steered by metadata that is created during pre-production. These aspects are rarely supported by a content management system. Embracing the entire life cycle facilitates significant time and money saving automation.
What Is Metadata?
Video and audio recordings are described as Essence Data. They are the embodiment of the content in a tangible form.
Anything else that is not directly controlling of the playback process is Metadata.
Timed sub-title text is a special case that carries human readable text as essence data. It can also carry control signals, chapter marks and other descriptive information which is very much in the realm of metadata. Timed text looks a lot like metadata but it really is an essence format.
Metadata helps to make the best use of your content assets. Ideally, it should be developed in parallel with the essence format design.
Metadata Standards
There are very few metadata standards and they only describe the core assets. This is sufficient for a basic content storage system. There are no metadata standards for workflow process control or lifecycle management. There are proprietary systems that do this, but they are not open-sourced.
Our primary focus is on the lifecycle of media assets for creating broadcast or streamed media. No single schema will cover everything. For full coverage, multiple schemas are necessary but these can be linked using Relational Database Management System (RDBMS) techniques.
It may be useful to look outside the broadcasting media walled garden and consider content-specific standards. If a natural history documentary program is being created, then knowing about metadata standards such as the Access to Biological Collections Data (ABCD) schema is helpful. A program about space exploration might make use of a resource database built with the Astronomy Visualization Metadata (AVM) schema.
By incorporating external knowledge-based metadata into the workflow, additional checks and balances can be applied automatically. For example, nomenclature and terminology used in subtitles can be reconciled against subject specialist resources and errors highlighted for correction before deployment.
There are a few metadata standards directly relevant to digital content management amongst the many others that address a large range of topics.
The Digital Curation Centre (DCC) provides a useful list of metadata schemas with descriptions and links to resources:
https://www.dcc.ac.uk/guidance/standards/metadata/list
Amongst others, DCC describes these schemas which are particularly relevant when building digital media asset management solutions:
| Schema | Description |
|---|---|
| DCMI | Dublin Core Metadata Initiative – Published as ISO Standard 15836. Probably the most popular foundational media metadata schema. Provides the basis for many content management applications which extend the core model. |
| Data Package | For exchanging arbitrary data. |
| PREMIS | Describes long-term archiving. |
| PROV | Describes the provenance of assets. |
| QuDEx | A software-neutral format for data exchange. |
| SDMX | Statistical Data and Metadata Exchange. |
| ISO 19115 | A schema for describing geographic information and services. This could be used for tagging media assets. |
| OAI-ORE | The Open Archives Initiative for Object Reuse and Exchange. Exposes the rich content in aggregations of Web resources to applications that support authoring, deposit, exchange, visualization, reuse, and preservation. |
The DublinCore Metadata Initiative
For a long time, the DublinCore Metadata Initiative (DCMI) has been the most well-known and often quoted example of a metadata standard. The EBU has adopted DublinCore and enhanced it to create their own metadata standard (EBUCore). Public Service Broadcasters in America have derived PBCore from DCMI. Likewise, the Audio Engineering Society have based AESCore on DCMI too. Parts of the DCMI specification are also used in the EPUB standard for eBooks.
Start with DCMI for your own metadata designs because it encapsulates a description of the fundamentals. These are covered by high-level element types originally defined in 1995. In 2000 the standard was revised to allow extensibility and controlled vocabularies. Many more properties were added in 2008 and the metadata terms were redefined as Resource Description Framework (RDF) properties. Additional inherited RDF classes are used to define domains and range limits for the DCMI properties. RDF is a W3C standard and this helps with deployment on the Internet.
Alternatives To DCMI
Whilst DCMI is a useful standard, many additional properties may need to be recorded at each stage of a workflow. Other schemas might suggest enhancements to the foundational DCMI:
| Schema | Description |
|---|---|
| AESCore | The AESCore metadata set for audio content derived from DublinCore. |
| CDWA | Categories for the 'Description of Works of Art' may be relevant as it pertains to visual assets. |
| CSDG | Provides insights into geographic information. |
| DDI | Data Documentation Initiative is focused on Social Sciences but may yield some useful insights into archiving. |
| EAD | Encoded Archival Description used for general purpose archiving. |
| EBU-CCDM | The EBU Class Conceptual Data Model describes programs in their different phases during creation. |
| EBUCore | The EBUCore metadata set for audiovisual content derived from DublinCore. |
| MPEG-7 | This is the ISO standard for describing multimedia. |
| NISO MIX | The Z39.87 standard describes digital still images. |
| RDF | Descriptions of Web Resources from W3C. |
| VRA Core | The Visual Resources Association describes cultural visual artefacts. |
The EBU describes other broadcasting and media related metadata specifications here:
https://tech.ebu.ch/metadata/ebucore
Proprietary Designs
Although it has never been published as a standard, one of the most complete schemas was inside Apple’s long discontinued Final Cut Server product. This was originally based on the Proximity Artbox content server. Final Cut Server described over 1200 different properties belonging to a media asset. Used selectively, they are axes on which the media library could be filtered and searched. This was a very sophisticated media librarian.
Some properties described deeply technical aspects of the footage such as shutter and aperture settings. Others pertained to the ingest of movie film including the text burned into the edge of the film substrate.
Clearly the aperture and lens settings metadata will be very useful when working on the footage to add visual effects.
Interestingly, some of these obscure metadata properties are also still present in the macOS Spotlight search engine and can be selected for use in your own searches from the Finder:
Outside-in Design Process
Study all the available resources and design your own schema. This is not a trivial exercise and requires competent database design skills as well as a robust knowledge of media internals.
Reverse the design process for your content workflow by documenting the user-facing application first. The metadata and database model are then governed by the requirements of the client player.
Work backwards so that each process in your workflow defines the metadata to be collected by the preceding process steps. This reveals parts of the design that require further refinement. Discovering omissions early creates a more reliable product. It needs less maintenance and exhibits fewer problems for the end-user.
Visualize the database table design in an Entity Relationship Diagram (ERD) to work out where tables are related to one another. Make the connections which define the joins in your SQL queries.
Object Oriented Programming (OOP) is recommended and leads to a collection of Object Modelling Diagrams derived from the ERD.
These should also describe the state changes your model undergoes and any transaction sequences for messaging and communications protocol design.
The Object Management Group has created the Unified Modelling Language (UML) as a standardized approach to building object models. At the time of writing the current version is 2.5.1 and an earlier version was adopted as ISO 19501. Although UML is declining in popularity, it still has some useful ideas to offer.
Develop your own modelling conventions based on previously published work. Document them and apply them consistently throughout the project.
Complete the metadata and database model design before writing any code to shorten the development time.
Metadata Categories
A program proceeds through the workflow and is eventually scheduled for delivery to the consumer. Metadata describes the production processes, the commercial and revenue generating aspects, scheduling and delivery.
Creating a single database table to satisfy the requirements for all of these would lead to an unwieldy design with hundreds of redundant columns. Many columns are only needed for part of the lifecycle.
Good business logic design separates each stage of production into its own table. Connect them together with RDBMS joins during a SQL query.
This is a spine and role record model. Attach role records to the spine using their unique record ID values as foreign keys only when you need them. Some of them might not exist until late in the creative or deployment process. Be careful not to duplicate any data in more than one role.
The requirements are different for each part of a media item lifecycle. Each stage is represented by a different role:
| Role | Description |
|---|---|
| Shooting | Capture everything about the location. For every clip, log the camera type, lens, aperture & settings. Record the film stock used & capture the developing characteristics which will affect film-grain size & image quality. These are vital for post production. The GPS location, date, & time are important for searches. |
| Rights clearances | Gather information pertaining to where the footage can be distributed. You may also need clearances for background music or content being displayed in a TV screen in the scene. Log all of these items when the scene is shot or identify them later. This is important for legal reasons. |
| Editing & post-production | Create a manifest of clips that were sourced & edited into the footage. Metadata can be acquired from them during the edit to conform & color grade the final package for a consistent look. Keep track of the modification history. Metadata allows you to work non-destructively. |
| Program compilation | Build the necessary metadata to support the EPG & search functions in the streaming or broadcast service. Tag things appropriately as members of a series. Note carefully the episode order & season number. |
| Client player support | Define how the content will be presented to the end-user. Configure HDR, Filmmaker Mode & aspect ratio attributes so the player can respond correctly. |
| Streaming deployment | Consult the delivery specifications for your customer's ingest process & provide the supporting metadata that they require. |
| Broadcast deployment | Allocate times in the schedule & attach the program with its EPG metadata so it can be advertised & the program description used in the popup info box on the receiving TV. |
| Commercial exploitation | Allocate an ISAN number to the finished program. Define the commercial terms for licensing the content to your customers. This is important for syndication or creation of physical media products. |
| Advertising breaks | Define where these are & also link to slates or interstitials that should be delivered with the program. Choose sensible timecode locations at scene breaks. |
| Usage analytics | As the content is played by the end user, track their viewing behavior. Note whether they saw the entire program or exited early. Collect any other data that your commercial team will need to assess whether the program merits follow up seasons or episodes. |
| Asset library functions | Manage the collection of related assets. This includes component clips & raw footage as well as finished programs. Keep records of when the item has been requested for research, re-broadcast or extracts to be included in new programs. |
| Archiving | Design an efficient & very long-term archiving regime gathering everything known about the assets. Note whether the content is redacted for security, political, historical, commercial or safeguarding reasons. Certain celebrities are featured in archived assets who cannot ever be presented onscreen. Suitably tagging the archives avoids embarrassment. |
International Standard Audio-visual Number
This works like the ISBN numbers used to register books or ISSN numbers used for describing newspapers and magazines in search systems. It facilitates the distribution of your content when it is released for viewing. It also uniquely identifies a program when searching the archives.
The ISAN numbers are defined by the ISO 15706 standard.
Register your media with an ISAN agent to get a unique ID number for each episode so that individual programs can be found. Do this during the pre-production stage of your project, long before any content is created.
Your registration facilitates the collection of your copyright licensing revenues and recording music permissions for the audio content of your production. This process is evolving and growing in popularity and if you cannot locate a registration agency for your own country, there is alternative coverage via the ISAN organization.
Publish your ISAN number by including it in the end-credits crawl.
Hints, Tips, Pitfalls & Gotchas
Metadata is sometimes described as a Cinderella technology. It is often under-appreciated, neglected and if it is dealt with at all, it is often as an afterthought. Neglecting the metadata gathering and quality assurance tasks ultimately leads to trouble sooner or later.
Here are a few suggestions of what can go wrong based on real life project examples:
- Last minute agonies – Metadata design is sometimes left until very late in the project. Likewise, documentation is often a last-minute exercise and is sometimes not done at all. This has serious consequences for your project and compromises the maintenance later on. Conceptually, the documentation is another kind of metadata, perhaps in a less structured format.
- Wasted effort – Only capture information that is genuinely required to avoid wasting effort and resources.
- Potential data loss – The media business exists as separate silos of expertise. The kind of metadata collected at each stage may only apply during that part of the lifecycle but it might affect things much later. The importance of the metadata being collected is not always apparent at the point of collection. It is often not captured because the operators at that point in the lifecycle have no use for it themselves.
- Collaboration – The on-location shooting team cannot know what the deployment team will need without their help and advice. Collaboration and information sharing is necessary to keep everyone informed. Publish a manifest of metadata that is needed with actions directed to the team that can collect it most efficiently.
- Too much detail – Avoid developing the modelling diagrams down to a fine grained level. They are helpful when looking at the bigger picture but can hinder development progress if they are overused.
- Not enough detail – If the metadata includes the film grain size (possibly based on the film stock and developing techniques used), an optimal scanning resolution and dynamic range can be chosen when digitizing old movie film. Smaller grain sizes give better resolution but are less sensitive to light. Adjust the brightness, contrast and dynamic range automatically to compensate.
- Keep all the documents – Interactive presentations or extras on a DVD product rely on behind the scenes material being gathered and logged during the shoot. Retain all the unused footage, paperwork and notes.
- Huge benefits for free – A natural history unit was able to create a six-part series using footage that had not been previously broadcast. This saved money by not needing to send out a film crew and was only possible because the footage was accurately logged and marked as unused.
- Silly mistakes – A news package editor substituted ‘nice looking’ background scenery from an earlier project. They did not know that the GPS metadata would be transcribed into a footer in the client player showing the location of the shot. In the middle of a high-profile news item, the strap-line suddenly described the location as being somewhere else entirely. The news outlet’s credibility was damaged because the editor was unaware of the downstream processes. Check and clean the metadata in the finished product and ensure everyone fully understands the pipeline where it will be routed.
- Proving prior art – If cases of copyright infringement, plagiarism or IP ownership crop up after the program has been aired, you need to be able to prove you did the due-diligence for including material from other sources or that your content was created first. Lack of evidence leads to lost legal cases.
There Can Be Only One
One of the fundamental rules in database design and software development in general is that any data value should only ever be defined just ONCE! There are many programming techniques that help you define constant values in code using a symbolic name. Databases are optimized to relate tables to each other so the same source data cells can be used in different contexts.
That singular definition of the value should be inherited everywhere it is needed. The moment you define any data values a second time, you have introduced a potential maintenance nightmare.
It is possible to build code that picks up these unique values from other domains and programming paradigms.
Automated Metadata Extraction
With the orders of magnitude increase in footage being acquired, it is no longer humanly possible to scan and log everything manually. Technology is evolving to fill the gap. Automation can extract such metadata as is embedded in the footage by modern cameras. A second pass analyzes the content for semantic meaning. A lot of useful information can be deduced automatically:
- Coding format, frame rate, image size, video standards (PAL/NTSC/Interlace/Progressive/pull-down).
- Facial recognition makes educated guesses about the identity of people in the frame.
- Voice recognition can back that up to reduce uncertainty.
- Scene detection can record useful bookmarks where cuts occur.
- Background music recognition can potentially identify rights and licensing issues.
- Where previously finished programs are being ingested, OCR techniques can extract burned in text, slates, title screens and end-credits.
- Copyright ownership and creation date can be extracted from the end credits. Sometimes that text is very small and moving quickly. A human eyeball can discriminate this and read the text but OCR may struggle.
- Sub-title streams can be extracted and stored as VTT files.
- Geographic location detection when there is no camera GPS data is possible with reverse image searching techniques.
Off-air recorded programs should contain embedded EPG metadata. This can be parsed with a data detector to infer various properties. The conventions of EPG data are generally followed very well although there are occasional uncertainties, formatting oddities, typographical and spelling errors and outright mistakes. I wrote a data detector a few years ago and extracted these properties in an off-air recording system:
- Broadcast channel.
- Broadcast date and time.
- Program or series title.
- Whether audio description was included.
- Whether there were subtitles available.
- Whether the program was BSL signed.
- Whether an alternative broadcast in HD was available.
- Episode name.
- The episode number if available.
- Square bracketed dates ([YYYY]) at the end, which implied that this was a movie.
- Synthesized links to IMDB, Wikipedia and Amazon.
- Cast names and director when available.
- Text outlining the program content as a short description.
Reconciling this against a database of known broadcast schedules (see BBC Genome) provided additional metadata.
A useful metadata feed into the asset management database is possible without human intervention. Manual inspection and moderation is important for quality assurance until it is proven to be accurate and reliable.
Document Centric Vs. Application Centric Design
If you conceive your workflow based on individual apps, you are limited by the file formats they support. Working out a streamlined path through the workflow becomes more difficult.
Turn the problem inside out by making the workflow document-centric rather than application-centric. Then find apps and tools that can operate on those document types. This simplifies the interchange between different parts of your workflow. The concept is illustrated here with a simple Rich Text Format document.
Problems are sometimes easier to solve when they are translated into a different domain.
Metadata & Data Models
Each metadata standard serves a different discipline. No single standard covers everything and some schemas are informal. Metadata needs to:
- Describe content.
- Identify assets.
- Locate assets within the storage.
- Index the assets.
- Describe technical processes operating on the content.
- Provide administrative support.
- Control access permissions.
- Manage version control.
- Log access and usage.
- Track changes.
- Manage long-term archives.
- Facilitate conservation.
These involve the use of many different standards. Here are some that you might find useful. Use their profiles to constrain the range of values:
| Standard | Description |
|---|---|
| ISO 15836 | Dublin Core. |
| EBUCore | Based on Dublin Core for audio/visual content. |
| AESCore | Based on Dublin Core for audio content. |
| ISO 19115 | Geographic information. |
| ISO 14721 | Open Archival Information System (OAIS). |
| PREMIS | Preservation Metadata Implementation Strategies. |
| NISO MIX | Image content description. |
| PBCore | Used by Public Service Broadcasters. |
| ISO 15938 | MPEG-7 – Multimedia descriptions. |
These are starting points when designing your own data model to steer the workflow. More standards are described at the Digital Curation Centre (DCC) web site.
Relevant Standards
This is a list of the most relevant and helpful standards. Not all of these are for media applications. They may contain fragments of useful information to help you design a metadata model.
| Document | Vintage | Description |
|---|---|---|
| 5G-Media | - | Standards for delivery of media over 5G. Work in progress by the 5G PPP organization. |
| AES60id | 2020 | Information document about AESCore Audio Metadata. |
| AES-R9 | 2008 | Considerations for standardizing AES metadata sets. |
| CQL | 2023 | Context Query Language. Designed to be a human readable query language for searching web indexes, bibliographic archives and collections of assets. This is much easier to use than SQL. Google implements something similar for web searches. |
| CWA 13874 | 2000 | Dublin Core Metadata Element Set. |
| DCMI | DCMI | DublinCore Metadata Initiative described in RDF. This is the latest definitive description. |
| EBUCore | EBUCore | EBUCore is a set of descriptive and technical metadata based on the Dublin Core. |
| EBU CCDM | 2020 | The EBU Class Conceptual Data Model describes the various lifecycle stages for a media item. |
| EBUCorePlus | 2023 | The next generation that supersedes EBUCore and EBU CCDM. |
| EG 39 | SMPTE Metadata Dictionary. | |
| EXIF | 2023 | Metadata embedded in image files. Usually created by the camera that took the picture. Originally developed for still photographs. Version 3.0 is the latest revision. |
| IPI Code | - | The ASCAP Interested Party Information code is used when tracing rights holders. It replaces the earlier CAE identifiers. Use this code to identify all works relating to an individual. Used with ISWC numbers. See ISO 27730. |
| ISAN | 2002 | International Standard Audiovisual Number. This uniquely identifies a program or episode of a series. You would occasionally see these on the end credits. It is conceptually similar to the ISBN used to identify books and the ISSN for magazines. See ISO 15706. |
| ISCI | 2012 | International Standard Collection Identifier. See ISO 27730. |
| ISLI | 2015 | International Standard Link Identifier for connecting assets together using links. See ISO 17316. |
| ISMN | 2021 | International Standard Music Number. See ISO 10957. |
| ISNI | 2024 | International Standard Name Identifier for identifying individuals who are contributors to content assets. See ISO 27729. |
| ISO 639 | 2023 | Code for individual languages and language groups. Parts 4 and 6 have been withdrawn. |
| ISO 639-1 | 2002 | Two-letter language identifiers for major, mostly national individual languages. |
| ISO 639-2 | 1998 | Three-letter language identifiers for a larger number of widely known individual languages and a number of language groups. |
| ISO 639-3 | 2007 | Three-letter language identifiers covering all individual languages including living, extinct and ancient languages. |
| ISO 639-5 | 2008 | Three-letter language identifiers covering a larger set of language groups, living and extinct. |
| ISO 3166-1 | 2020 | Country codes in 2 and three letter formats. |
| ISO 3166-2 | 2013 | Defines codes for identifying the principal subdivisions (e.g., provinces or states) described in ISO 3166-1. |
| ISO 3166-3 | 2020 | Defines codes for country names which have been deleted from ISO 3166-1. |
| ISO 2709 | 2008 | MAchine-Readable Cataloging (MARC). Designed for bibliographic data management by the Library of Congress. |
| ISO 3901 | 2001 | Definition of an ISRC. |
| ISO 10957 | 2021 | Definition of an ISMN. |
| ISO 14721 | 2025 | OAIS – Open Archival Information Systems. |
| ISO 15444-14 | 2013 | IJPXML – XML representation of Metadata for JPEG 2000. |
| ISO 15706 | 2002 | Definition of an ISAN. |
| ISO 15706-1 | 2007 | Definition of a V-ISAN. |
| ISO 15707 | 2022 | Definition of an ISWC. |
| ISO 15836 | 2017 | Dublin Core Metadata Initiative (DCMI). |
| ISO 15938 | - | MPEG-7 – Content description metadata (Multiple separate parts published). |
| ISO 15836 | 2017 | DublinCore Metadata Initiative. |
| ISO 16684-1 | 2019 | Adobe Extensible Metadata Platform (XMP) – Data model. |
| ISO 16684-2 | 2014 | Adobe Extensible Metadata Platform (XMP) – Description of XMP schemas using RELAX NG. Reconfirmed in 2020. |
| ISO 17316 | 2015 | Definition of an ISLI. |
| ISO 17316 | 2015 | Definition of an ISLI. |
| ISO 19115 | 2014 | A standard for identifying geographic locations. This could be used for tagging footage shot on location. |
| ISO 19116 | 2019 | GPS positioning & locating. |
| ISO 19501 | 2005 | Unified Modelling Language (UML) version 1.4.2. Currently under review. |
| ISO 19566-3 | - | JPEG Feature list and box types. |
| ISO 19566-5 | - | JPEG – JUMBF – Universal metadata box format. |
| ISO 19566-8 | 2023 | JPEG – Snack – Metadata store inside J2K files. |
| ISO 20775 | 2009 | Describes how to record the holdings in an archive system. This standard describes how to manage physical and electronic assets. |
| ISO 21000 | - | MPEG-21 – Rights control for access to content (Multiple separate parts published). |
| ISO 23002 | - | MPEG-C – MPEG Media Tool Library. (Multiple separate parts published). |
| ISO 23003-7 | 2022 | MPEG-SEI – Supplemental Enhancement Information and MPEG-VSEI – Versatile Supplemental Enhancement Information. |
| ISO 23091 | - | MPEG-CICP – Coding- independent code points (Multiple separate parts published). |
| ISO 23950 | 1998 | Describes the Z39.50 interface for integrating remote databases with a central search engine process. |
| ISO 25577 | 2013 | Information and documentation – MarcXchange. An XML formatted version of the MARC data format. |
| ISO 27729 | 2024 | Definition of an ISNI. |
| ISO 27730 | 2012 | Definition on an ISCI. |
| ISRC | 2001 | International Standard Recording Code for identifying sound recordings. See ISO 3901. |
| ISWC | 2022 | International Standard Musical Work Code for identifying musical works. See ISO 15707. |
| PBCore | 2015 | Based on DublinCore and used in America for the management of assets used in Public Service Broadcasting. |
| PREMIS | 2015 | An international standard for metadata to support the preservation of digital objects and ensure their long-term usability. It is developed by the American Library of Congress in collaboration with other organizations. |
| RDF | 2014 | Resource Description Framework designed by W3C. Version 1.1 is current. |
| RFC 2413 | 1998 | Original ITEF description of the Dublin Core. |
| RFC 5013 | 2007 | Dublin Core Metadata Initiative (DCMI) published by the IETF. |
| RFC 4646 | 2006 | Tags for Identifying Languages. |
| RFC 8866 | 2021 | SDPSession Description Protocol. |
| SDP | 2021 | Session Description Protocol. Standardized by the IETF. See RFC 8866. |
| SQL | 2023 | First released in 1974. Subsequently standardized by ANSI and ISO. Most database systems diverge from the standard I various ways so code needs to be ported carefully when moving to a different database engine. |
| ST 2110-41 | 2024 | SMPTE ST 2110 Fast Metadata. |
| TECH 3293 | 2020 | Describes EBUCore version 1.10 – Based on the DublinCore. |
| TR 32 | 2015 | EBU document describing support for ISO 25938-9 audio-visual profile. |
| UML | 2017 | OMG has developed a later version (2.5.1) but this is not widely adopted. |
| V-ISAN | 2007 | An ISAN code with the version number filled in. The version number is usually set to a string of zero characters otherwise. See ISO 15706 part 2. |
| XMP | 2019 | Adobe Extensible Metadata Platform. |
| Z39.50 | 2003 | Standards for CQL managed by the Library of Congress. The original publication of this standard pre-dates the World-Wide-Web and is being worked on to make it more relevant to modern users. |
| Z39.85 | 2013 | Dublin Core Metadata Initiative (DCMI) published by ANSI/NISO. |
Metadata Is Vital To Your Success
Streaming services require significantly more metadata than traditional broadcast ever required. It becomes increasingly important as content is shared between organizations.
Previously, ‘job knowledge’ and scribbled notes were sufficient when programs were expected to air just once and never be seen again. With on-demand viewing, that is no longer the case. Viewers only find content if the metadata accurately describes it.
Content is now available indefinitely and the so called ‘long-tail-economy’ ensures that it can continue to generate valuable revenue long after it has been aired. This can only work to your benefit if you have maintained the necessary metadata to license and re-distribute your content. Repurposing to new platforms that do not yet exist is also only possible if the supporting metadata allows it.
Traditional broadcasting lacked a communications channel back to the service provider so monitoring analytical data was conducted on an estimated best-efforts basis. Modern streaming platforms can very accurately measure the viewing statistics and report back instantly. This can be fed back to the content store as metadata to identify content that is more popular.
These Appendix articles contain additional information you may find useful:
Supported by
You might also like...
Standards: Video - High Efficiency Video Coding (HEVC)
Designed to halve the bitrate of AVC while supporting resolutions up to 16K, HEVC represents a significant leap in video coding efficiency. This guide explores its profiles, tiers and levels, and examines whether it can overcome the challenges of entrenched…
Discoverability And Findability: Part 1 – Services Of General Interest In The Streaming Age
In this first of two parts on content discoverability and findability we discuss the underlying issues and challenges facing broadcasters as they strive to stay relevant in an era of mass streaming and AI-based search. The second part will then…
Production–Delivery Convergence: Part 6 - Designing Experiences That Viewers Trust
Performance reliability is an invisible contract between a streaming service and its customer, and it is fundamental to guaranteeing viewer retention. The problem is that performance isn’t just about delivery. Here we identify where to look and why it’s c…
Standards: Video - Advanced Video Coding (AVC)
AVC remains one of the most widely deployed video codecs in the world, but navigating its profiles, levels and signaling mechanisms is far from straightforward.
Live Sports & Monetization: Public Service Broadcasters Maximizing Live Sports Opportunities
PSBs across the world are making the most of limited resources to enrich live sports coverage around ancillary content and platforms, and monetizing the resulting services. Here we focus on the content and coverage rather than technical issues around workflow…