Auto-Mix As One Element On The Way To Object-Based Audio

The world of broadcast audio is about to reach new levels as the industry embraces the future with Next Generation Audio (NGA). While precisely which features will be offered remains unknown, several immersive 3D formats are already under development and will soon find their way into broadcast production and distribution.

Unlike the constrained world of channel-based coding, these new NGA codecs will support more channels and/or object-based audio coding. For the consumer, there will be two major benefits: a greater sense of involvement or immersion, and a degree of personalisation.

Immersive 3D audio is undoubtedly one aspect of Next Generation Audio. In contrast, personalised audio can also be incorporated into traditional, standard channel-based formats such as stereo or mono. The key to enabling such features is Object-Based Audio (OBA).

Inside object-based audio

Object-Based Audio will give users the option of personalising their experience by selecting from a number of audio sources and controlling their level, and perhaps even their position, in the mix. With OBA, an “object” is essentially an audio stream with accompanying descriptive metadata. The metadata carries information about where and how to render the object in the reproduced mix.
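
To make the idea concrete, here is a minimal sketch in Python of an object as an audio buffer plus a small metadata record. The field names are purely illustrative assumptions, not taken from any NGA specification.

```python
# Minimal sketch of an "audio object": a signal plus descriptive metadata.
# Field names are illustrative only and not taken from any NGA specification.
from dataclasses import dataclass
import numpy as np

@dataclass
class AudioObject:
    label: str                 # e.g. "commentary_en"
    samples: np.ndarray        # mono audio, float32, range -1.0 .. 1.0
    gain_db: float = 0.0       # default level relative to the audio bed
    azimuth_deg: float = 0.0   # "where" metadata; unused in this mono sketch

def render(bed: np.ndarray, objects: list) -> np.ndarray:
    """Sum the selected objects onto the audio bed using their metadata gains."""
    mix = bed.copy()
    for obj in objects:
        mix = mix + obj.samples * (10.0 ** (obj.gain_db / 20.0))
    return mix
```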

That might sound complicated, but in fact it is very straightforward. Two versions of a commentary plus one mono FX stream already constitute an OBA format, and the ability to choose between one commentary track and the other is already personalised audio. This works as long as both commentary tracks reach the recipient's home as separate audio channels and are not mixed into the audio bed.
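
Continuing the hypothetical sketch above, the viewer's choice amounts to deciding which commentary object is summed with the untouched FX bed at the receiver.

```python
# Continuing the hypothetical sketch above: two commentary objects and an FX bed.
fs = 48_000
fx_bed = np.zeros(fs, dtype=np.float32)   # stand-in for one second of the FX stream
commentary_en = AudioObject("commentary_en", np.zeros(fs, dtype=np.float32))
commentary_de = AudioObject("commentary_de", np.zeros(fs, dtype=np.float32))

# Personalisation at the receiver: the viewer simply picks one commentary object.
chosen = commentary_de
programme_mix = render(fx_bed, [chosen])
```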

In common parlance, Auto-Mix means balancing dynamic input levels so they have equal power output at the summation point. This can also be described as conference auto-mixing, where unused microphone channels receive less gain and noise and crosstalk are therefore automatically reduced.
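
As a rough illustration of that gain-sharing principle, and not of any particular manufacturer's implementation, a conference auto-mixer can be sketched as follows.

```python
# Sketch of gain-sharing conference auto-mixing (the general idea only): each
# microphone gets a share of the total gain in proportion to its short-term
# energy, so unused channels are turned down and the summed output power stays
# roughly constant.
import numpy as np

def automix_frame(frame: np.ndarray, floor: float = 1e-9) -> np.ndarray:
    """frame: (channels, samples) block of mic audio -> auto-mixed mono block."""
    energy = np.mean(frame ** 2, axis=1) + floor   # short-term power per channel
    gains = energy / energy.sum()                  # gain shares sum to 1.0
    return (frame * gains[:, None]).sum(axis=0)    # weighted sum of all mics
```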

Figure 1. Jünger’s audio technology enables separate feeds to be automatically mixed into a programme feed, as shown in this diagram.

Another Auto-Mix method is A/B crossfade, in which a crossfade from source A to source B is automatically performed in response to a pre-defined trigger. If a sequence of audio elements is being used to create the audio programme, then a typical procedure would involve sequentially switching the sources – for example, presentation, clip, promo, presentation, clip and so on.
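
A minimal sketch of such a triggered crossfade, assuming mono sources and an equal-power fade curve, might look like this.

```python
# Sketch of a triggered A/B crossfade: on the playout trigger, the outgoing
# source fades down while the incoming source fades up over a fixed overlap.
import numpy as np

def ab_crossfade(a: np.ndarray, b: np.ndarray, fs: int, fade_s: float = 1.0) -> np.ndarray:
    """Crossfade from the tail of source A into the head of source B (mono)."""
    n = int(fs * fade_s)
    t = np.linspace(0.0, 1.0, n)
    fade_out = np.cos(t * np.pi / 2.0)   # equal-power curves keep the perceived
    fade_in = np.sin(t * np.pi / 2.0)    # level roughly constant through the fade
    overlap = a[-n:] * fade_out + b[:n] * fade_in
    return np.concatenate([a[:-n], overlap, b[n:]])
```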

A true mix results from Auto-Voice-Over mixing, in which one audio element is laid over the audio bed. This kind of Auto-Mix can be triggered by the producer or by an automation system, which takes a level-controlled ‘voice’ input and lays it over the audio bed in a process known as ‘ducking’.
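
A simple sketch of that ducking behaviour, with assumed threshold and attenuation values, could look like this.

```python
# Sketch of Auto-Voice-Over 'ducking': while the level-controlled voice input is
# active, the bed gain drops by a fixed amount; smoothing avoids audible jumps.
import numpy as np

def duck_bed(bed: np.ndarray, voice: np.ndarray, fs: int,
             threshold: float = 0.02, duck_db: float = -12.0,
             smooth_ms: float = 100.0) -> np.ndarray:
    """Return bed + voice, with the bed attenuated while the voice is present."""
    duck_gain = 10.0 ** (duck_db / 20.0)
    target = np.where(np.abs(voice) > threshold, duck_gain, 1.0)
    alpha = np.exp(-1.0 / (fs * smooth_ms / 1000.0))   # one-pole gain smoothing
    gain = np.empty_like(target)
    g = 1.0
    for i, tgt in enumerate(target):
        g = alpha * g + (1.0 - alpha) * tgt
        gain[i] = g
    return bed * gain + voice
```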

The question one needs to ask is which of these Auto-Mix methods is most relevant.

Jünger Audio D*AP8 digital audio processor.

Choosing your tools

One of the major challenges for the production industry will be to create OBA production strategies. This means completely rethinking how a final mix is created, because with OBA, it will be performed at home by the viewer rather than by a mixer in a post facility.

Keep in mind that as soon as a post house mixes objects (different language commentaries, for example) into the audio bed, they are gone and are no longer available for personalisation by the viewer. To give viewers the chance for personalised audio, the production workflow must change and deliver separate ‘unmixed’ channels so that the home receiver and decoder can finish the final mix. This is very different to how we currently mix for surround or for standard stereo audio.
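
One way to picture the delivery side of that change is a hypothetical manifest accompanying the unmixed channels; the layout and field names below are illustrative assumptions, not any broadcaster's actual delivery format.

```python
# Purely illustrative delivery layout: the bed and each commentary travel as
# separate, unmixed channels, and the metadata tells the receiver how to finish
# the mix.
delivery_manifest = {
    "bed": {"channels": [1, 2], "layout": "stereo"},
    "objects": [
        {"label": "commentary_en", "channel": 3, "default_gain_db": 0.0, "selectable": True},
        {"label": "commentary_de", "channel": 4, "default_gain_db": 0.0, "selectable": True},
    ],
}
```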

Take the next step

The first step is education: help audio production staff understand this new way of working, in which they no longer create the final mix. Metadata is key to successful implementation.

The next step is to review current workflows. As content is created or added to a mix, make sure the accompanying metadata survives. That metadata is the key to object-based audio tracks surviving post-production for delivery to the broadcast transmission facility.
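
A simple sanity check of that kind might look like the hypothetical sketch below, which flags any track that reaches the delivery stage without its descriptive metadata.

```python
# Hypothetical workflow check: every object track handed to transmission must
# still carry its descriptive metadata after post-production.
def tracks_missing_metadata(tracks: dict) -> list:
    """tracks maps a track name to its metadata record (possibly empty)."""
    required = {"label", "default_gain_db"}
    return [name for name, meta in tracks.items() if not required <= meta.keys()]

print(tracks_missing_metadata({
    "commentary_en": {"label": "commentary_en", "default_gain_db": 0.0},
    "commentary_de": {},          # metadata lost somewhere in the chain
}))
# -> ['commentary_de']
```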

As consumers seek out new customisable and immersive audio environments, broadcasters who can supply them will benefit. Content and program production facilities can plan now for the necessary tools to enable these new features. A related benefit of implementing the changes will be a faster and more cost-effective production workflow.

Peter Poers, Managing Director, Jünger Audio.

Peter Poers, Managing Director, Jünger Audio.

Let us know what you think…

Log-in or Register for free to post comments…

You might also like...

UK HPA Tech Retreat Report - Day 3

Tuesdays HPA Tech Retreat was all about 360 and VR, and Wednesday focused on the versioning explosion. On the final day, delegates were given a summary of the current state of the industry, and the influences of artificial intelligence on media…

UK HPA Tech Retreat Report - Day 2

Yesterday’s 2017 HPA Tech Retreat in Oxford, UK, was all about VR and 360, and on Wednesday, they moved to the thorny issue of the versioning explosion. As broadcasters seek wider audiences over different platforms, localisation has become a big issue.…

UK HPA Tech Retreat Report - Day 1

Set in the idyllic surroundings of Heythrop Park, Oxfordshire - UK, this year’s Hollywood Professional Association Tech Retreat was brimming with innovation, demonstrations and expert industry speakers. VR and 360 dominated day one with production and technical speakers battling out…

AES67 Offers Unique Benefits for Remotes: Telos Alliance Explains

The AES67 audio standard provides unique benefits for audio networking to accommodate remote broadcasts and multi-channel immersive audio recording. Greg Shay, Chief Technology Officer (CTO) at The Telos Alliance, explains how audio engineers can benefit by using the technology to…

Plugins vs. Hardware: An Argument for the Ages

Since the world’s first audio recording in 1860, there have been legendary technical disputes in the field that are never settled. One more recent one is the question of which is better: digital plugins or hardware components? Debate is fiery…