“New” Audio: Sound to Match UHD Images

Last month, NHK covered the New York Yankees and Seattle Mariners baseball game in 8K with six Ikegami cameras at Yankee Stadium. The game was viewed in 8K by the media in a special suite.

While press coverage of the NHK experiment has naturally focused on the visual aspects of a multi-camera 8K production, I’ve been wondering how NHK will utilize the 22.2-channel audio scheme that is part of their Super Hi-Vision specification. This astounding surround specification forecasts that the 5.1 audio experience we are accustomed to from both HD and 4K UHD will undergo a major upgrade in the next few years.

22.2 channels of audio

Super Hi-Vision employs a very specific three-dimensional arrangement for the 22.2-channel system. At the front of the room left, center, and right speakers are placed in the traditional manner. Two sub woofers—providing localized LFE information—also will be placed at the front.

To support a third “depth” dimension, 10 surround speakers are employed. These speakers are placed mid-way between the floor and ceiling to provide three points of surround information on all four sides of the room. If you are counting speakers, our current 5.1 speakers have bloomed to a 13.2 arrangement. At the front, a total of 7 speakers provide two levels of height (low and middle) information. Figure 1 shows this arrangement plus the location of 9 ceiling speakers that provide surround information above the listeners.

Figure 1: Super Hi-Vision 22.2-channel array. <br />

Figure 1: Super Hi-Vision 22.2-channel array.

Two aspects of this diagram standout. First, this arrangement of speakers simply doesn’t seem practical for the home unless the home has a theater. (It certainly seems impractical for any Japanese flat I’ve visited.) Second, it shows a movie theater seating arrangement with a pair of projectors. 

Perhaps diagrams like this are nothing more than an artist’s conception of how a 22.2-channel array defined by engineers might look. Or, perhaps it illustrates how technology can leap ahead of any practical home application.

Dolby's Atmos

Dolby Labs has developed a far more reasonable surround system they call Atmos. This system is already being used in movie theaters as well as employed on Blu-ray discs. With an Atmos encoded disc playing in a Blu-ray player, via HDMI, a bitstream is sent to an Audio/Video receiver (AVR) that supports Atmos decoding. To the listener, playing an Atmos disc seems no different than playing any flavor of Digital Dolby surround sound. The differences are visible on the back of the AVR and in the room where speakers designed for the Atmos system are placed. 

Figure 2 shows the speaker connection panel of an AVR that supports 7.2.4 surround sound (the rightmost digit specifies the number of “overhead” sound sources).

Figure 2: This AVR speaker connection panel supports 7.2.4-channel audio

Figure 2: This AVR speaker connection panel supports 7.2.4-channel audio

Figure 3 shows a room with a 7.1.4 configuration. Dolby stresses that a wide range of speaker configurations are possible—including 5.1.2. The upward firing pair of speakers can be “tops” added to your existing front speakers. (See Figure 4.)

Figure 3: Dolby Atmos 7.1.4 configuration

Figure 3: Dolby Atmos 7.1.4 configuration

Figure 4: Atmos upward firing speaker “Tops”

Figure 4: Atmos upward firing speaker “Tops”

The magic, of course, is not simply in the critical design of the upward firing Atmos speaker. Atmos is based on the concept of audio objects. Each object includes the audio data itself plus metadata that dynamically defines its location in 3D space—allowing the audio object to move over time.

 Dolby Atmos supports up to 128 simultaneous audio objects. Included in these objects is a 7.1 “bed” that defines the fundamental nature of a mix and also serves as a fallback for non Atmos AVRs. The audio mixer and film director determine the location and action of each object and the Dolby system makes speaker-assignment decisions.

More magic occurs in the AVR. Dolby Atmos logic reads the metadata and determines how to use the speakers in your specific setup to best recreate the precise placement and movement of each object. Dolby notes that adding speakers increases the precision of audio placement. Potentially you can have up to 24 “floor-level” speakers and up to 10 “overhead” speakers.

DTS has announced it too has a system that employs audio objects. Called DTS:X, it can be written to Blu-ray discs and provides height information. DTS stresses the same mix can be used in theaters and homes. Within a DTS:X supported AVR, the DTS “speaker remapping engine” can support a wide range of speaker configuration “within a hemispherical layout.” In the future, DTS claims the engine will allow a listener to alter the dialog loudness independently of all other sounds. Manufacturers will begin to add DTS:X firmware to their AVRs in the fall.

The Ultra HD Blu-ray specification allows immersive, object-based sound formats—although they are not mandatory. Both Atmos and DTS:X will likely be supported.

At this point it is an open question whether Dolby Atmos and/or DTS:X will be supported by UHD streaming services. For each internet link to a viewer’s home there is a practical bandwidth value—which can be expected to vary over time.

The question for services like Netflix and Amazon is whether there are currently enough ISP links that can provide both a superb UHD motion picture and advanced surround sound formats to justify offering object-based audio. Remember, the promised HDR streaming itself will require greater data bandwidth.

Check the specs

Curious about the newest generation of AVRs, I went to the website of one of the high-end manufacturers. It was at this point I once again fell into the same kind of specification hell I visited in my Television 2015 article

Looking at their least expensive ($2000) AVR that supports Atmos I noticed that it did not support HDCP 2.2. Thankfully, the top-of-the-line ($3000) did support HDCP 2.2, which is necessary for playing Ultra HD Blu-ray discs. Once again I learned reading the fine-print is critical to purchasing anything related to UHD.

Read carefully the specifications of any AVS device. Manufacturers can make claims that sound and look good, but can be, in fact, technically wrong.

Read carefully the specifications of any AVS device. Manufacturers can make claims that sound and look good, but can be, in fact, technically wrong.

For example, a device can claim to support “4K at 60Hz” even though it has only an HDMI 1.4 port. How? Even though almost every UHD source, including Ultra HD Blu-ray, uses only 4:2:0 color sampling—the HDMI specification itself is based upon passing 4:4:4 color. Thus, HDMI 2.0 can transfer up to 2160p60 with 4:4:4 chroma sampling. HDMI 1.4 has the potential to operate in a mode where 4:2:0 can be transferred at 60p. I’m now careful to look at an HDMI specification to see if it specifies “2.0.”

Checking different brands is also critical. Another well-regarded company has an Atmos-equipped AVR that supports HDCP 2.2 and costs only $500. Yet other currently selling AVRs support neither HDMI 2.0 nor HDCP 2.2. These AVRs will not support Ultra HD Blu-ray players nor will you be able to connect future devices such as cable or satellite STBs that provide channels which broadcast live events such as sports at 2160p60.

To conclude on a lighter note—have you heard about HDMI 2.0a? Not to worry. This is not a new hardware connection. Rather, it is a new protocol that supports HDR metadata over HDMI 2.0 connections.

You might also like...

Designing IP Broadcast Systems: Why Can’t We Just Plug And Play?

Plug and play would be an ideal solution for IP broadcast workflows, however, this concept is not as straightforward as it may first seem.

Why Live Music Broadcast Needs A Specialized Music Truck

We talk to the multi-award winning team at Music Mix Mobile about the unique cultural and creative demands of mixing music live for broadcast.

An Introduction To Network Observability

The more complex and intricate IP networks and cloud infrastructures become, the greater the potential for unwelcome dynamics in the system, and the greater the need for rich, reliable, real-time data about performance and error rates.

Designing IP Broadcast Systems: Part 3 - Designing For Everyday Operation

Welcome to the third part of ‘Designing IP Broadcast Systems’ - a major 18 article exploration of the technology needed to create practical IP based broadcast production systems. Part 3 discusses some of the key challenges of designing network systems to support eve…

What Are The Long-Term Implications Of AI For Broadcast?

We’ve all witnessed its phenomenal growth recently. The question is: how do we manage the process of adopting and adjusting to AI in the broadcasting industry? This article is more about our approach than specific examples of AI integration;…