Facial recognition - game changer
Video compression solutions provider V-Nova and Metaliquid, an AI video analysis solutions provider, have announced a strategic partnership to develop and commercialize products for machine learning powered content indexing.
To effectively deliver video analysis, a combination of speed and accuracy is paramount, states V-Nova. Currently, broadcasters can only afford to analyze a small portion of their media archive or a limited sample of frames. “They are often forced to reduce the resolution at which the analysis is performed because it’s faster and cheaper to process,” it argues. “However, lower resolutions lose details, which reduce the accuracy when recognizing key features like faces or the OCR of small text.
After a proof-of-concept was shown at IBC 2019, Metaliquid’s video analysis solution and V-Nova’s PPro (previously PERSEUS Pro) have been combined to deliver an AI-powered software library for encoding and decoding SMPTE VC-6, which uses a hierarchical approach to represent images.
Guendalina Cobianchi, SVP Business Development & Partnerships at V-Nova, comments “PPro is very smart: each video frame includes multiple levels of resolution and you can not only selectively access these resolutions, but also decode specific areas of the frame that are important for the analysis, plus it’s extremely fast. We can perform each video analysis task on the most appropriate set of pixels without having to trade off speed and accuracy.”
The initial proof-of-concept demonstrated an “outstanding” 3.2x performance gain thanks to the use of PPro instead of JPEG, which combined with the performance of Metaliquid’s algorithm, outperformed by “over an order of magnitude the benchmarked solutions currently used by the sponsors, with a comparable or higher accuracy level.”
Even further gains are expected during productisation.
Testimonies in support of their claims include:
Alan Winthroub, Director, Software Engineering at AP says, “We have one of the world’s largest multimedia archives and its long-term value is dependent on powerful indexing to deliver the rich metadata to make it discoverable. The step-change in performance this catalyst project has delivered means we can process more content, more quickly while generating richer data.”
Deirdre Temple, Head of Solutions, Transformation & Technology at RTÈ says “If we have an election or referendum, as a public service provider we need to show that we're giving balanced coverage for all parties involved, or for both sides of the debate. Providing this data is very labour-intensive, during a live debate we use stopwatches. Ideally, we would like to have real time data available online to show our balanced coverage throughout a campaign. Metaliquid and V-Nova’s solution that has emerged from the Catalyst programme is a real game-changer for us”.
Simone Bronzin, CEO and founder at Metaliquid said “The full-stack control and customization capabilities we have over our proprietary deep-learning technology has yielded products that respond to industry challenges that are not easy to solve with general purpose AI solutions. Increasing the performance of image encoding and decoding in our workflow was another important step in offering a best-in-class solution. We are tremendously excited by the results of the catalyst project and look forward to delivering this step-change solution to the market very soon.”
You might also like...
In the UK we have Oxford v Cambridge. In the USA it’s Princeton v Harvard. The only difference is that one is a boat race and the other is computer architecture race.
A recent Lawo remote activities case study notes, “It should be obvious by now that remote operation has been seriously underrated. For some, it allows to save substantial amounts of money, while others will appreciate the time gained from not…
The history of computing has been dominated by the von Neumann computer architecture, also known as the Princeton architecture, after the university of that name, in which one common memory stores the operating system, user programs and variables the programs…
Our first Essential Insights is a set of three video episodes in which we discuss transitioning to IP with industry experts. We explore the fundamental challenges during the planning stage. The decisions that need to be made, and the long-term…
After years of trial and error designed to reduce operating cost and (more recently) keep crews safely distanced, remote production has found its niche in live production and will remain the de facto method for producing events over a distributed…