Audio is one of the more overlooked aspects of content creation. You can spend hours setting up the perfect shot, tinkering with the lighting, angles, and bokeh, but all that can be ruined if you donβt consider background noise. Who wants to hear the honks of traffic while watching an interview?
ElevenLabs, the text-to-speech AI platform that raised $80 million from A16Z, Sequoia, and others, just announced its newest tool, Voice Isolater. This tool can seemingly take even the noisiest videos, remove the background noise, and leave your dialogue sounding crystal clear.
Say you just recorded a podcast episode for YouTube, but halfway through, a fire engine speeds down the road, blaring its sirens. With the Voice Isolator tool, you can drag and drop that clip into the studio, and your audio gets cleaned up with the press of a button.
You can see for yourself by watching the demo. During it, a member of the team records audio while a leafblower goes off. With the press of a button, the leafblower is automatically muted, and you can hear the dialogue fairly clearly.
Alongside the user-facing platform, the team also announced an accompanying API for developers to build their platforms using the Audio Isolation technology. Once implemented, you can integrate the same AI-powered audio clean-up technology into your product.
If you want to give it a shot, it seems pretty easy to implement. In the launch demo, a team member uses Claude and Replit to quickly whip up a mock app that uses the API to clean up any YouTube videoβs audio just by pasting the URL.