Google DeepMind’s new AI tool uses video pixels and text prompts to generate soundtracks

Google DeepMind has taken the wraps off of a new AI tool for generating video soundtracks. In addition to using a text prompt to generate audio, DeepMind’s tool also takes into account the contents of the video.

By combining the two, DeepMind says users can use the tool to create scenes with “a drama score, realistic sound effects or dialogue that matches the characters and tone of a video.” You can see some of the examples posted on DeepMind’s website — and they sound pretty good.

For a video of a car driving through a cyberpunk-esque cityscape, Google used the prompt “cars skidding, car engine throttling, angelic electronic music” to generate audio. You can see how the sounds of skidding match up with the car’s movement. Another example creates an underwater soundscape using the prompt, “jellyfish pulsating under water, marine life, ocean.”

Even though users can include a text prompt, DeepMind says it’s optional. Users also don’t need to meticulously match up the generated audio with the appropriate scenes. According to DeepMind, the tool can also generate an “unlimited” number of soundtracks for videos, allowing users to come up with an endless stream of audio options.

That could help it stand out from other AI tools, like the sound effects generator from ElevenLabs, which uses text prompts to generate audio. It could also make it easier to pair audio with AI-generated video from tools like DeepMind’s Veo and Sora (the latter of which plans to eventually incorporate audio).

DeepMind says it trained its AI tool on video, audio, and annotations containing “detailed descriptions of sound and transcripts of spoken dialogue.” This allows the video-to-audio generator to match audio events with visual scenes.

The tool still has some limitations. For example, DeepMind is trying to improve its ability to synchronize lip movement with dialogue, as you can see in this video of a claymation family. DeepMind also notes that its video-to-audio system is dependent on video quality, so anything that’s grainy or distorted “can lead to a noticeable drop in audio quality.”

Source link

Breaking News

Greek Salad – Budget Bytes

This battery floodlight camera is just what my dark yard needed (and it’s on sale)

Does your kid need a new tablet? Get a kids’ tablet for as low as $85 on Amazon

Deal alert: Save almost 60% on one of the best robot vacuum brands I’ve tested — but hurry

The first look at DC’s Supergirl movie is hiding a comic-accurate scene from one of Kara’s best adventures

Google Home users can ask Gemini to control their smart homes now – how it works

Best Internet Providers in Metairie, Louisiana

The Best Air Fryer Salmon Recipe (With Maple Soy Glaze)

Stranded NASA Astronaut on the ISS Takes a Spacewalk, With Another Planned

Greek Salad – Budget Bytes

This battery floodlight camera is just what my dark yard needed (and it’s on sale)

Does your kid need a new tablet? Get a kids’ tablet for as low as $85 on Amazon

Deal alert: Save almost 60% on one of the best robot vacuum brands I’ve tested — but hurry

Google DeepMind’s new AI tool uses video pixels and text prompts to generate soundtracks

More From Author

Greek Salad – Budget Bytes

This battery floodlight camera is just what my dark yard needed (and it’s on sale)

Does your kid need a new tablet? Get a kids’ tablet for as low as $85 on Amazon

+ There are no comments

Cancel reply

Doctor Who and Game of Thrones stars cast in one of the most highly-anticipated book adaptations

Nintendo Direct June 2024: all the news and trailers

You May Also Like:

Greek Salad – Budget Bytes

This battery floodlight camera is just what my dark yard needed (and it’s on sale)

Does your kid need a new tablet? Get a kids’ tablet for as low as $85 on Amazon

Deal alert: Save almost 60% on one of the best robot vacuum brands I’ve tested — but hurry

The first look at DC’s Supergirl movie is hiding a comic-accurate scene from one of Kara’s best adventures

Google Home users can ask Gemini to control their smart homes now – how it works

Best Internet Providers in Metairie, Louisiana

The Best Air Fryer Salmon Recipe (With Maple Soy Glaze)

Breaking News

Top Tagged

+ There are no comments

Doctor Who and Game of Thrones stars cast in one of the most highly-anticipated book adaptations

Nintendo Direct June 2024: all the news and trailers