Adobe demos “photoshop for audio,” lets you edit speech as easily as text

Ars Technica:

Adobe has demonstrated tech that lets you edit recorded speech so that you can alter what that person said or create an entirely new sentence from their voice. It seems inevitable that it will eventually be referred to as “photoshop but for audio.”

VoCo works by ingesting a large amount of voice data (about 20 minutes right now, but that’ll be improved), breaking it down into phonemes (each of the distinct sounds that make up a spoken language), and then attempting to create a voice model of the speaker—presumably stuff like cadence, stresses, quirks, etc., but Adobe hasn’t provided much detail yet.

Yeah this won’t be used for bad purposes. Nope, not at all.