Music Generation via Masked Acoustic Token Modeling

In this talk from the recent workshop on Decoding Communication in Nonhuman Species, Bryan Pardo (Northwestern) presents work applying iterative decoding and masked acoustic token modeling to music audio synthesis. The outputs of this procedure can range from high-quality audio compression to variations on the input music that match it in style, genre, beat, and instrumentation while varying specifics of timbre and rhythm.
