P4-03: Automatic music mixing with deep learning and out-of-domain data
Martinez Ramirez, Marco A*, Liao, WeiHsiang, Nagashima, Chihiro, Fabbro, Giorgio, Uhlich, Stefan, Mitsufuji, Yuki
Subjects (starting with primary): MIR fundamentals and methodology -> music signal processing ; MIR tasks -> music synthesis and transformation ; Domain knowledge -> machine learning/artificial intelligence for music ; Applications -> performance, and production ; Musical features and properties -> musical affect, emotion and mood
Presented In-person, in Bengaluru: 4-minute short-format presentation
Music mixing traditionally involves recording instruments in the form of clean, individual tracks and blending them into a final mixture using audio effects and expert knowledge (e.g., a mixing engineer). The automation of music production tasks has become an emerging field in recent years, where rule-based methods and machine learning approaches have been explored. Nevertheless, the lack of dry or clean instrument recordings limits the performance of such models, which is still far from professional human-made mixes. We explore whether we can use out-of-domain data such as wet or processed multitrack music recordings and repurpose it to train supervised deep learning models that can bridge the current gap in automatic mixing quality. To achieve this we propose a novel data preprocessing method that allows the models to perform automatic music mixing. We also redesigned a listening test method for evaluating music mixing systems. We validate our results through such subjective tests using highly experienced mixing engineers as participants.