P5-01: Sonus Texere! Automated Dense Soundtrack Construction for Books using Movie Adaptations
Subjects (starting with primary): Human-centered MIR -> human-computer interaction ; Applications -> music videos, multimodal music systems ; MIR fundamentals and methodology -> web mining, and natural language processing ; Musical features and properties -> musical affect, emotion and mood ; Applications -> music retrieval systems ; MIR fundamentals and methodology -> multimodality
Presented Virtually: 10-minute long-format presentation
Reading, much like music listening, is an immersive experience that transports readers while taking them on an emotional journey. Listening to complementary music has the potential to amplify the reading experience, especially when the music is stylistically cohesive and emotionally relevant. In this paper, we propose the first fully automatic method to build a dense soundtrack for books, which can play high-quality instrumental music for the entirety of the reading duration. Our work employs a unique text processing and music weaving pipeline that determines the context and emotional composition of scenes in a chapter. This allows our method to identify and play relevant excerpts from the soundtrack of the book's movie adaptation. By relying on the movie composer's craftsmanship, our book soundtracks include expert-made motifs and other scene-specific musical characteristics. We validate the design decisions of our approach through a perceptual study. Our readers note that the book soundtrack greatly enhanced their reading experience, due to high immersiveness granted via uninterrupted and style-consistent music, and a heightened emotional state attained via high precision emotion and scene context recognition.