Skip to the content.

PAGURI: PROMPT AUDIO GENERATOR USER RESEARCH INVESTIGATION

Abstract

In recent years, text-to-music models have been the biggest breakthrough in automatic music generation. While they are unquestionably a showcase of technological progress, it is not clear yet whether and how they can realistically be integrated and used as effective tools in the artistic practice of musicians and music practitioners. This paper aims to address this question via Prompt Audio Generation User Research Investigation (PAGURI), a user experience study where we leverage recent text-to-music developments to study how musicians and practitioners interact with these systems, evaluating their satisfaction levels. We developed an online tool where users are asked to generate and/or modify music samples of their choice by applying recently proposed personalization techniques, based on fine-tuning and/or prompting a text-to-music model. Through the use of questionnaires and interviews, we analyzed how participants interacted with the proposed tool. Results show that even if the audio samples generated and their quality may not always meet user expectations, the majority acknowledge consistency between input prompts and generated audio. Moreover, they express willingness to incorporate the proposed tool into their creative process. Users also offer suggestions on how the system could be improved and integrated into their music practice.

paGUIri

The graphical user interface of PAGURI.

my alt text
1) paGUIri Graphical User Interface.
my alt text
2) paGUIri audio generation window.
my alt text
3) paGUIri model personalization window.
missing
4) Options: available models for generation.
missing
5) Options: fine-tuning time.

Questionnaire Description

The answers to the questionnaires are of 5 types:

Data Questionnaire and Answers

๐Ÿ“Š ๐Ÿ”— Musical knowledge and experience with AI tools survey (questions and answers)

๐Ÿ“Š ๐Ÿ”— Text-to-Music Interaction (Model Evaluation) survey (questions and answers)

๐Ÿ“Š ๐Ÿ”— Final Survey (questions and answers)

Users' interactions with the model

๐Ÿ”— Users' interactions, prompts used as input for generation, and model personalization history can be found here ๐Ÿ‘‰ User interactions with the model