Final Survey

Questions and type of questions

ID	Question	Question type
Q24	I enjoyed the interaction with the text-to-music generation system	A
Q25	The data generated by the model are consistent with respect to the desired audio file provided by the user	A
Q26	The waiting time of the fine-tuning of the model is proportionate with the quality of the generated audio	A
Q27	The audio generated by the personalized model is better than the audio generated by the based model	A
Q28	There is consistency between input prompt and audio(s) generated	A
Q29	The use of text-to-music models can support musicians in musical creation endeavours	A
Q30	I would use this system again	A
Q31	Would you include this workflow in your music creative process? How?	D
Q32	In what context you would use the audio generated by the model?	O
Q33	If you personalized the model, inn what context you would use the audio generated by the fine-tuned model?	D
Q34	Do you think giving the opportunity to personalize the text-to-music model with own music is good? Why?	O
Q35	Insert here your suggestions and comments	O

Answers to the final survey

Q31) Would you include this workflow in your music creative process? How?

Participants who chose other gave the following answers:

Inspiration and post processing modification
I would use it for inspiration purposes and for modifying in post-generation
ispiration purpose and modify after generation
I would use it for specific exercises on rhythm and improvisation based on the generated audio *
Not at this time because the program does not process music according to my specifications *
I would use it for inspiration, include it as it is and include it after alteration

* answers translated

Q32) In what context you would use the audio generated by the model?

P1: "Song-writing"

P2: "it is possible to use to generate parameters value to control a synth in order to emulate the desired sound"

P3: "Music making and foleys"

P4: "music production to produce samples that can be modified, looped and integrated in a track"

P5: "I would use it to aid in music production with a DAW"

P6: "Beat making for music production*"

P7: "In beats and music productions, maybe soundtracks"

P8: "cinematic and dance music"

P9: "more general requests"

P10: "During dance lessons*"

P11: "To create new songs by old songs and sounds."

P12: "If I really like the sound, maybe I would use it in the production of a song by changing and mixing it"

P13: "during a production session"

P14: "I would use it to find inspiration when writing music, or in the context of sound design"

P15: "Broaden your musical culture*"

P16: "To find the inspiration when i am stuck with the writing process of a new song. Or for making memes"

P17: "The base model can help to have inspiration to start*"

P18: "to compose new music"

P19: "Inspiration to start song/mixing ideas"

P20: "Music production"

P21: "Inspiring the composition of a piece of music"

P22: "to make personalized sound effects"

P23: "Generating drafts from a preliminary sound idea I have in my mind"

P24: "I would use it to build musical bases aimed at dance choreographies*"

* answers translated

Q33) If you personalized the model, in what context you would use the audio generated by the fine-tuned model?

P1: "Producing"

P2: "I did not use the personalization option of the model"

P3: "Music making, with a more personalized sound"

P4: "still in music production, to produce samples or perhaps in a creative installation"

P5: "I'd use a fine-tuned model to try and create a specific sound I'm looking for in music production, hoping that it would save time with respect to trying to create it with traditional means"

P6: "Beat making for music production*"

P7: "In beats and music productions, maybe soundtracks"

P8: "live music performance"

P9: "To create samples for my group"

P10: "I would not use it because it did not meet my expectations*"

P11: "To create new songs by old songs and sounds."

P12: "If I look for something in particular maybe to incorporate to the production of a song or for inspiration"

P13: "with synth one shots and drum loop"

P14: "I would use it in order to write music from a particular sub-genre"

P15: "Enter musical genres more aligned with my tastes*"

P16: "To take inspiration from a certain music style, or to obtain a certain type of audio that I need"

P17: "The custom model must adhere more closely to the requested specifications in order to be used*"

P18: "to compose new music"

P19: "Same context"

P20: "Latin Music Production"

P21: "Electronic music bases, sampling of old songs"

P22: "to make personalized sound effects"

P23: "Generating sounds based on sounds and patterns that I like"

P24: "I did not use the personalization option of the model"

* answers translated

Q34) Do you think giving the opportunity to personalize the text-to-music model with own music is good? Why?

P1: "I believe that integration of artificial intelligence can give a boost to music production, providing more opportunities for young artists"

P2: "maybe it could be useful in order to save your general signature sound and use it to create new sounds starting from them"

P3: "Yes, it's very good because gives more control on the output and creative use of my samples"

P4: "Yes, it further improves the quality of the model in the task of generating samples from a specific genre. Even if this means that the quality in very different genres is decreased it's not a big deal, as we can still fall back to the original model or even fine-tune it again for a different task."

P5: "I think it has the potential to let musicians get realistic sounds with very little effort, but it could also be a bit risky as it could essentially allow anyone to copy any style in very little time, possibly harming the original creator (e.g. by providing much cheaper copies they can't react to in time)"

P6: "Yes, because that way everyone would create their own style and use it as they see fit*"

P7: "Is good because it can fit better your own music tastes"

P8: "yes, because it allows the user to draw a line to be followed by the model and to get better results"

P9: "Yes because you can experiment mixing different types of genres listening the resulting output"

P10: "Yes, to satisfy specific needs or discover new sounds, but it needs to be better customizedo*"

P11: "I think is a good option for the artist to work on a different settings for their wor."

P12: "Yes because it creates a lot of oportunities to the user to create and play with the model. Maybe this would hook the user to use the model more than other models that does not offer this posibility"

P13: "absolutely, because it can provide signature sounds that only belongs to the artists"

P14: "Yes, because music tends to be really different, so a fine tuning is necessary in order to have a result that might be meaningful during the writing process, taking care of specific constraints such as harmony, rythm, etc..."

P15: "Yes, because it can help artists in the composition and editing of musical bases for lyrics*"

P16: "Yes. The opportunity to personalize the text-to-music model with own music can be an opportunity to find some new alterations in what a musician play, leading to new possibilities of finding new path in the music creation process"

P17: "Not at this moment, because I can't play the requested music*"

P18: "yes, because the result is more likeable to the user"

P19: "To better tune the model towards a specific genre"

P20: "Yes to generate unique sounds and personalized music"

P21: "Yes since the output would be more likely to match personal taste and give good chances of re-intepreting our own music"

P22: "Yes, because its a good way to make new content to use for new stuff"

P23: "It is a very good idea because it can give the user the possibility to explore the sounds he/she provides in a more creative way"

P24: "I have not used the model personalization, but it could be used in any context in reference to specific needs, e.g. birthday party, music for children, DJ or themed evenings, themed parties, etc...*"

* answers translated

Q35) Insert here your suggestions and comments (optional answer)

P3: "The model should have less noisy outputs and more frequencies in the spectrum of the outputs"

P4: "Fine tuning the model greatly increases the quality in the generation of one specific genre, but it also lowers the average quality when generating music from completely different genres."

P5: "DAW integration would be great to have for my use case"

P6: "I enjoyed the interaction and it was very interesting, but it is still a bit immature. I think it has a lot of potential for growth.*"

P8: "in the future a combination of the model with an interacive process (the user makes a choise between n audio and next generation is conditioned by his/her choise) can be useful"

P9: "Very nice work, I would suggest to try the experiment with others text to music models and compare results to analyze what is the better."

P10: "We should make longer tracks and improve the sound quality, overall it was a new and beautiful experience*"

P11: "More speed (just a little bit) for the generation of my models."

P13: "only generate one sound at the time and not the whole track, words ambiguity with similar meanings, improve the GUI"

P14: "It's a very nice tool for musicians that might get stuck finding inspiration. Probably it's still a little too complicated for someone without some technical knowledge, so in the future it might be useful to 'guide' the user in some way in order to obtain better results"

P15: "Very pleasant experience, I enjoyed experimenting with this new tool that I had never tried before. Interesting the combinations and searches that the system does for the word-sound association.*"

P16: "I would add a list of words related to specific areas of music structure, with the corrispective description, to give the user a tool to modify specific aspect of the song in the prompt given to the model"

P17: "My expectation was that the model would be able to generate music following my specific specifications, regardless of the models pre-set inside it. In the specific case, I wanted it to transform the basic electric guitar riff inside it and adapt it to a riff usable with an acoustic guitar.*"

P18: "Improve the graphical interface by adding color*"

P22: "improve the quality of sound in terms of sample Hz"

P23: "Create a VST plugin with this application, fix the generation of silence or volume attenuation that sometimes occur"

P24: "The sound should be more relevant and pertinent to my requests given by the text, but overall I found the experiment interesting*"

* answers translated