Affective responses to singing voice in different vocal registers and modes

Yingyi Wu; Hyun-Ju Chong

doi:10.7776/ASK.2023.42.1.075

Preview

Research Article

The Journal of the Acoustical Society of Korea. 31 January 2023. 75-82
https://doi.org/10.7776/ASK.2023.42.1.075

Affective responses to singing voice in different vocal registers and modes

보컬 음역대와 음악 조성에 따른 감상자의 정서반응

Yingyi Wu¹^†

Hyun-Ju Chong¹

WU YINGYI¹^†

정 현주¹

¹Ewha Womans University

^{†Corresponding Author}

ABSTRACT

The purpose of this study was to investigate listener’s affective responses to different vocal registers and modes in terms of valence (i.e., negative to positive affect) and arousal (i.e., low to high energy level). The data were collected from four different conditions (i.e., higher and lower registers paired with major and minor modes). A total of 188 female college students participated in the survey online and rated their perceived valence and arousal levels on a visual analogue scale after listening to each excerpt. The two-way analysis of variance (ANOVA) was administered for data analysis. The results revealed that there were significant differences in the affective responses to the two vocal registers, showing that the arousal was more affected by the register than the valence. Secondly, mode had statistically significant impact on both valence and arousal while weighing more on valence. Further, there was significant interaction effect of vocal register and mode on valence, but not on arousal. Results also displayed that listeners had the most negative valence when listening to the excerpt of minor mode in higher register, while having the lowest arousal when listening to the excerpt of minor mode in lower register. These findings imply that it is important to consider the vocal range as well as the musical mode when selecting music for appreciation.

Keywords

Musical elements

Vocal register

Affective response

Valence

Arousal

Vocal music

본 연구의 목적은 다른 음역대의 목소리로 노래한 음악(고음역대, 저음역대)과 불려진 음악이 조성적으로 다른 경우(장조/단조) 감상자가 경험하는 정서적 반응에 차이가 있는지를 살펴보았다. 음악은 첫째 옥타브 차이를 두고 높은 음역대와 낮은 음역대의 가창 영역과 둘째, 조성적 변인을 통제하기 위해 장조와 단조를 사용한 총 네 가지 음원을 사용하였다. 총 188명의 여성 대학생들이 온라인 설문으로 참여하였으며 시각아날로그척도(Visual Analogue Scale)을 사용하여 지각한 정서가와 각성 수준을 기록하였다. 수집된 자료는 two-way analysis of variance(ANOVA)를 이용하여 분석하였다. 분석 결과 두 음역대 간의 유의미한 정서 반응 차이를 보여주었으며 정서가(valence)보다는 각성 수준(arousal level)에 더 많은 차이를 보여주었다. 둘째, 조성 또한 정서가와 각성 수준에 영향을 미쳤으나 두 정서 변인 중에서는 정서가에 더욱 큰 차이를 보였다. 또한 교호작용을 분석한 결과 음역대와 조성의 상호작용이 정서가에게는 큰 영향을 미치지만 각성에는 영향을 미치지 않는다는 것을 보여주었다. 더 나아가 감상자들이 단조 음악의 높은 음역대 조건에서 가장 부정적인 정서가를 보여주였고 단조 음악의 낮은 음역대 조건에서는 가장 낮은 각성 반응을 보여주었다. 이러한 결과는 음악 감상시 그 음악의 음역대와 조성을 고려해서 선곡해야 함을 암시한다.

키워드

음악요소

음역대

정서 반응

정서가

정서적 각성

가창 음악

MAIN

I. Introduction
II. Method
2.1 Participants
2.2 Music stimuli
2.3 Measures
2.4 Procedure
2.5 Data analysis
III. Results and discussion
3.1 Differences of valence response depending on vocal register, mode and the interaction effect
3.2 Differences of arousal response depending on vocal register, mode and the interaction effect
3.3 Difference of affective responses among the four vocal excerpts
IV. Conclusion

I. Introduction

Affective response towards music listening has long been studied in the emotion and music research domain, leading to specific findings on musical variables which may induce various emotional responses during listening. Existing studies suggest that certain musical elements such as tempo, mode,^[1] dynamics,^[2] pitch,^[3] and rhythm^[4] have significant impact on listener’s perceived emotion.

Among many musical elements, some are evinced to have more influence on a particular emotion,^[5] such as mode on valence^[6] and tempo on arousal.^[7] Music is composed of diverse musical elements including rhythmic, tonal and other components (e.g., timbre, texture, form). It is learnt that the rhythmic components mostly govern listener’s energy level, such as tempo,^[1] while the tonal components mainly rule listener’s mood or feeling, such as mode.^[1] The harmonic structure including consonance and dissonance may also evoke different emotional responses from the states of anxiety and tension to the relaxed and calm condition.^[8]

Tonal components mainly determine the emotionality of music due to its primary feature, namely mode. Mode is the grounding frame that the tones are selected to form a melody. Thus, depending on different mode, the melody constitutes the emotional atmosphere for the music. It is examined that music in minor mode creates melancholic and sad feelings whereas the major mode generates gay and joyful atmosphere in music.^[9]

As one of the tonal components, melody is a significant musical element composed of a series of tones with different pitches. Scholars propose that the melodic range can bring different emotional responses. They state that the high pitch range is associated with happiness, anger and fear, while the low pitch range is linked to sadness and tenderness.^[10] In addition, the pitch range variations were found to have impact on perceived valence and arousal of the listener.^[3]

Voice is one of the important timbral element in vocal music. In instrumental music, the musical timbre depends on the material that the instrument is made of and what the particular instruments are used in a piece of music. That is, the timbre can be varied when the same music is played by one or multiple instruments.^[11] Similarly, voice is a human instrument that may create music along with other instruments. Vocal music has particular impacts on musical expression and articulation, as voice may sing with lyrics. A song sung by different genders (e.g., male, female), different numbers of singers (e.g., solo, chorus), using different vocal techniques (e.g., head voice, bel canto) or different singing styles (e.g., fine art song, popular song) may lead to listener’s different emotional responses.

With regard to the measurement of listener’s emotional response, although various measuring tools have been developed, the dimensional model of emotion (i.e., valence and arousal) is what seems to be the most apt for evaluating affective responses induced in music listening.^[12,13,14] As mentioned above, the rhythmic components chiefly determine the energy level which coincides with the arousal status in the dimensional model, while the tonal components influence the affective traits which match the valence level in the model.^[1]

It is noticeable, however, that these findings are based on the existing literature with respect to instrumental music. In vocal music, the singer’s voice serves as the timbre of the music, being like a musical instrument. Since singing high or low may determine different vocal timbres, the vocal range is also an essential feature to direct listener’s emotional response. Therefore, this study decided to examine whether there is any difference in affective responses towards the vocal music sung in different vocal ranges. In order to control the modality of the music, the musical excepts were presented in both minor and major modes.

Accordingly, there were two research questions as following.

1. Are there significant differences in the perceived valence depending on vocal register, mode, and the interaction effects between vocal register and mode?

2. Are there significant differences in the perceived arousal depending on vocal register, mode, and the interaction effects between vocal register and mode?

II. Method

2.1 Participants

A total of 188 female university students aged from 19 to 34 years old (mean age = 24.13, SD = 3.12) participated in an online survey experiment. Among the original 228 participants, 22 participants were excluded for their incompletion of the survey questionnaire, and 1 non-binary gender and 17 male participants were removed from the data analysis for consideration of insignificant gender ratio. Regardless of education level or major, there were 147 non-musicians (78.19 %) and 41 musicians (21.81 %). All participants agreed to take part in this survey voluntarily and were noticed with the research subject, purpose, procedure, confidentiality, the experimenter’s contact information, and that they could end the survey at any time during their participation prior to the commencement of the survey. They were given a digital coupon of coffee or a snake after completing the survey.

2.2 Music stimuli

The study employed one major and one minor melodic piece that were sung in a higher and a lower vocal register respectively by a male singer who was vocally trained as a Baritone. The male voice was selected due to his vocal ability of covering over three tonal registers when singing. Two pieces of abridged songs were selected: one in major mode and another in minor mode. Each of them was sung in two different tonal registers correspondingly. The two songs were selected based on the previous study and hypothesis that high register may associate with high level of arousal and positive affect whereas low register may be linked to low arousal level and negative affect.^[10,15] Consequently, four excerpts were generated using two songs with each being sung in two different vocal registers (see Fig. 1).

https://cdn.apub.kr/journalsite/sites/ask/2023-042-01/N0660420110/images/ASK_42_01_10_F1.jpg

Fig. 1.

Sheet music of the four vocal excerpts.

The songs were sung with a nonsense syllable (i.e., /a:/ vowel) in order to avoid any textual messages that may have affective influence on listeners. Each song was approximately 25 s without accompaniment, and recorded at a professional sound studio. The tempo and duration of the four excerpts were controlled to a similar level by the vocalist during the recording. All recordings were exported via Logic Pro X. Some of the notes in vocal excerpts were slightly adjusted for better appreciation (see Table 1).

2.3 Measures

In this study, a self-report questionnaire and the psychometric calibration “Visual Analogue Scale (VAS)” were implemented to collect participants’ a) demographic information, b) current affective state and c) affective responses of valence and arousal to the four vocal excerpts.

In the survey questionnaire, participant’s agreement of consent, the information with regard to their age, gender, language ability, hearing condition, education level, major, daily music listening hours and vocal music training were collected (See Table 2).

At the second part, the VAS was provided for participants to specify affective response levels by implying a position on the continuum slider bar (from 0 points to 100 points).^[16]

Table 1.

Parameters of the vocal excerpts.

Vocal excerpts	Range	Notes	Register	Mode	Maximum amplitude	Tempo	Duration
HM	Higher	A₄ / A₅	Counter-tenor	A major	-0.1 dB	100 bpm	23 s.
LM	Lower	A₃ / A₄	Tenor	A major	-0.1 dB	100 bpm	23 s.
Hm	Higher	A₄ / B₅	Counter-tenor	D minor	-0.1 dB	100 bpm	26 s.
Lm	Lower	A₃ / B₄	Tenor	D minor	-0.1 dB	100 bpm	26 s.

Note. dB: decibel; bpm: beats per minute; s.: second; Duration: full length of each vocal music

Table 2.

Survey questionnaire structure.

No.	Section	Question	Content
	Introduction & consent		Agreement of consent
I	Demographic information	Q 1 ~ 4	Age, gender, language, hearing condition
		Q 5 ~ 9	Education, major, music listening hours
		Q 10 ~ 12	Vocal music training
II	Current affective state	Q 13	Current valence state rating
II	Current affective state	Q 14	Current arousal state rating
III	Affective response to the four vocal excerpts	Q 15, 17, 19, 21	Perceived valence ratings
III	Affective response to the four vocal excerpts	Q 16, 18, 20, 22	Perceived arousal ratings

2.4 Procedure

A pilot study was carried out in order to trial the feasibility of the measure tools and overall procedure. There were six female postgraduate students (mean age = 27.5, SD = 1.5) majored in music therapy participated in an online survey. Question items were tested and modified for clarity.

For the main study, it was conducted via an online survey tool. Participants were recruited through university teachers and direct invitation in which the advertisement with the research information was presented. The agreement of participating in this study was obtained prior to the questionnaire. Participants were noted to take this survey under the quiet and cosy environment and to use earphones when listening to vocal excerpts.

Participants were guided online to complete the survey process in three sections: a) survey on demographic information, b) ratings on current affective state, and c) ratings on perceived valence and arousal to the four vocal excerpts. The entire participation was estimated 9 minutes. In section c), a sample excerpt of instrumental music (i.e., piano, 16-second, Pop-style) was provided for sound test and volume adjustment ahead of the vocal excerpts. The four vocal excerpts were presented to the participant in a randomised order.

2.5 Data analysis

A two-way ANOVA was performed to analyse whether there were differences in perceived valence and arousal depending on vocal register and mode, and whether there was any interaction effect between vocal register and mode, respectively. The demographic information of participants was enquired into via descriptive statistics. All statistics were conducted on program SPSS (version 22).

III. Results and discussion

3.1 Differences of valence response depending on vocal register, mode and the interaction effect

The results exhibited that there was a significant difference of perceived valence between the vocal registers (F(1, 752) = 8.09, p < .01), modes (F(1, 752) = 108.36, p < .001), and further the interaction effect of vocal register and mode (F(1, 752) = 18.66, p < .001) (See Table 3).

Table 3.

Two-way analysis of variance (ANOVA) on perceived valence. (N = 188)

Source	MS	F	p	ηp²	dF
Vocal Register (VR)	3425.57	8.09	.005^**	.011	187
Mode (M)	45867.19	108.36	.000^***	.127	187
VR M*	7897.57	18.66	.000^***	.024	187

^*p ≤ .05; ^**p ≤ .01; ^***p ≤ .001

The result indicated that vocal register by itself had a significant effect on listener’s valence response, which is partially in line with an earlier study suggesting that higher pitch in singing voice was mostly perceived as pleasantness.^[17] This finding implies that musical melody in higher register may be a metaphor for uplifted mood that manifested acoustically. Often, the mood is expressed using spatial metaphor such as feeling high or low, which can be a form of a schematic expression.

Secondly, it was shown that mode had a statistically significant effect on perceived valence, which was congruent with the past researches on mode being associated with valence responses.^[6,18,19,20] Also, compared to vocal register, musical mode appeared to weigh predominantly on affecting listener’s valence response, which was consistent with preceding findings.^[6,21]

According to descriptive statistics of listeners’ affective responses to the four vocal excerpts, the Estimated Marginal Means (EMM) of valence in Fig. 2 revealed that a) in major mode, the difference of valence rating between higher and lower vocal registers was more significant than the difference in minor mode, b) in higher vocal register, the difference of valence ratings (i.e., solid-line slope) between major and minor modes was more significant than the difference in lower vocal register (i.e., dotted-line slope), c) regardless of the vocal register, valence responses in major mode were both more significant than the ones in minor mode, which was mostly congruent with earlier studies suggesting that major mode elicits valence response more remarkably than minor mode,^[19,21] and d) in higher vocal register, the valence response was more positive in major mode and more negative in minor mode, whereas in lower vocal register, the valence response was less positive and less negative in major and minor mode, respectively.

https://cdn.apub.kr/journalsite/sites/ask/2023-042-01/N0660420110/images/ASK_42_01_10_F2.jpg

Fig. 2.

Estimated Marginal Means of valence.

3.2 Differences of arousal response depending on vocal register, mode and the interaction effect

The results displayed that there was significant difference of perceived arousal between vocal registers (F(1, 752) = 28.36, p < .001) and modes (F(1, 752) = 59.13, p < .001). However, there was no statistical significance shown in the interaction effect of vocal register and mode on arousal response (F(1, 752) = .01, p > .05) (See Table 4).

Table 4.

Two-Way ANOVA on perceived arousal. (N = 188)

Source	MS	F	p	ηp²	dF
Vocal Register (VR)	8831.02	28.36	.000^***	.037	187
Mode (M)	18412.02	59.13	.000^***	.073	187
VR M*	1.82	.01	.939	.000	187

^*p ≤ .05; ^**p ≤ .01; ^***p ≤ .001

The result implied that the vocal register which is the pitch related feature had a statistically significant effect on arousal response, which was consistent with a previous study showing that pitch range variation influences perceived arousal to instrumental music.^[3]The result also denoted that mode had significant impact on arousal as well. However, this is opposed to a past study’s finding that musical mode had no manipulative influence on arousal when it comes to the instrumental music.^[1]

EMM of arousal in Fig. 3 presented two seemingly parallel lines implying that the interaction effects of two vocal registers and two modes had the same influencing patterns on arousal. That is, in major mode, the arousal was perceived higher in higher vocal register and lower in lower vocal register. Similarly, in minor mode, the arousal response was also higher and lower in the corresponding vocal registers. Further, it was seen that the interaction effect of higher vocal register and major mode had the most significant effect on arousal response, while the lower vocal register and minor mode combination had the least on affecting arousal.

https://cdn.apub.kr/journalsite/sites/ask/2023-042-01/N0660420110/images/ASK_42_01_10_F3.jpg

Fig. 3.

Estimated Marginal Means of arousal.

3.3 Difference of affective responses among the four vocal excerpts

Based on the descriptive statistics (see Table 5), the excerpt with higher register in major mode (HM) had the highest mean score (mean = 54.23, SD = 17.86) and the excerpt with higher register in minor mode (Hm) had the lowest mean score (mean = 32.13, SD = 20.34) with regards to valence response. The excerpt HM also had the highest mean score (mean = 48.40, SD = 18.50) whereas the excerpt with lower register in minor mode (Lm) had the lowest mean score (mean = 31.65, SD = 18.29) in respect of arousal response. The results reflected that the collaboration of higher vocal register and major mode had the most significant interaction effect on listener’s affective responses (valence and arousal).

Table 5.

Descriptive statistics on affective responses to the vocal excerpts. (N = 188)

Vocal excerpt	Valence mean (SD)	Arousal mean (SD)
HM	54.23 (17.86)	48.40 (18.50)
LM	43.48 (22.15)	41.65 (17.28)
Hm	32.13 (20.34)	38.62 (16.45)
Lm	34.35 (21.67)	31.65 (18.29)

Note. HM: higher register of major mode; LM: lower register of major mode; Hm: higher register of minor mode; Lm: lower register of minor mode.

IV. Conclusion

This study examined the differences in affective response between high and low vocal registers and musical modes. The results revealed that there were significant differences in perceived valence and arousal depending on vocal register and mode correspondingly. While the significant interaction effect was found in perceived valence, there was no salience shown in perceived arousal. In addition, although mode affecting valence is a widely known convention in the domain of music and emotion,^[15] the result implies that the difference in the perception of valence may accompany the perception of arousal level as well. Conclusively, the study suggests that the vocal register, as much as the mode, is an important factor to be considered for music selection, as it is found to induce different responses within the same piece of music.

The limitation of the study involves gender factors related to the vocalist’s voice in musical excerpts. The musical excerpts were recorded using male voice in order to cover two octave vocal ranges, while all participants evaluated were female. Thus, firstly, studies on such a topic including a balanced number of female and male participants are needed. Also, it would be worthwhile to investigate whether the findings will be in line with the ones of this study while hiring a female vocalist to sing the vocal experts.

References

G. Husain, W. F. Thompson, and E. G. Schellenberg, "Effects of musical tempo and mode on arousal, mood, and spatial abilities," Music Perception, 20, 151-171 (2002). 10.1525/mp.2002.20.2.151

S. B. Kamenetsky, D. S. Hill, and S. E. Trehub, "Effect of tempo and dynamics on the perception of emotion in music," Psychol. Music, 25, 149-160 (1997). 10.1177/0305735697252005

L. Jaquet, B. Danuser, and P. Gomez, "Music and felt emotions: how systematic pitch level variations affect the experience of pleasantness and arousal," Psychol. Music. 42, 51-70 (2014). 10.1177/0305735612456583

P. Gomez and B. Danuser, "Relationships between musical structure and psychophysiological measures of emotion," Emotion, 7, 377-387 (2007). 10.1037/1528-3542.7.2.37717516815

P. N. Juslin and P. Laukka, "Communication of emotions in vocal expression and music performance: Different channels, same code?," Psychol. Bull. 129, 770-814 (2003). 10.1037/0033-2909.129.5.77012956543

F. Morreale, R. Masu, A. D. Angeli, and P. Fava, "The effect of expertise in evaluating emotions in music." Proc. ICME3, 16, 374-381 (2013).

Y. Liu, G. Liu, D. Wei, Q. Li, G. Yuan, S. Wu, G. Wang, and X. Zhao, "Effects of musical tempo on musicians' and non-musicians' emotional experience when listening to music," Front. Psychol. 9, 02118 (2018). 10.3389/fpsyg.2018.0211830483173PMC6243583

E. Bigand, R. Parncutt, and F. Lerdahl, "Perception of musical tension in short chord sequences: The influence of harmonic function, sensory dissonance, horizontal motion, and musical training," Percept. Psycho. 58, 125-141 (1996). 10.3758/BF03205482

P. G. Hunter, E. G. Schellenberg, and U. Schimmack, "Feelings and perceptions of happiness and sadness induced by music: Similarities, differences, and mixed emotions," PACA. 4, 47-56 (2010). 10.1037/a0016873

P. N. Juslin and E. Lindström, "Musical expression of emotions: Modelling listeners' judgements of composed and performed features," Music Analysis, 29, 334-364 (2010).

J. J. Aucouturier and F. Pachet, "The influence of polyphony on the dynamical modelling of musical timbre," PRL. 28, 654-661 (2007). 10.1016/j.patrec.2006.11.004

S. Droit-Volet, D. Ramos, J. L. O. Bueno, and E. Bigand, "Music, emotion, and time perception: the influence of subjective emotional valence and arousal?," Front. Psychol. 4, 00417 (2013). 10.3389/fpsyg.2013.0041723882233PMC3713348

P. Loui, J. P. Bachorik, H. C. Li, and G. Schlaug, "Effects of voice on emotional arousal," Front. Psychol. 4, 00675 (2013). 10.3389/fpsyg.2013.0067524101908PMC3787249

E. Schubert, "Modeling perceived emotion with continuous musical features," Music Perception, 21, 561- 585 (2004). 10.1525/mp.2004.21.4.561

J. Berg and J. Wingstedt, "Perceived properties of parameterised music for interactive applications," JSCI. 4, 65-71, (2006).

A. W. K. Yeung and N. S. M. Wong, "The historical roots of visual analog scale in psychology as revealed by reference publication year spectroscopy," Front. Hum. Neurosci. 13, 1-5 (2019). 10.3389/fnhum.2019.0008630914939PMC6423150

T. Hakanpää, T. Waaramaa, and A. Laukkanen, "Emotion recognition from singing voices using contemporary commercial music and classical styles," Journal of Voice, 33, 501-509 (2019). 10.1016/j.jvoice.2018.01.01229478708

W. G. Collier and T. L. Hubbard, "Musical scales and brightness evaluations: Effects of pitch, direction, and scale mode," Musicae Scientiae, 8, 151-173 (2004). 10.1177/102986490400800203

G. D. Webster and C. G. Weir, "Emotional responses to music: Interactive effects of mode, texture, and tempo," Motiv. Emot. 29, 19-39 (2005). 10.1007/s11031-005-4414-0

P. Gomez and B. Danuser, "Relationships between musical structure and psychophysiological measures of emotion," Emotion, 7, 377-387 (2007). 10.1037/1528-3542.7.2.37717516815

The Journal of the Acoustical Society of KoreaISSN:1225-4428(Print) 2287-3775(Online)한국음향학회

Preview

Affective responses to singing voice in different vocal registers and modes

ABSTRACT

MAIN

Fig. 1.

Sheet music of the four vocal excerpts.

Table 1.

Parameters of the vocal excerpts.

Table 2.

Survey questionnaire structure.

Table 3.

Two-way analysis of variance (ANOVA) on perceived valence. (N = 188)

Fig. 2.

Estimated Marginal Means of valence.

Table 4.

Two-Way ANOVA on perceived arousal. (N = 188)

Fig. 3.

Estimated Marginal Means of arousal.

Table 5.

Descriptive statistics on affective responses to the vocal excerpts. (N = 188)

References