NCVS Insights – Science that Resonates
My Path Through the Intersection Between Voice and Psychology
January 28, 2026
Volume 4, Issue 1 – January 2026
By Dr. Miriam van Mersbergen
Intersections can be dangerous. The U.S. Department of Transportation (U.S. Department of Transportation, 2025) estimates that nearly half of all traffic accidents happen where roads meet – a cautionary reminder for anyone crossing paths, whether on actual roads, or in lines of research.
Fortunately, unlike traffic patterns, the intersection between two research lines do not result in dire consequences for those involved. In fact, intersections in research lines can become turning points – places to change course, glimpse the edges of other work, and pause to consider where to go next. This article will discuss the intersection between two lines of research: voice science and psychophysiology.
The discipline of psychophysiology is in itself an intersection between anatomy, physiology, and the study of human behavior (J. T. Cacioppo et al., 2016, pp 4-5). It coopts measures of physiology to study how humans act and think. It views the body through physiological measures to see brain processes that are difficult to directly measure. Psychophysiology studies abstract constructs such as stress, interoception, emotion, language, and attention, to name a few. By observing the cardiovascular, respiratory, somatic, and gastrointestinal system changes during psychological conditions, insights into how the body reflects brain processes inform the relationships between mind and body. There are numerous measures to accomplish these tasks such as electrocardiography, electroencephalography, electrodermal responses, electromyography, and respiratory measures of rate, volume, and oxygen saturation. For example, psychophysiologists have found that increases in tonic electrical activity in muscles of facial expression, the corrugator supercilii and zygomaticus major, reflect negative and positive mood, respectively (Land et al., 1993). They have found that heart rate variability is reduced in infants under stress (Weiss et al., 2024). Thus, physiological measures can track psychological constructs that are covert and subconscious. What is suspiciously hidden from the fleet of physiological measures employed by psychophysiologists are measures of the voice.
Much of my research integrating voice and psychology has focused on how psychological processes affect voice functioning. In other words, how brain processes affects the voice. We know that mood states can affect glottal closure as measured by electroglottography contact quotient (van Mersbergen & Delany, 2014) and that increased arousal amplifies this effect (van Mersbergen et al., 2017). Acoustic measures such as relative fundamental frequency are also modulated by emotion states (van Mersbergen & Lanza, 2019). Cognitive processes also influence acoustic measures such as intensity (van Mersbergen & Payne, 2021) and cepstral peak prominence (MacPherson et al., 2017). Social stressors can also affect the muscular activity of the vocal folds (Helou et al., 2013, 2018). This research revealed links between our psychological processes and how we phonate, and provides evidence that mental health and well-being can influence vocal health and well-being. The voice science community can construct emerging evidence to develop deeper understandings of how various psychological concepts influence voicing.
However, the relationship between voice and psychology is bi-directional and can serve a wider scientific audience; specifically the field of psychophysiology. It is possible that measures of vocal functioning can be used to measure the inner workings of the mind because vocal function employs and interacts with many physiological systems (Scherer, 1989). Additionally, measures of voice functioning can serve as composite measures, integrating respiratory and somatic systems (Van Puyvelde et al., 2018) . One such area employing vocal measures to peer into mental processes is lie detection, where voice measures were integrated with other physiological systems to attempt to reflect episodes of deception in the voice (Brenner et al., 1979; Cutbow et al., 1972; Gamer et al., 2006). Another such area is stress and mental load where acoustic data are used to track physical, emotional, and mental stress (Finke et al., 2021; Johannes et al., 2007; Kappen et al., 2022; Schneider et al., 2006).
A challenge in studying voice from a psychophysiological perspective is that it can be confused with research that views the voice from different psychological spaces. Much of the psychological research that includes voice, studies how individuals recognize emotions via vocal output.. A prominent researcher in this field, Klauss Scherer, has led the field in how the voice can signal emotional states in others and what aspects of the voice carry such signals (Banse & Scherer, 1996; Patel et al., 2011; Scherer, 1989, 2003; Sundberg et al., 2011). Other researchers study how vocal perception signals a sense of self as compared to others (Bickford et al., 2013; Broekner et al., 1998; Eckland et al., 2019; Hughes & Harrison, 2013) or how others perceive one’s voice (Belin et al., 2011; Guldner et al., 2020; McGettigan, 2015; Scott & McGettigan, 2016.) Some researchers study more esoteric aspects of the voice, such as whether auditory hallucinations in individuals with schizophrenia hear those voices as their own or as others (Baumeister et al., 2017). Finally, in many lines of research the term voice can be used as an allegory to advocacy or perspective (Gabbay & Stern, 2012).
From the psychophysiological perspective, voice measures have the potential to provide insights into the mind that involve complex physiological interactions. Interest in employing voice measures to study stress, emotions, anxiety disorders, threat response, and cognitive load has increased, particularly with researchers employing machine learning technologies. For a comprehensive review of this research, I would suggest the article by Schewski and colleagues (2025) that describes a number of acoustic parameters (e.g., Fo or L/H ratio) that vary with cognition, emotion, and stress. While recent research has made valuable progress in linking vocal features to psychophysiological states such as acute stress, some methodological choices reflect a limited engagement with principles from voice science. For example, jitter and shimmer have occasionally been applied to connected speech, although these measures are not designed for such material, which may explain the mixed results reported. As this area of research develops, closer collaboration with voice scientists could help refine methodological approaches.
Given the growing interest in using the voice as a window into the mind, voice scientists are uniquely positioned to inform the psychophysiological discipline. At this intersection, the journey in discovering the relationship between the mind and the voice has just begun and many miles lay ahead. But, this journey is open to everyone.
References
U.S. Department of Transportation. (2025, November 5). About Intersection Safety. Federal Highway Administration. https://highways.dot.gov/safety/intersection-safety/about
Banse, R., & Scherer, K. R. (1996). Acoustic Profiles in Vocal Emotion Expression. Journal of Personality and Social Psychology, 70(3), 614–636. https://doi.org/10.1037/0022-3514.70.3.614
Baumeister, D., Sedgwick, O., Howes, O., & Peters, E. (2017). Auditory verbal hallucinations and continuum models of psychosis: A systematic review of the healthy voice-hearer literature. Clinical Psychology Review, 51, 125–141. https://doi.org/10.1016/j.cpr.2016.10.010
Belin, P., Bestelmeyer, P. E. G., Latinus, M., & Watson, R. (2011). Understanding Voice Perception. British Journal of Psychology, 102(4), 711–725. https://doi.org/10.1111/j.2044-8295.2011.02041.x
Bickford, J., Coveney, J., Baker, J., & Hersh, D. (2013). Living with the altered self: A qualitative study of life after total laryngectomy. International Journal of Speech Language Pathology, 15(3), 324–333. https://doi.org/10.3109/17549507.2013.785591
Brenner, M., Branscomb, H. H., & Schwartz, G. E. (1979). Psychological Stress Evaluator-Two Tests of a Vocal Measure. Psychophysiology, 16(4), 351–357.
Broekner, J., Heuer, L., College, B., Siegel, P. A., Wiesenfeld, B., Martin, C., Grover, S., Reed, T., & Bjorgvinsson, S. (1998). The Moderating Effect of Self-Esteem in Reaction to Voice: Converging Evidence From Five Studies. Journal of Personality and Social Psychology, 75(2), 394–407.
Cacioppo, J. T., Tassinary, L. G., & Berntson, G. G. (2016). Handbook of psychophysiology, fourth edition. In J. Cacioppo, L. Tassinary, & G. Berntson (Eds.), Handbook of Psychophysiology, Fourth Edition (4th ed.). https://doi.org/10.1017/9781107415782
Cutbow, R. J., Parks, A., Lucas, N., & Thomas, I. (1972). The Objective Use of Multiple Physiological Indices in the Detection of Deception. Psychophysiology, 9(6), 578–588.
Eckland, N. S., Leyro, T. M., Mendes, W. B., & Thompson, R. J. (2019). The Role of Physiology and Voice in Emotion Perception During Social Stress. Journal of Nonverbal Behavior, 43(4), 493–511. https://doi.org/10.1007/s10919-019-00311-4
Finke, J. B., Zhang, X., Plein, D., Schilling, T. M., Schächinger, H., & Larra, M. F. (2021). Combining mental and physical stress: Synergy or interference? Physiology and Behavior, 233, 1–8. https://doi.org/10.1016/j.physbeh.2021.113365
Gabbay, F. H., & Stern, R. M. (2012). A quiet voice: Roland Clark Davis and the emergence of psychophysiology. Psychophysiology, 49(4), 443–453. https://doi.org/10.1111/j.1469-8986.2011.01339.x
Gamer, M., Rill, H. G., Vossel, G., & Gödert, H. W. (2006). Psychophysiological and vocal measures in the detection of guilty knowledge. International Journal of Psychophysiology, 60(1), 76–87. https://doi.org/10.1016/j.ijpsycho.2005.05.006
Guldner, S., Nees, F., & McGettigan, C. (2020). Vocomotor and Social Brain Networks Work Together to Express Social Traits in Voices. Cerebral Cortex, 30(11), 6004–6020. https://doi.org/10.1093/cercor/bhaa175
Helou, L. B., Rosen, C. A., Wang, W., & Verdolini Abbott, K. (2018). Intrinsic laryngeal muscle response to a public speech preparation stressor. Journal of Speech, Language, and Hearing Research, 61(7), 1525–1543. https://doi.org/10.1044/2018_JSLHR-S-17-0153
Helou, L. B., Wang, W., Ashmore, R. C., Rosen, C. A., & Abbott, K. V. (2013). Intrinsic laryngeal muscle activity in response to autonomic nervous system activation. Laryngoscope, 123(11). https://doi.org/10.1002/lary.24109
Hughes, S. M., & Harrison, M. A. (2013). I like my voice better: Self-enhancement bias in perceptions of voice attractiveness. Perception, 42(9), 941–949. https://doi.org/10.1068/p7526
Johannes, B., Wittels, P., Enne, R., Eisinger G., Castro, C. A., Thomas, J. L., Adler, A. B., & Gerzer, R. (2007). Non-linear function model of voice pitch dependency on physical and mental load. Uropean Journal of Applied Physiology, 101(3), 267–276. https://doi.org/doi.org/10.1007/s00421-007-0496-6
Kappen, M., Hoorelbeke, K., Madhu, N., Demuynck, K., & Vanderhasselt, M.-A. (2022). Speech as an indicator for psychosocial stress: A network analytic approach. Behavior Research Methods, 54, 910–921. https://doi.org/10.3758/s13428-021-01670-x/Published
Land, P. J., Greenwald, M. K., Bradley, M. M., & Hamm, A. O. (1993). Looking at pictures: Affective, facial, visceral, and behavioral reactions. Psychophysiology, 30(3), 261–273. https://doi.org/10.1111/j.1469-8986.1993.tb03352.x
MacPherson, M. K., Abur, D., & Stepp, C. E. (2017). Acoustic Measures of Voice and Physiologic Measures of Autonomic Arousal during Speech as a Function of Cognitive Load. Journal of Voice, 31(4), 504.e1-504.e9. https://doi.org/10.1016/j.jvoice.2016.10.021
McGettigan, C. (2015). The social life of voices: studying the neural bases for the expression and perception of the self and others during spoken communication. Frontiers in Human Neuroscience, 9, 1. https://doi.org/10.3389/fnhum.2015.00129
Patel, S., Scherer, K. R., Björkner, E., & Sundberg, J. (2011). Mapping emotions into acoustic space: The role of voice production. Biological Psychology, 87(1), 93–98. https://doi.org/10.1016/j.biopsycho.2011.02.010
Scherer, K. R. (1989). Vocal correlates of emotional arousal and affective disturbance. In Handbook of social psychophysiology. (pp. 165–197). https://research.ebsco.com/linkprocessor/plink?id=6dca2c74-e598-34d8-9252-4b5029c6153f
Scherer, K. R. (2003). Vocal communication of emotion: A review of research paradigms. Speech Communication, 40(1–2). https://doi.org/10.1016/S0167-6393(02)00084-5
Schewski, L., Doss, M. M., Beldi, G., & Keller, S. (2025). Measuring negative emotions and stress through acoustic correlates in speech: A systematic review. PLOS ONE, 20(7 July). https://doi.org/10.1371/journal.pone.0328833
Schneider, B., Enne, R., Cecon, M., Diendorfer-Radner, G., Wittels, P., Bigenzahn, W., & Johannes, B. (2006). Effects of Vocal Constitution and Autonomic Stress-Related Reactivity on Vocal Endurance in Female Student Teachers. Journal of Voice, 20(2), 242–250. https://doi.org/10.1016/j.jvoice.2005.03.002
Scott, S., & McGettigan, C. (2016). The voice: From identity to interactions. In A. P. Association (Ed.), APA handbook of nonverbal communication (pp. 289–305). https://doi.org/10.1037/14669-011
Sundberg, J., Patel, S., Björkner, E., & Scherer, K. R. (2011). Interdependencies among voice source parameters in emotional speech. IEEE Transactions on Affective Computing, 2(3), 162–174. https://doi.org/10.1109/T-AFFC.2011.14
van Mersbergen, M., & Delany, M. (2014). Vocal responses to emotional picture viewing. Logopedics Phoniatrics Vocology, 39(3), 99–107. https://doi.org/10.3109/14015439.2013.777108
van Mersbergen, M., & Lanza, E. (2019). Modulation of Relative Fundamental Frequency During Transient Emotional States. Journal of Voice, 33(6), 894–899. https://doi.org/10.1016/j.jvoice.2018.07.020
van Mersbergen, M., Lyons, P., & Riegler, D. (2017). Vocal responses in heighted states of arousal. Journal of Voice, 31(1), 127.e13-127.e19. https://doi.org/10.1016/j.jvoice.2015.12.011
Van Mersbergen, M., & Payne, A. E. (2021). Cognitive, Emotional, and Social Influences on Voice Production Elicited by Three Different Stroop Tasks. Folia Phoniatrica et Logopaedica, 73(4), 326–334. https://doi.org/10.1159/000508572
Van Puyvelde, M., Neyt, X., McGlone, F., & Pattyn, N. (2018). Voice Stress Analysis: A New Framework for Voice and Effort in Human Performance. Frontiers in Psychology, 9, 1–25. https://doi.org/doi:10.3389/fpsyg.2018.01994
Weiss, S. J., Keeton, V. F., Leung, C., & Niemann, S. (2024). Infant emotion regulation in the context of stress: Effects of heart rate variability and temperament. Stress and Health, 40(4), 1. https://doi.org/10.1002/smi.3373
Dr. Miriam van Mersbergen
Dr. Miriam van Mersbergen is Associate Professor in the School of Communication Sciences and Disorders at the University of Memphis where she teaches Anatomy & Physiology or the Speech Mechanism, Voice Disorders, and Dysphagia. She began her academic career studying music and communication arts at Calvin College in Grand Rapids, Michigan. After a brief career in music she returned to academia and studied speech language pathology and vocology at The University of Iowa. She worked as a voice therapist in Chicago and Milwaukee before commencing doctoral studies in speech language hearing sciences and psychology at the University of Minnesota. In her current position at the University of Memphis, she is the director of the Voice, Emotion, & Cognition Laboratory where her research investigates emotional and cognitive influences on voice production. She integrates electroencephalographic, psychophysiological, acoustic, and aerodynamic measures to capture mind, body, voice interactions. She maintains an active clinical career specializing in voice, breathing, and cough disorders and continually yearns for more time to pursue her singing career.
HOW TO CITE
Subscribe to NCVS Insights
Contact
975 S. State Street
Clearfield, UT 84015
