What is vocal resonance?
When the vocal folds vibrate, they produce a sound rich in harmonics — a fundamental frequency plus a stack of overtones at 2×, 3×, 4× the fundamental, and so on. The vocal tract above the larynx acts as a filter: it amplifies harmonics that fall near its natural resonance peaks (called formants, from the Latin formare, to shape) and suppresses harmonics that fall between those peaks.
The result is that every vowel you sing is partly a product of which formants your vocal tract happens to be tuned to. The "ee" vowel and the "ah" vowel use the same folds but sound completely different because the tongue, jaw, and lip positions shift the formant frequencies.
This is the core of resonance training: learning to position the vocal tract deliberately so the right harmonics get amplified for the pitch and style you want.
How formant tuning works — and why belting is different from classical singing
The acoustic research is worth knowing, because it explains why classical technique and contemporary commercial music (CCM) approaches sometimes give opposite-sounding advice.
In classical singing, trained male voices develop what researchers call the singer's formant — a prominent energy peak near 2.5–3.5 kHz produced when the epilaryngeal tube (the small airway just above the larynx) narrows while the pharynx stays wide. This clustering of formants lets operatic voices carry over an orchestra without amplification. Johan Sundberg at KTH Royal Institute of Technology coined the term and described the mechanism systematically in 1974, building on earlier spectrographic work by Bartholomew (1934), who first documented that trained male voices showed consistent energy concentration around 3 kHz without yet giving it that name.
The relevant caveat for CCM singers: the singer's formant is not characteristic of musical theatre, country, pop, rock, or jazz singing — genres that rely on amplification and aim for a speech-like rather than orchestrally-projected timbre. Belt and pop voices use fundamentally different resonance strategies, and attempting to impose classical formant clustering on a CCM singer can undermine their stylistic authenticity rather than improve their voice.
What belt does instead is called R1:2f₀ tuning — the first formant (R1) is raised, primarily through jaw opening and a speech-level or slightly elevated larynx, to track the second harmonic (2f₀) as pitch rises. This is why belting vowels tend to open and brighten on higher notes: the vowel modification is acoustically necessary to keep the right harmonic amplified. Miller and Schutte (1990) coined the term "formant tuning" for this deliberate harmonic alignment. Computational modeling by Herbst, Story, and Meyer (published in the Journal of Voice, online 2023) found that for most vowels, belt's characteristic dominant second harmonic can only be sustained over a pitch range of roughly a musical fifth centered on the pitch where f₀ equals half the first formant; for more open vowels like [ɑ] and [ɔ], that range extends closer to an octave. The practical implication is that vowel modification at the extremes of belt range is not optional but acoustically necessary — and the degree of modification depends on which vowel you're singing.
For general CCM work below the belt ceiling, the practical goal is to keep harmonics and formants well-matched as you move through vowels and pitch ranges, so tone stays consistent rather than drifting between bright and dull, nasal and covered.
Does "forward placement" or "mask resonance" mean anything?
Many teachers — and many exercises — cue "feel the sound forward," "sing in the mask," or "place it in your face." These are pedagogical metaphors, not anatomical descriptions: sound does not move forward and park in your sinuses.
What these cues tend to produce is a combination of effects that are real and distinct. The /m/ consonant (as in "mee may mah") semi-occludes the vocal tract at the lips — a mechanism that research on semi-occluded vocal tract (SOVT) exercises indicates lowers phonation threshold pressure and can improve vocal efficiency. Brighter vowels tune higher formant frequencies. And when singers aim for a "ring" or "buzz," they often narrow the aryepiglottic aperture slightly, which concentrates energy in the epilaryngeal region. These three things — SOVT effect, formant tuning, and aryepiglottic configuration — are separate mechanisms that can operate simultaneously; the cue conflates them into a single sensation.
Whether this is better described as "forward" or as "epilaryngeal narrowing plus appropriate formant balance" depends on which method you learned from. The sensation metaphor is useful for singers who find it. The acoustics underneath it are consistent regardless.
Where methods legitimately disagree is on how much brightness to use and when. Estill Voice Training, Complete Vocal Technique (Cathrine Sadolin), and Speech Level Singing all have distinctive takes on how much ring, twang, or forward placement is appropriate for different styles. None of them is factually wrong; they are optimizing for slightly different targets. If one cue language clicks for you and another does not, the mismatch may be vocabulary rather than underlying technique.
Common resonance problems and what to listen for
Hollow or muffled tone. Often associated with a lowered larynx or a tongue root that is retracting toward the throat. The vocal tract elongates, the formants drop, and the sound darkens beyond what the style calls for. A brief "nay-nay" on a conversational pitch can snap the larynx back toward neutral.
Overly nasal or whiny tone on non-nasal vowels. A low soft palate routes some acoustic energy into the nasal cavity, adding an anti-resonance that can muddy the sound. A "ng→ah" transition — lifting the palate while keeping the jaw relaxed — helps isolate the palate lift. Note: some nasal resonance (especially in the twang range) is not a problem in CCM; zero nasality would strip out brightness too.
Inconsistent tone across vowels. If "ee" sounds bright and focused but "ah" sounds pushed or spread, the vowel shapes may not be tracking formants consistently across pitch. The Italian vowel series — "mee, may, mah, moh, moo" — trains exactly this transition in one flowing phrase.
Tone that changes character across registers. A resonance strategy that works in chest voice may not carry into mix or head voice without adjustment. This is partly why five-note-scale work at moderate tempo is so widely used: it covers enough pitch range to force some formant adjustment without the extremes of a full-octave run.
Try it: Five-Note Scale Mee May Mah
Exercise: Five-Note Scale: Mee May Mah
Pattern: Scale degrees 1-2-3-4-5-4-3-2-1, sung on the syllables mee — may — mah — moh — moo — moh — mah — may — mee. The piano doubles the melody and a bell cue leads you in. The exercise moves up (and then back down) by half step across your range.
What it trains: The /m/ onset before each vowel briefly semi-occludes the vocal tract, which lowers phonation threshold pressure and brings tone forward before the vowel opens. Rotating through the five Italian pure vowels (ee, ay, ah, oh, oo) in a single ascending-then-descending stepwise phrase gives the tract practice tuning formants across a spectrum of tongue and lip positions without any register break (the five-note span stays within one register for most voices). The descending return reinforces consistent tone through the end of the phrase, when support tends to drop. Ascending and descending keys train the full usable range in one session.
Range: Soprano F4–D5 top key; Alto C4–A4; Tenor G3–F4; Baritone E3–B3. One half step per key, both directions.
Tip: On the "mah" syllable (scale degree 3), many singers want to spread the jaw wide. Resist that slightly — keep the jaw in a more oval shape so the "ah" stays warm rather than bright or shallow. On "mee" at the top of the phrase, let the vowel narrow very slightly toward "ih" if it helps stay connected. This is the acoustic adjustment mentioned above: the vowel tracks the harmonic structure as pitch rises.
Frequently asked questions
What is vocal resonance?
Vocal resonance is the amplification and shaping of the sound produced by the vocal folds as it passes through the vocal tract — the throat, mouth, soft palate, and nasal passages. Different configurations of these structures emphasize different harmonics, producing different timbres and projection qualities. Resonance is shaped every time a singer adjusts vowel, larynx position, or jaw opening.
How do I improve vocal resonance?
The most direct approach is practicing exercises that require precise vowel shaping across a pitch range — five-note scales on the Italian vowel series, "ng" humming, lip trills, and straw phonation. Recording yourself and comparing the tone across vowels and pitches is more informative than any real-time cue. Consistency across the phrase is the marker of improving resonance rather than any single "bright moment."
What does "singing in the mask" mean?
"Singing in the mask" is a pedagogical metaphor for a focused, forward-placed tone quality — typically associated with some aryepiglottic narrowing (which adds ring to the sound) and appropriate formant tuning. It is useful if the sensation cue helps you, but not all singers respond to it. Teachers from different schools use different metaphors for what is acoustically the same target.
Are resonance exercises different for CCM vs. classical singing?
The underlying acoustic principles are the same. The strategies differ. Classical male voices cultivate the singer's formant (a specific spectral clustering near 2.5–3.5 kHz for projection over an orchestra). CCM styles — pop, rock, R&B, theater belt — use formant-to-harmonic tracking strategies instead, with higher larynx positions and more open jaw configurations that optimize for a speech-like timbre and microphone-amplified performance. Exercises like the Italian vowel scale work for both, because they train tract mobility and consistency — the methods diverge on how much covering, modification, and ring to apply above the passaggio.
How long does it take to improve vocal resonance?
Individual variation is wide. Many singers notice a tonal shift within a single session of focused vowel work — the tract responds to position changes immediately. Consistent change in the voice's default timbre typically takes weeks to months of regular practice, since the goal is not just knowing what to do in a lesson but having it become reflexive across a full range and different contexts. Short daily sessions tend to outperform occasional long ones.
---
A note on vocal health: If you experience hoarseness or vocal roughness that has not resolved within four weeks, current clinical guidance (AAO-HNS 2018 Clinical Practice Guideline on Dysphonia) recommends evaluation by a laryngologist — earlier if you also have pain, difficulty swallowing, or a neck mass. Resonance training should feel efficient, not strained. Pain is not a normal part of this work.
Sources
•
Sundberg, J. (1974). "Articulatory interpretation of the singing formant." Journal of the Acoustical Society of America, 55(4), 838–844. DOI: 10.1121/1.1914609. (PubMed)
•
Miller, D.G. & Schutte, H.K. (1990). "Formant tuning in a professional baritone." Journal of Voice, 4(3), 231–237. (ScienceDirect abstract)
•
Herbst, C.T., Story, B.H., & Meyer, D. (2025). "Acoustical Theory of Vowel Modification Strategies in Belting." Journal of Voice, 39(5), 1192–1204. DOI: 10.1016/j.jvoice.2023.01.004. (PubMed)
•
Stachler, R.J. et al. (2018). "Clinical Practice Guideline: Hoarseness (Dysphonia) (Update)." Otolaryngology–Head and Neck Surgery. DOI: 10.1177/0194599817751030. (PubMed)
•
VoiceScience.org — Formant Tuning: Resonance Strategies in Singing (secondary/educational resource)
•
VoiceScience.org — Singer's Formant: The Spectral Peak That Lets Voices Carry (secondary/educational resource)
•
VoiceScience.org — Aryepiglottic Fold: Anatomy and Role in Vocal Resonance (secondary/educational resource)