Pitch perception beyond the traditional existence region of pitch
“…the perception of musical pitch at high frequencies is not constrained by temporal phase locking in the auditory nerve but may instead stem from higher-level constraints shaped by prior exposure to harmonic sounds.”
Humans’ ability to recognize musical melodies is generally limited to pure-tone frequencies below 4 or 5 kHz. This limit coincides with the highest notes on modern musical instruments and is widely believed to reflect the upper limit of precise stimulus-driven spike timing in the auditory nerve. We tested the upper limits of pitch and melody perception in humans using pure and harmonic complex tones, such as those produced by the human voice and musical instruments, in melody recognition and pitch-matching tasks. We found that robust pitch perception can be elicited by harmonic complex tones with fundamental frequencies below 2 kHz, even when all of the individual harmonics are above 6 kHz–well above the currently accepted existence region of pitch and above the currently accepted limits of neural phase locking. The results suggest that the perception of musical pitch at high frequencies is not constrained by temporal phase locking in the auditory nerve but may instead stem from higher-level constraints shaped by prior exposure to harmonic sounds.
Pitch perception refers to the ability to perceive and differentiate between different frequencies of sound, which are typically associated with the perception of musical notes. Traditionally, pitch perception has been studied within the range of frequencies that are audible to the human ear, roughly 20 Hz to 20,000 Hz. However, recent research has begun to explore pitch perception beyond this traditional range, investigating how the brain processes and interprets frequencies outside of the audible spectrum.
Here are some key points regarding pitch perception beyond the traditional existence region of pitch:
1. Ultrasonic and Infrasonic Frequencies: Ultrasonic frequencies refer to sound waves with frequencies higher than the upper limit of human hearing, while infrasonic frequencies are below the lower limit. Although humans cannot consciously perceive ultrasonic or infrasonic sounds, research suggests that the brain may still process these frequencies to some extent. Some studies have shown that certain brain regions respond to ultrasonic or infrasonic stimuli, indicating that pitch processing mechanisms may extend beyond the audible range.
2. Animal Communication: Many animals, such as bats, dolphins, and whales, communicate using ultrasonic frequencies that are beyond human hearing. Research on animal communication has provided insights into the perception and processing of ultrasonic pitch in non-human species. For example, bats use echolocation to navigate and locate prey, relying on the perception of ultrasonic frequencies to interpret their environment.
3. Technology and Artificial Sounds: Advances in technology have enabled the creation and manipulation of sounds outside of the traditional audible range. For example, ultrasound imaging techniques utilize ultrasonic frequencies to visualize internal structures in the body. Similarly, electronic devices can generate sound waves at frequencies beyond human hearing, raising questions about how the brain processes and interprets these artificial sounds.
4. Neural Mechanisms: Studies using neuroimaging techniques such as functional magnetic resonance imaging (fMRI) and electroencephalography (EEG) have provided insights into the neural mechanisms underlying pitch perception beyond the traditional range. These studies have identified brain regions involved in processing ultrasonic and infrasonic stimuli, suggesting that pitch processing mechanisms may be more flexible and adaptable than previously thought.
Overall, research on pitch perception beyond the traditional existence region of pitch highlights the complexity of auditory processing and the potential for the brain to interpret and respond to sounds outside of the audible spectrum. Further investigation into this phenomenon may provide valuable insights into the limits and capabilities of human auditory perception.
Content
Andrew J. Oxenham1, Christophe Micheyl, Michael V. Keebler, Adam Loper, and Sébastien Santurette
Edited by Bruce Henning, Max Planck Institute for Biological Cybernetics, Tübingen, Germany, and accepted by the Editorial Board March 22, 2011 (received for review October 11, 2010)
Abstract
Humans’ ability to recognize musical melodies is generally limited to pure-tone frequencies below 4 or 5 kHz. This limit coincides with the highest notes on modern musical instruments and is widely believed to reflect the upper limit of precise stimulus-driven spike timing in the auditory nerve. We tested the upper limits of pitch and melody perception in humans using pure and harmonic complex tones, such as those produced by the human voice and musical instruments, in melody recognition and pitch-matching tasks. We found that robust pitch perception can be elicited by harmonic complex tones with fundamental frequencies below 2 kHz, even when all of the individual harmonics are above 6 kHz–well above the currently accepted existence region of pitch and above the currently accepted limits of neural phase locking. The results suggest that the perception of musical pitch at high frequencies is not constrained by temporal phase locking in the auditory nerve but may instead stem from higher-level constraints shaped by prior exposure to harmonic sounds.
Pitch–the auditory percept related to the repetition rate of periodic sounds–plays a crucial role in music and speech (1, 2). Musical notes are defined by different pitches, which are combined to form chords (simultaneously sounded pitches) and melodies (pitch patterns over time). In speech, pitch is used to convey prosody in languages such as English as well as semantic information in tone languages such as Mandarin Chinese. Pitch can be elicited by pure tones, where the repetition rate corresponds to the tone’s frequency, and harmonic complex tones, such as musical instrument sounds and voiced speech, where the repetition rate and, usually, the perceived pitch correspond to the fundamental frequency (F0) of the tone. The frequency component corresponding to the F0 does not need to be present in the sound for the corresponding pitch to be perceived, which is why this pitch is sometimes referred to as virtual pitch. Here, we use the more general expression of periodicity pitch (Fig. 1).
Fig. 1.
In a new window Download PPT
Fig. 1.
Schematic illustration of the relationship between acoustic periodicity and perceived pitch for pure and complex tones. A pure tone (Upper, blue) and a harmonic complex tone (Lower, red) are shown, which both produce the same pitch corresponding to the note A above middle C, as illustrated by the musical notation in Right Inset. Left shows the time waveforms with a repetition rate of 440 Hz. Right shows the tones’ spectral representations, including the fact that the complex tone does not have any energy at the fundamental frequency (F0) of 440 Hz.
The question of how pitch is perceived and coded in the auditory system has been a matter of scientific debate for well over a century (3-5). Currently, the neural coding of pitch is widely believed to rely on the precise timing of action potentials, or spikes, within the auditory nerve (6, 7). At low frequencies, spikes tend to occur at a certain phase within each cycle of a pure tone, leading to time intervals between spikes at multiples of the tone’s period. In the mammals studied so far, this phenomenon of phase locking has been found to be robust for frequencies up to about 1-2 kHz and then to degrade at higher frequencies, becoming difficult to detect experimentally at frequencies between 3 and 6 kHz, depending on the species (8). Phase locking cannot be measured directly in the human auditory nerve because of the invasive nature of the measurements, but the putative upper limit of phase locking has been used to explain a number of pitch phenomena. First, our ability to recognize melodies (9-11) and musical intervals (such as the octave) (12) is severely degraded for pure tones above about 4-5 kHz, as is our ability to detect small changes in frequency (13). Second, absolute pitch–the ability of some individuals to name musical notes without reference to other notes–vanishes or becomes very weak for tones with frequencies above about 4-5 kHz (14). Third, if all of the harmonics in a complex tone are above 5 kHz, then it has been claimed that no periodicity pitch is heard at all, even if the F0 itself is below 5 kHz (15). Finally, the highest note on the grand piano (C8–4,186 Hz) and the highest notes of the highest orchestral instruments (such as piccolo) all fall in the range of 4-5 kHz. The close correspondence between the putative limit of neural phase locking and the upper limit of musical pitch has led to the widespread belief that peripheral coding constraints may have determined the upper limit of musical pitch perception and even musical instrument notes (1, 16, 17).
Contrary to current beliefs, the present experiments reveal that tones consisting of only high-frequency harmonics, all above the putative existence region of pitch, can elicit a clear periodicity pitch corresponding to the F0 and that the pitch is strong enough to support accurate melody discrimination. The results suggest that the commonly accepted limit of phase locking in the mammalian auditory nerve determines neither the upper limit of melodic pitch perception nor the upper pitch limits of musical instruments.
Results
Pitch Matching. In Experiment 1, our participants were asked to adjust the frequency of a pure tone until its perceived pitch matched that of a preceding reference complex tone (Fig. 2A). The complex tones consisted of up to 12 consecutive harmonics, starting with the lowest harmonic rank of either 3 or 6. The majority of matches fell within ۪.25 semitones (or about 1.5%) of the true F0 or its octave equivalent (Fig. 2B and Fig. S1). For harmonic complex tones with components below 5 kHz, the pitch matches were accurate, in line with expectations (Fig. 2B Left). When all harmonics were above 5 kHz, the results were unexpected: all our participants showed an excellent ability to adjust the frequency of a pure tone to match the F0 of the complex tone, even though all of the harmonics were above the currently accepted 5-kHz upper limit of the existence region for periodicity pitch (Fig. 2B Center). Only when all the harmonics were above 10 kHz was some deterioration in performance observed (Fig. 2B Right). The decline in performance at very high frequencies may be due to the reduction in the number of audible harmonics; because we only screened participants for their ability to hear the stimuli up to frequencies of 16 kHz, in our highest-frequency condition, only three harmonics (12, 14, and 16 kHz) may have been audible. However, even in this condition, a clear peak in the pitch-match histogram was observed at the pure-tone frequency corresponding to the F0 of the harmonic complex or its octave equivalent.
Fig. 2.
In a new window Download PPT
Fig. 2.
Stimuli and results for Experiment 1 (pitch matches). (A) Schematic diagram of the stimuli used in the pitch-matching task for three sample conditions. The first stimulus in each trial was a complex tone that was fixed; the second tone was a pure tone, the frequency of which was adjusted up or down by the participants until its pitch-matched that of the complex tone. All tones were presented in a background of noise to mask possible distortion products (not included in the diagram). (B) Sample histograms from the same three sample conditions pooled across the six participants, with bin widths of 0.5 semitones. Green bars indicate matches that fell within ۪.25 semitones of the complex’s F0 (or its octave equivalents) and were considered correct. Yellow bars show matches that fell outside the range considered correct when estimating performance. (C) Average percent of pitch matches that were within ۪.25 semitones of the F0 of the fixed complex tone or an octave above or below as a function of the lowest frequency present in the complex tone. Chance performance, based on a uniform distribution, corresponds to 4.17%. The lowest harmonic rank present in the complex was either three (filled circles) or six (open circles). For one-half of the conditions (10 rightmost points), the lowest frequency present in the complex tone was above 5 kHz, the previously assumed upper limit of the existence region of musical pitch (dashed green line marked by the green arrow). Error bars represent 95% confidence intervals based on the posterior distribution of the percent of correct matches across subjects.
Overall, the proportion of accurate pitch matches remained very high for many conditions in which no harmonics below 5 kHz were present (Fig. 2C); even in the highest-frequency condition tested, the average performance of 46% was significantly above the chance performance of 4% (binomial test; P < 0.0001). As shown by the confidence intervals, performance was generally more variable between participants at the higher F0s. However, no evidence for an upper limit to the existence region of pitch was found at or close to 5 kHz, which contrasts with both current beliefs and predictions based on the putative limit of phase locking in the auditory nerve.
Melody Discrimination. A common operational definition of pitch is that it can convey melodic information (18). Consequently, in Experiment 2a we tested whether our findings of pitch perception at high frequencies generalized to a melody discrimination task. Participants were presented with two four-note melodies, both taken from the seven-note diatonic (major) scale. The second melody had either the same notes as the first or differed in its second or third note by one scale step (Fig. 3A). A change of only one scale step ensured that an upward-going interval was never replaced by a downward-going interval (or vice versa), and therefore, the melodic contour was never reversed. In the pure-tone low condition, performance was good–the average percent-correct score was about 85% (Fig. 3B Left, light blue bar). This outcome was expected: all of the tones were well below 5 kHz and therefore, should have conveyed clear melodic information. In the pure-tone high condition, performance was much poorer, with an average percent-correct score of about 59%–close to the chance level of 50% (Fig. 3B Left, dark blue bar). Again, this result was expected based on earlier studies showing that pure-tone melody discrimination deteriorates at frequencies above about 4-5 kHz (9). The condition of most interest is the complex-tone high condition in which the second melody consisted of complex tones with F0s between 1 and 2 kHz but filtered so as to contain only harmonics above 6 kHz. Here, performance was uniformly good, with an average percent correct score of 94% (Fig. 3B Left, red bar). As indicated by 95% confidence intervals (CIs) around percent-correct scores, performance in this third condition (CI = 88-98%) was significantly better than in the second condition (CI = 48-68%) and not significantly different from performance in the first condition (CI = 78-91%). The results from this final condition show that melodic pitch information can be extracted from complex tones that contain only frequencies above 6 kHz.
Fig. 3.
In a new window Download PPT
Fig. 3.
Stimuli and results for experiment 2 (melody discrimination). (A) Representations of a sample melody in the conditions tested as spectrograms and in musical notation. In each condition, the first four-note melody was always comprised of pure tones. The second melody was comprised of either pure tones or filtered complex tones and was the same as the first or its second or third note was raised or lowered by one step in the diatonic scale. In this example, the second note is lowered by one step. Melody 1 in the pure-tone low condition was drawn from frequencies between 500 Hz and 1 kHz; melody 1 in the pure-tone high condition was drawn from frequencies between 1.5 and 3 kHz. In both conditions, melody 2 was transposed up by two octaves. In the complex-tone high condition, melody 1 consisted of pure tones drawn from 1 to 2 kHz, and melody 2 consisted of complex tones filtered between 7.5 and 16 kHz, with F0s between 1 and 2 kHz. The complex-tone shifted condition was the same, except that the components within each complex tone were shifted up or down by the same random amount to disrupt their harmonic relations while maintaining the same frequency spacing between components. The shift was selected for each complex tone independently from between 10% and 90% of the F0 with uniform distribution. (B) Average percentage of correct responses in the different conditions in Experiment 2 a and b. Performance in the complex-tone high condition (red bars) was good in both experiments, despite the absence of components below 6 kHz, whereas single pure tones above 6 kHz in the pure-tone high condition (dark blue bar) did not convey reliable melodic information. The poor performance in the complex-tone shifted condition (pink bar) from Experiment 2b shows that listeners were not able to reliably perceive melodic pitch using periodic temporal envelope or beating cues. Error bars represent the 95% confidence interval around the mean.
Envelope Repetition Pitch. Is the pitch from our high-frequency complex tones elicited by the individual harmonics or by interactions between adjacent harmonics? Interactions between harmonics can lead to fluctuations in the temporal envelope at a rate corresponding to the F0–the so-called envelope repetition pitch (18-21). If our results reflected sensitivity to the temporal envelope, then they would not speak to the limit of phase locking to individual upper harmonics, because the frequency of interest would be the envelope repetition rate (1-2 kHz) rather than the frequencies of the individual components (>6 kHz). Experiment 2b tested the influence of envelope-repetition pitch by replicating conditions 1 and 3 from Experiment 2a and adding a new variant of condition 3, in which the filtered complex tone contained components that were spaced regularly apart by the F0 but were all randomly shifted in frequency by the same amount in Hertz. A useful property of these frequency-shifted complex tones is that their temporal envelope is identical to that of their harmonic counterparts, but their frequencies are no longer harmonically related (22). Thus, if the perceived pitch is based on the envelope repetition rate, then performance should be the same for the harmonic and shifted conditions. However, if the perceived pitch is based on frequency information from the individual harmonics, then the ambiguous pitches produced by the inharmonic frequency-shifted complexes should lead to poorer performance. The results from Experiment 2b indicated good performance for both of the original conditions (Fig. 3B Right, red and blue bars) but poor performance with the frequency-shifted complex tones, close to the chance level of 50% (Fig. 3B Right, pink bar). Confidence intervals for the three conditions were 82-93%, 81-94%, and 45-63%, respectively. Thus, the good performance found for harmonic complex tones with all components above 6 kHz cannot be accounted for by envelope repetition rate at the F0 or by the regular spacing of the component frequencies (23) (SI Results, Experiment 2c provides another test of envelope repetition pitch).
Discussion
The results from both the pitch-matching and melody discrimination experiments show that accurate pitch information can be conveyed by harmonics that are all above 6 kHz. The results are in direct contradiction to the widely accepted 5-kHz limit to the existence region of pitch established in early but highly influential studies (15, 24). What might explain this apparent discrepancy? There are many differences between the techniques of the earlier studies and the current one. We believe four of these may be most critical. First, the early study that tested both high F0s and high spectral regions (15) used only three consecutive harmonics, whereas our study used many (at least five, depending on the experiment and the F0). Complex tones that contain few harmonics generally produce a weaker pitch than complex tones with many harmonics (25). Therefore, the earlier study may have underestimated the ability of high-frequency harmonics to convey pitch. Second, our experiments used objective measures of performance (i.e., percent correct in a pitch-matching task and percent correct in a melody discrimination task), whereas the earlier studies were based on the more subjective measure of whether subjects believed that they could hear a periodicity pitch. Performance measures typically provide less variable and more reliable results than subjective measures, because they can be made relatively free of the influence of individual bias and criterion effects (26). Third, our tones were embedded in a noise background, whereas the tones in earlier studies were presented in quiet. We used the noise primarily to rule out the possibility that participants could detect distortion products at lower frequencies (27). However, embedding tones at levels just above masked threshold may also enhance the percept of the missing fundamental, perhaps by making the presence of the missing components more plausible, thereby encouraging perceptual filling in of the missing components and hence, the F0 (28-30). Finally, no data were provided in the early studies regarding the audibility of very high-frequency components. In our study, more than one-third of the potential participants, all of whom had clinically normal hearing, were excluded because of their high absolute thresholds at 16 kHz. It is, therefore, possible that limited audibility may have affected the early studies of the upper limit of pitch.
In physiological (31) and psychological (32) experiments on pitch it is important to ensure that distortion products are not contributing to the observed effects. To test that our background noise level was sufficient to mask any distortion products produced by the ear, we measured the distortion products elicited by a subset of the experimental stimuli. We found that the effective levels of distortion in the ear were very low [less than 20 dB sound pressure level (SPL) in all cases] and therefore, were at least 20 dB below the masked thresholds produced by our background noise (SI Results, Experiment 3).
Our results challenge the link between the upper limit of periodicity pitch and the putative limit of phase locking in the human auditory nerve. In particular, we observed a dissociation between pure tones above 6 kHz, which did not convey reliable melodic information, and those same tones when combined within a harmonic complex with an F0 (and a pitch) well below 6 kHz, which did convey reliable melodic information. The apparent dissociation may be explained in part by the possibility that the information from the multiple tones within the harmonic complex is integrated to form a more accurate representation of pitch than can be formed from a single pure tone. Previous data obtained at lower frequencies do not support this possibility: pitch discrimination thresholds for complex tones are not typically better than thresholds for pure tones (33). However, our tones were higher in frequency than those that have been used in previous studies, and it is possible that benefits of combining information across multiple frequency components are observed only at very high frequencies, where peripheral sensory limitations (e.g., poor phase locking of auditory-nerve fibers) may predominate over more central ones, ranging from neural noise in brainstem structures to higher-level constraints such as echoic memory (34). To test this possibility, we measured frequency and F0 difference limens [just noticeable differences (JNDs)] for pure and complex tones from a subset of the tones used in Experiment 2 (SI Results, Experiment 4). In contrast to earlier studies, we found discrimination thresholds that were better for the high-frequency complex tones than for the high-frequency pure tones, suggesting spectral integration of information. However, even the high-frequency pure tones produced mean JNDs of less than 1.5%–well below the 6% difference in frequency between neighboring notes in the chromatic scale. Data in the literature indicate that pitch discrimination thresholds as large as 3-5% can be sufficient for pitch interval and melody discrimination or identification (18, 20, 35, 36). Thus, performance in our melody discrimination experiments seems to have been limited not by peripheral factors (such as degraded spike-timing information in the auditory nerve at high frequencies) but by central factors occurring at or after the stage at which frequency information from different peripheral channels is combined in the CNS to estimate the pitch of complex tones. Candidate sites for the necessary neural integration range from the cochlear nucleus, where inputs from primary auditory neurons are first integrated over wide frequency regions (37), to the auditory cortex, where neural correlates of F0 and pitch and pitch strength have been identified in both human imaging studies (38-40) and neurophysiological studies in nonhuman primates (31).
If not the limits of auditory-nerve phase locking, what explains the fact that melody perception is limited to pitches below 4-5 kHz? One possibility is that melodic pitch perception is learned through repeated exposure to vocal and instrumental sounds during development and that the lack of exposure to notes higher than those found on current musical instruments results in a reduced ability to process high-F0 pitch information. Similar arguments have been explored in attempts to explain many musically related phenomena, including complex pitch perception in general and the perception of consonance or dissonance (41-43). For pitch perception, it has been argued that harmonic templates may emerge through exposure to sound (44) and that templates do not form for sounds that are not encountered in the everyday acoustic environment (45); a similar argument may apply to melodic pitch above 4-5 kHz. Indeed, recent work using EEG has suggested strong short- and long-term plasticity in pitch coding at levels as low as the auditory brainstem structures in humans (46, 47). In this way, the brainstem and/or cortical representations of pitch and pitch relations may only emerge during normal development with exposure to the appropriate sounds.
If future physiological studies were to confirm that precise spike timing in the human auditory nerve is severely degraded or absent at high frequencies, as is currently believed, then our psychophysical findings would lead to the conclusion that accurate phase locking in the auditory nerve is not necessary for complex pitch perception. This conclusion would support so-called place models of pitch perception, which rely on spatial representations of sound frequencies (i.e., tonotopy) in the cochlea (48), over time models, which rely on precise spike timing in the auditory nerve (7, 49). An earlier study suggested that spike timing information from individual harmonics was not sufficient for periodicity pitch perception (50); the current study suggests that it may not be necessary. An alternative interpretation is that phase-locked timing information in the auditory nerve can be extracted at much higher frequencies than is currently believed. This could be the case if the human auditory nerve has different response properties than those found in other mammals studied so far, or if the residual amount of phase locking at high frequencies in the human auditory nerve is similar to that in other mammals but is still sufficient to support frequency coding even at high frequencies (51-53). Distinguishing between these possibilities will require further physiological investigation of auditory-nerve properties, including the development of more direct ways to study temporal responses in the human auditory nerve.
Methods
Experiment 1: Pitch Matching. The harmonic complex tones were generated by adding together harmonics with equal amplitudes (55 dB SPL per component). The starting phase of each harmonic was drawn randomly from a uniform distribution on each presentation. The pure tones were set to 65 dB SPL to better approximate the loudness of the complex tones. The tones were embedded in a broadband (20 Hz to 22 kHz) threshold-equalizing noise (TEN) with a level of 45 dB SPL within the estimated equivalent rectangular bandwidth (ERBN) of the auditory filter at 1 kHz (54) to mask any possible distortion products generated by the ear (27). The tones all had a total duration of 500 ms, including 20-ms raised-cosine onset and offset ramps.
Each trial began with a 500-ms complex tone followed by a 500-ms gap and then, a 500-ms pure tone. The background noise began 300 ms before the standard tone, continued through the trial, and ended 300 ms after the end of the comparison tone. Participants were asked to adjust the frequency of the pure tone until its pitch matched that of the complex tone. The starting frequency of the pure tone was randomly selected on each block from a uniform distribution on a discrete semitone scale 䔶 semitones around the F0 of the complex tone. After each trial, participants could adjust the frequency of the pure tone up or down by 4, 1, or 0.25 semitones, could elect to hear the trial at the same frequencies again, or could indicate that they were satisfied with the pitch match by using virtual buttons on a graphical user interface. Participants were encouraged to bracket the pitch of the reference complex tone by adjusting the pitch of the pure tone below and above that of the reference before making a final decision. The complex tones had F0s of 400, 765, 840, 960, 1,050, 1,200, 1,400, 1,600, 1,800, and 2,000 Hz and were comprised of up to 12 consecutive harmonics, beginning on the third or the sixth. Harmonics above 20 kHz were not generated.
Each of the six participants completed 15 matches for each combination of F0 and lowest-harmonic rank (third and sixth), resulting in a total number of matches per participant of 300 (15 matches for each of 10 F0s and two lowest-harmonic ranks). The order of presentation of the matches was randomized within each block of the 20 conditions and was different for each participant and each of the 15 repetitions. No feedback was provided to participants regarding their performance on the task.
Experiment 2: Melody Discrimination with Harmonic and Frequency-Shifted Tones. The melody discrimination tasks involved two groups of six participants. On each trial, a four-note melody was generated from a set of eight consecutive notes from the diatonic (major) scale with equal temperament–do, re, mi, fa, sol, la, ti, and do in solfège notation with a moving do. The notes were selected randomly with uniform distribution and replacement, with the sole constraint that no three consecutive notes could be the same. The melody was presented two times. The notes in the second presentation were either unchanged (same trial) or either the second or third note of the melody was raised or lowered by one scale step (different trial). Because the change was only one scale step (either one or two semitones), it never resulted in a reversal of the melodic contour.
Participants in Experiment 2a were presented with three conditions (Fig. 3A and examples in Audios S1, S2, S3, S4, S5, and S6). Participants in Experiment 2b were presented with two of the original conditions plus a condition involving inharmonic complexes (complex-tone shifted condition). The stimuli were organized into blocks of 60 trials, consisting of 20 presentations of each condition (10 same and 10 different trials), presented in a new random order for each block. After some exposure to the low-frequency pure-tone conditions to ensure that participants understood the task and were capable of performing it, each participant completed a total of 12 blocks. The first two blocks were discarded as practice, and therefore, the reported data comprise a total of 10 blocks or 200 trials per condition per participant. Participants were informed that there would be equal numbers of same and different trials to reduce potential bias, but no trial by trial feedback on performance was provided.
The background noise was the sum of a broadband 45 dB SPL per ERBN TEN, as in Experiment 1, and an additional 55 dB SPL per ERBN TEN, bandpass-filtered between 20 Hz and 5.5 kHz to further exclude any possibility of audible distortion products (27). The level of the pure tones below 3 kHz was 65 dB SPL, and the level of the pure and complex tones above 6 kHz was 55 dB SPL per component before filtering. The complex tones were bandpass-filtered with corner frequencies of 7.5 and 16 kHz and spectral slopes of 30 dB/octave. This filtering ensured that tones below 6 kHz would fall below the masked threshold of the background noise and would therefore be inaudible. The starting phases of the harmonics were again selected at random from a uniform distribution. The F0s of the harmonic tones in the reference melody were between 1,000 and 2,000 Hz, and so the lowest audible harmonic ranged from the third to the sixth, depending on the F0. The inharmonic complexes were generated by shifting all of the components of the original complex up in frequency by the same amount between 10% and 90% of the F0. The size of the shift was selected randomly and uniformly from that distribution for each note independently. In this way, the spacing between each component (and hence, the envelope repetition rate) remained the same as the F0, but the tones were no longer harmonic (22).
The notes each had a total duration of 300 ms, including 10-ms raised-cosine onset and offset ramps. Within a melody, notes were separated by 200-ms gaps, producing an overall presentation rate of 2 tones/s (120 beats/min). Each melody lasted 1.8 s, and the two melodies within a trial were separated by a 700-ms gap. The background noise was gated on 200 ms before the first note of the first melody and was gated off 400 ms after the end of the final note of the second melody using 10-ms raised-cosine ramps.
Methods are described in more detail in SI Methods.
Acknowledgments
The authors are grateful to Laurent Demany, Xavier Perrot, Josh McDermott, Bob Carlyon, Chris Plack, and Brian C. J. Moore for helpful comments and discussions. This work was supported by Grant R01 DC 005216 from US National Institutes of Health (to A.J.O.).
Footnotes
1To whom correspondence should be addressed. E-mail: oxenham@umn.edu.
Author contributions: A.J.O. designed research; M.V.K., A.L., and S.S. performed research; A.J.O. and C.M. analyzed data; and A.J.O. and C.M. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission. B.H. is a guest editor invited by the Editorial Board.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pna s.1015291108/-/DCSupplemental.
Previous Section
References
ਪµ Plack CJ, Oxenham AJ, Popper AN, Fay R, eds (2005) Pitch: Neural Coding and Perception (Springer, New York). Search Google Scholar
ਪµ McDermott JH, Oxenham AJ (2008) Music perception, pitch, and the auditory system. Curr Opin Neurobiol 18:452-463. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Seebeck A (1841) Beobachtungen über einige Bedingungen der Entstehung von Tönen. Ann Phys Chem 53:417-436. Search Google Scholar
ਪµ Ohm GS (1843) ¯eber die Definition des Tones, nebst daran geknüpfter Theorie der Sirene und ähnlicher tonbildender Vorrichtungen. Ann Phys Chem 59:513-565. Search Google Scholar
ਪµ Helmholtz HLF (1885) On the Sensations of Tone (Dover, New York). Search Google Scholar
ਪµ Cariani PA, Delgutte B (1996) Neural correlates of the pitch of complex tones. I. Pitch and pitch salience. J Neurophysiol 76:1698-1716. Abstract/FREE Full Text
ਪµ Meddis R, O’Mard L (1997) A unitary model of pitch perception. J Acoust Soc Am 102:1811-1820. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Palmer AR, Russell IJ (1986) Phase-locking in the cochlear nerve of the guinea-pig and its relation to the receptor potential of inner hair-cells. Hear Res 24:1-15. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Attneave F, Olson RK (1971) Pitch as a medium: A new approach to psychophysical scaling. Am J Psychol 84:147-166. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Burns EM, Feth LL (1983) in Hearing–Physiological Bases and Psychophysics, Pitch of sinusoids and complex tones above 10 kHz, eds Klinke R, Hartmann R (Springer, Berlin), pp 327-333. Search Google Scholar
ਪµ Semal C, Demany L (1990) The upper limit of “musical” pitch. Music Percept 8:165-176. Web of Science Search Google Scholar
ਪµ Ward WD (1954) Subjective musical pitch. J Acoust Soc Am 26:369-380. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Moore BCJ (1973) Frequency difference limens for short-duration tones. J Acoust Soc Am 54:610-619. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Ohgushi K, Hatoh T (1991) in Proceedings of the Ninth International Symposium on Hearing: Auditory Physiology and Perception, Perception of the musical pitch of high frequency tones, eds Cazals Y, Demany L, Horner K (Pergamon, Oxford). Search Google Scholar
ਪµ Ritsma RJ (1962) Existence region of the tonal residue. I. J Acoust Soc Am 34:1224-1229. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Moore BCJ (2003) An Introduction to the Psychology of Hearing (Academic, London), 5th Ed. Search Google Scholar
ਪµ Plack CJ (2005) The Sense of Hearing (Lawrence Erlbaum Associates, London). Search Google Scholar
ਪµ Burns EM, Viemeister NF (1976) Nonspectral pitch. J Acoust Soc Am 60:863-869. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Carlylon RP, Deeks JM (2002) Limitations on rate discrimination. J Acoust Soc Am 112:1009-1025. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Kaernbach C, Bering C (2001) Exploring the temporal mechanism involved in the pitch of unresolved harmonics. J Acoust Soc Am 110:1039-1048. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Pollack I (1969) Periodicity pitch for white noise–fact or artifact? J Acoust Soc Am 45:237-238. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ de Boer E (1956) Pitch of inharmonic signals. Nature 178:535-536. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Roberts B, Brunstrom JM (1998) Perceptual segregation and pitch shifts of mistuned components in harmonic complexes and in regular inharmonic complexes. J Acoust Soc Am 104:2326-2338. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Ritsma RJ (1963) Existence region of the tonal residue II. J Acoust Soc Am 35:1241-1245. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Houtsma AJM, Smurzynski J (1990) Pitch identification and discrimination for complex tones with many harmonics. J Acoust Soc Am 87:304-310. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Macmillan NA, Creelman CD (2005) Detection Theory. A User’s Guide (Lawrence Erlbaum Associates, London). Search Google Scholar
ਪµ Smoorenburg GF (1972) Audibility region of combination tones. J Acoust Soc Am 52:603-614. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Houtgast T (1976) Subharmonic pitches of a pure tone at low S/N ratio. J Acoust Soc Am 60:405-409. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Hall JW, Peters RW (1981) Pitch from nonsimultaneous successive harmonics in quiet and noise. J Acoust Soc Am 69:509-513. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ McDermott JH, Oxenham AJ (2008) Spectral completion of partially masked sounds. Proc Natl Acad Sci USA 105:5939-5944. Abstract/FREE Full Text
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Bendor D, Wang X (2005) The neuronal representation of pitch in primate auditory cortex. Nature 436:1161-1165. CrossRefMedline Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Oxenham AJ, Micheyl C, Keebler MV (2009) Can temporal fine structure represent the fundamental frequency of unresolved harmonics? J Acoust Soc Am 125:2189-2199. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Moore BCJ, Glasberg BR, Shailer MJ (1984) Frequency and intensity difference limens for harmonics within complex tones. J Acoust Soc Am 75:550-561. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Kaernbach C, Schlemmer K (2008) The decay of pitch memory during rehearsal. J Acoust Soc Am 123:1846-1849. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Formby C (1985) Differential sensitivity to tonal frequency and to the rate of amplitude modulation of broadband noise by normally hearing listners. J Acoust Soc Am 78:70-77. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Burns EM, Viemeister NF (1981) Played again SAM: Further observations on the pitch of amplitude-modulated noise. J Acoust Soc Am 70:1655-1660. CrossRefWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Sayles M, Winter IM (2008) Reverberation challenges the temporal representation of the pitch of complex sounds. Neuron 58:789-801. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Griffiths TD, Uppenkamp S, Johnsrude I, Josephs O, Patterson RD (2001) Encoding of the temporal regularity of sound in the human brainstem. Nat Neurosci 4:633-637. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Penagos H, Melcher JR, Oxenham AJ (2004) A neural representation of pitch salience in non-primary human auditory cortex revealed with fMRI. J Neurosci 24:6810-6815. Abstract/FREE Full Text
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Hyde KL, Peretz I, Zatorre RJ (2008) Evidence for the role of the right auditory cortex in fine pitch resolution. Neuropsychologia 46:632-639. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Terhardt E (1974) Pitch, consonance, and harmony. J Acoust Soc Am 55:1061-1069. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Schwartz DA, Purves D (2004) Pitch is determined by naturally occurring periodic sounds. Hear Res 194:31-46. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ McDermott JH, Lehr AJ, Oxenham AJ (2010) Individual differences reveal the basis of consonance. Curr Biol 20:1035-1041. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Shamma S, Klein D (2000) The case of the missing pitch templates: How harmonic templates emerge in the early auditory system. J Acoust Soc Am 107:2631-2644. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Bernstein JG, Oxenham AJ (2003) Pitch discrimination of diotic and dichotic tone complexes: Harmonic resolvability or harmonic number? J Acoust Soc Am 113:3323-3334. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Wong PC, Skoe E, Russo NM, Dees T, Kraus N (2007) Musical experience shapes human brainstem encoding of linguistic pitch patterns. Nat Neurosci 10:420-422. MedlineWeb of Science Search Google Scholar
ਪµ Song JH, Skoe E, Wong PC, Kraus N (2008) Plasticity in the adult human auditory brainstem following short-term linguistic training. J Cogn Neurosci 20:1892-1902. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Wightman FL (1973) The pattern-transformation model of pitch. J Acoust Soc Am 54:407-416. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ de Cheveigné A (2005) in Pitch: Neural Coding and Perception, Pitch perception models, eds Plack CJ, Oxenham AJ, Popper AN, Fay R (Springer, New York). Search Google Scholar
ਪµ Oxenham AJ, Bernstein JG, Penagos H (2004) Correct tonotopic representation is necessary for complex pitch perception. Proc Natl Acad Sci USA 101:1421-1425. Abstract/FREE Full Text
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Heinz MG, Colburn HS, Carney LH (2001) Evaluating auditory performance limits: I. One-parameter discrimination using a computational model for the auditory nerve. Neural Comput 13:2273-2316. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Moore BCJ, S¨TM)k A (2009) Sensitivity of the human auditory system to temporal fine structure at high frequencies. J Acoust Soc Am 125:3186-3193. CrossRefMedlineWeb of Science Search Google Scholar
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Recio-Spinoso A, Temchin AN, van Dijk P, Fan YH, Ruggero MA (2005) Wiener-kernel analysis of responses to noise of chinchilla auditory-nerve fibers. J Neurophysiol 93:3615-3634. Abstract/FREE Full Text
SocialCiteAppropriate Citation?Strong Evidence?Add More
ਪµ Moore BCJ, Huss M, Vickers DA, Glasberg BR, Alcantara JI (2000) A test for the diagnosis of dead regions in the cochlea. Br J Audiol 34:205-224. MedlineWeb of Science Search Google Scholar
http://www.pnas.org/content/108/18/7629.full
===
Pitch perception traditionally refers to the perception of auditory sensations related to the fundamental frequency of a sound, typically within the range of 20 Hz to 20,000 Hz. However, research has shown that pitch perception can extend beyond this traditional frequency range under certain conditions. Here are some key points regarding pitch perception beyond the traditional existence region of pitch:
1. **Ultrasonic Pitch Perception**: While humans are generally not able to hear sounds above 20 kHz (the upper limit of human hearing), some studies have suggested that under specific circumstances, individuals may perceive ultrasonic frequencies as pitch. For example, experiments with animal vocalizations or ultrasonic musical instruments have shown that humans can perceive and discriminate ultrasonic pitches.
2. **Contextual Effects**: The perception of pitch can be influenced by contextual factors such as harmonic structure, temporal patterns, and spectral content of the sound. Even if the fundamental frequency of a sound falls outside the traditional range of pitch perception, listeners may still perceive a pitch-like sensation based on the harmonic relationships or spectral envelope of the sound.
3. **Bimodal and Multimodal Perception**: Pitch perception can be influenced by interactions between different sensory modalities, such as auditory and tactile or auditory and visual stimuli. For example, studies have shown that individuals can perceive pitch-like sensations when presented with bimodal or multimodal stimuli that combine auditory and tactile or auditory and visual information.
4. **Individual Differences**: The ability to perceive pitch beyond the traditional existence region may vary among individuals and may be influenced by factors such as age, hearing sensitivity, musical experience, and cognitive processing abilities. Some individuals may have a heightened sensitivity to ultrasonic frequencies or may be more adept at integrating information across different sensory modalities.
5. **Clinical and Practical Applications**: Understanding the mechanisms of pitch perception beyond the traditional range has practical implications for various fields, including music technology, psychoacoustics, and auditory neuroscience. For example, ultrasonic pitch perception has applications in ultrasound imaging technology and communication systems.
Overall, while pitch perception is traditionally associated with the auditory domain and the range of human hearing, research suggests that it can extend beyond the traditional existence region of pitch under certain conditions. Further investigation is needed to elucidate the mechanisms underlying pitch perception in these extended frequency ranges and its implications for perception, cognition, and technology.
===
Perfect pitch, also known as absolute pitch, is the ability to identify or produce a musical note without the need for a reference tone. Individuals with perfect pitch can recognize and name a note simply by hearing it, without needing any contextual cues.
The effects of perfect pitch can vary depending on the individual and their musical background. Some potential effects of perfect pitch include:
1. **Enhanced Musical Abilities:** Individuals with perfect pitch often have a heightened sensitivity to pitch and musical nuances. This can lead to advanced musical abilities, such as the ability to play by ear, easily transpose music to different keys, and quickly learn new pieces of music.
2. **Improved Music Theory Skills:** Perfect pitch can facilitate a deeper understanding of music theory concepts, such as chord progressions, harmonies, and key relationships. This enhanced understanding can be beneficial for composers, arrangers, and music educators.
3. **Increased Musical Memory:** Perfect pitch can contribute to a stronger musical memory, allowing individuals to recall melodies, harmonies, and musical structures with greater accuracy and detail.
4. **Challenges in Music Performance:** While perfect pitch can be advantageous in many musical contexts, it can also present challenges, particularly in ensemble settings where instruments may not be perfectly tuned or in non-standard tunings. Individuals with perfect pitch may be more sensitive to discrepancies in tuning and intonation, which can affect their performance experience.
5. **Potential for Musical Career:** Perfect pitch can be a valuable asset for individuals pursuing careers in music, such as performers, composers, arrangers, and music producers. It can also be beneficial in fields such as music therapy, music education, and audio engineering.
It’s important to note that perfect pitch is relatively rare, occurring in only a small percentage of the population. While it can offer advantages in certain musical contexts, it is not a prerequisite for musical success, and individuals without perfect pitch can still achieve high levels of musical proficiency through training, practice, and dedication.
===
The opposite of perfect pitch is often referred to as “relative pitch.” Relative pitch is the ability to identify or reproduce musical notes by comparing them to a reference note or pitch. Unlike perfect pitch, which involves recognizing notes without a reference, relative pitch relies on the perception of intervals between notes and their relationship to one another.
Individuals with relative pitch may not be able to identify or produce specific notes without a reference point, but they can discern the distance between notes and recognize patterns in music. Relative pitch skills can be developed through ear training exercises and exposure to music, and many musicians rely on relative pitch to learn and perform music, even if they do not have perfect pitch.