Spectrographic Comparisons

Posted 10/01/1993
In Forensic Science

Enhancement of tape recorded voices to facilitate transcription & aural identification

Bruce E. Koenig
Federal Bureau of Investigation

Spectrographic Comparisons

The spectrographic laboratory technique is the most well-known and possibly the most accurate of the laboratory testing procedures presently available for comparing verbatim voice samples under forensic conditions. However, some scientists believe that aural identifications of very familiar voices are more accurate (Hecker 1971). The spectrographic technique has been described in numerous forensic and scientific publications, including an overview article published in the Crime Laboratory Digest (Koenig 1986). Therefore, a detailed explanation will not be rendered here; the following paragraphs provide a brief summary of the examination, a review of the new comprehensive standards passed by the IAI, and its status in government and private laboratories.

When properly conducted, spectrographic voice identification is a relatively accurate but not conclusive examination for comparing a recorded unknown voice sample with a suspect repeating the identical contextual information over the same type of transmission system (e.g., a local telephone line). The examiner uses both the short-term memory process previously detailed and a spectral pattern comparison between identically spoken sounds on spectrograms.

Figures 1A and 1B are sound spectrograms of different male speakers saying “salt and pepper.”

The horizontal axis represents time, divided into 0.1-second intervals by the short vertical bars near the top, and the vertical axis is frequency, ranging linearly from 80 Hz to 4000 Hz, with horizontal lines every 1000 Hz. The speech energy is reflected in the gray scale from black (highest level) to white (lowest level). The frequency range of the voice is analogous to the range of a musical instrument, where the lowest notes are at the lowest frequency and the highest notes at the highest frequency. The mostly horizontal bands of darkness reflect the vocal resonances and are called formants. The closely spaced vertical striations represent fundamental frequency (voice pitch) or the actual vibrations of the vocal cords. The spectrographic technique requires comparison of identical phrases between the voice samples, with a decision made at one of a number of confidence levels. The scientific support of this examination is limited, and the actual error rate under most investigative conditions is unknown. The research to date indicates that the technique has a certain error rate that is independent of examiner-induced errors, with errors of false elimination (the voice samples were actually from the same person, but the examination found that they did not match) appreciably higher than false identification (the voice samples were actually from different persons, but the examination found that the samples matched).

In July 1991, the Voice Identification and Acoustic Analysis Subcommittee of the IAI passed and published its first set of comprehensive spectrographic voice identification standards. These requirements, which became effective January 1, 1992, for all certified IAI members, include examiner qualifications, evidence handling, preparation of exemplars, preparation of copies, preliminary-examination, preparation of spectrograms, spectrographic/aural analysis, work notes, testimony, certification, and miscellaneous subjects. Table 1 lists the minimum qualifications for spectrographic examiners of the IAI and the FBI and updates a similar table published in an earlier issue of the Crime Laboratory Digest (Koenig 1986). Table 2 is another updated and expanded table from the same article concerning minimum criteria for spectrographic comparisons. Tables 1 and 2 and the previously published tables reflect that the upgraded IAI standards are now appreciably closer to the FBI’s criteria. The FBI’s standards require higher educational levels, more words for lower confidence decisions, enhancement procedures when needed, and a higher frequency voice range. The most important legal difference is the FBI’s policy not to provide testimony on spectrographic comparisons due to the inconclusive nature of the examination and the unknown error rate under specific investigative conditions.

Table 1. Minimum Qualifications for Spectrographic Examiners of the AIA and FBI
Qualification IAI FBI
Education High School Diploma BS Degree
Periodic Hearing Test Yes Yes
Length of Apprenticeship Usually 2 Years 2 Years
Number of Comparisons Conducted 100 100
Attendance at a Spectrographic School Yes Yes
Formal Certification Yes Yes
Table 2. Minimum Criteria for Spectrographic Comparison for the IAI and the FBI
Criteria IAI FBI
Words Needed for Highest Confidence Level 20 20
Words Needed for Lowest Confidence Level 10 20
Affirming Independent Secong Decision Yes Yes
Original Recording Required Yes Yes
Allows Testimony Yes No
Completely Verbatim Knon Samples Usually Usually
Speech Frequency Rate Above 2 KHz Above 2.5 KHz
Accuracy Statement om Report Yes Yes
Enhancement Proceedures When Needed Optional Yes
Speed Correction of All Recordings Yes Yes
Track Determiniation of All Recordings Yes Yes
Azimuth Alignment Correction Yes Yes

The use of the spectrographic technique since the mid­1980s continues to show a steady decline by both government laboratories and private examiners. As of mid-1993, the New York City Police Department and the FBI were the only government laboratories in this country regularly conducting these examinations. The private sector efforts were limited to less than a dozen part-time examiners. Professional meetings in the field have been sparsely attended, and no major spectrographic research is known to be under way. Problems still persist in the spectrographic voice identification field. Examples of these problems include the following: (1) separate sets of certified examiners making high­confidence decisions for both identification and elimination in the same case;1 (2) individuals with no experience, training, or education in the voice identification discipline making conclusive decisions under oath in court; and (3) examiners testifying that an unknown voice is not the defendant’s, although admitting their decisions are really inconclusive based upon accepted standards.

Note 1. Los Angeles Board of Civil Service Commisioners. Threat case decided March 25,1992, in which three IAI examiners made an identification at a high-confidence level, while two IM examiners eliminated the suspect.

This is part of a collection of posts on this subject:

  1. Forensic Sciences & Forensics: Science in Law
  2. Aural Identification of Familiar Voices
  3. Aural Identification of Unfamiliar Voices
  4. Spectrographic Comparisons
  5. Summary & Conclusion
 
FREE Consultation:
1-800-788-0450