Synthesized Speech

Bell Telephone Laboratories (Bell Labs)

1961

Music/Sound

The Listening Room

Description

Liner notes:

A MACHINE THAT TALKS:

There are many possible kinds of synthesizers, or "talking" machines. To save the expense and time of building, testing, and modifying them, John L. Kelly, Jr. and Louis J. Gerstman of the Visual and Acoustics Research Department at Bell Laboratories use a high-speed, general-purpose computer to simulate them.

The computer is instructed to accept cards, to "operate" on this information similarly to the way an actual talking machine does, and to produce an "output" analogous to the output of the talking machine.

By changing the computer program it is comparatively easy to modify the characteristics of the talking machine.

The particular machine which was simulated in the computer to produce the speech on the recording is known technically as a "tandem resonant synthesizer." Usually this type of machine is operated by continuously feeding into it a set of nine signals corresponding to voice pitch, voice loudness, tongue position, and other speech variables.

When every instant of sound is specified, the machine produces sounds that are amazingly like human speech.

A PHONETIC INPUT:

Doctors Kelly and Gerstman have contributed a significant advance in the art of speech synthesis by devising a computer program which permits them to feed into the computer, on punched cards, the names of speech sounds.

Since the standard phonetic symbols representing speech sounds are not included on the keyboard of an ordinary card-punching machine, Kelly and Gerstman devised a new phonetic code using the letters of the alphabet.

At present it consists of 22 consonants and 12 vowels:

CONSONANTS: P - B - T - D - K - G - M - N - NG (as in sing) - F - V - S - Z - SH (as in she) - ZH (as in azure) - H - W - R - L - Y - TH (as in thin) - DH (as in then)

VOWELS: EE (as in bee) - I (as in ill) - AY (as in rate) - E (as in end) - AE (as in add) - AH (as in ah) - AW (as in jaw) - (as in go) - OO (as in foot) - UU (as in food) - UH (as in up) - ER (as in her)

Each speech sound is specified on a separate punched card.

When a sequence of cards is fed into the computer, it "operates" on the information, following the rules set up in the second part of its program, to produce the nine control signals that activate the talking machine program. For example, if the sequence of cards H, EE, S, AW, DH, UH, K, AE, T is put into the computer, the talking machine will say "He Saw The Cat" in a measured monotone voice.
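The card sequence above can be sketched in a few lines of Python. This is only an illustration of the phonetic code as listed in these notes, not the original Bell Labs program: the function name parse_cards and the comma-separated deck format are assumptions, and the symbol for the vowel "as in go" is omitted because it is missing from this transcription.

```python
# Phonetic code as listed in the liner notes. The symbol for the vowel
# "as in go" is missing from this transcription, so it is left out here.
CONSONANTS = {
    "P", "B", "T", "D", "K", "G", "M", "N", "NG", "F", "V",
    "S", "Z", "SH", "ZH", "H", "W", "R", "L", "Y", "TH", "DH",
}
VOWELS = {"EE", "I", "AY", "E", "AE", "AH", "AW", "OO", "UU", "UH", "ER"}


def parse_cards(deck: str) -> list[tuple[str, str]]:
    """Split a comma-separated deck into (symbol, kind) pairs, one per card.

    Hypothetical helper: the real input was a stack of punched cards,
    one speech sound per card.
    """
    cards = []
    for raw in deck.split(","):
        symbol = raw.strip().upper()
        if symbol in CONSONANTS:
            cards.append((symbol, "consonant"))
        elif symbol in VOWELS:
            cards.append((symbol, "vowel"))
        else:
            raise ValueError(f"unknown speech sound: {symbol!r}")
    return cards


# "He Saw The Cat" as given in the notes, one symbol per card.
for symbol, kind in parse_cards("H,EE,S,AW,DH,UH,K,AE,T"):
    print(symbol, kind)
```

The sketch only checks that each card names a sound in the code; turning the cards into the nine control signals is the part of the program the notes do not describe in detail.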

To obtain natural intonation and phrasing it is necessary to specify on each card (in addition to the speech sound) both the pitch of the sound and timing information.
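One way to picture a card that carries pitch and timing along with the speech sound is the small record below. It is a sketch under stated assumptions: the field names, the units (hertz and milliseconds), and every numeric value are hypothetical, since the notes do not say how pitch and timing were encoded on the cards.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Card:
    sound: str        # phonetic symbol from the code, e.g. "EE"
    pitch_hz: float   # assumed unit; the notes only say "pitch"
    duration_ms: int  # assumed unit; the notes only say "timing information"


def monotone_deck(sounds, pitch_hz=100.0, duration_ms=100):
    """Give every sound the same pitch and length (illustrative defaults)."""
    return [Card(s, pitch_hz, duration_ms) for s in sounds]


def total_duration_ms(deck):
    """Sum the per-card durations to get the length of the utterance."""
    return sum(card.duration_ms for card in deck)


flat = monotone_deck(["H", "EE", "S", "AW"])
# Hypothetical phrasing: raise and lengthen the final vowel.
phrased = flat[:-1] + [replace(flat[-1], pitch_hz=120.0, duration_ms=250)]
print(total_duration_ms(flat), total_duration_ms(phrased))
```

Keeping the sound, pitch, and timing together on one record mirrors the one-card-per-sound arrangement described in the notes, so changing the intonation means editing individual cards rather than the program.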

A SPEECH-LIKE OUTPUT:

The "speech" of the simulated talking machine comes out of the computer in the form of tiny magnetized spots on half-inch magnetic tape. This tape is fed to another machine which converts the digital information to a variable magnetic sound track suitable for playing on an ordinary tape recorder.

ON THIS RECORDING:

The samples of speech on this recording illustrate the present state of the art of speech synthesis.
