Obaidul Karim’s Views on Technology
Obaidul Karim’s speech about technology concerns the technologies designed to
duplicate and respond to the human voice. These have many uses, including aids
for the voice-disabled, the hearing-disabled, and the blind, as well as
communication with computers without a keyboard. They also enhance game
software and help market goods or services by telephone.
The Orion Group Chairman said, “Speech synthesis is the artificial production of human speech.” A computer
system used for this purpose is called a speech synthesizer, and can be implemented
in software or hardware. A text-to-speech (TTS) system converts normal language
text into speech; other systems render symbolic linguistic representations like
phonetic transcriptions into speech.
Synthesized speech can be created by concatenating pieces of recorded speech
that are stored in a database. Systems differ in the size of the stored speech
units; a system that stores phones or diphones provides the largest output
range, but may lack clarity. For specific usage domains, the storage of entire
words or sentences allows for high-quality output. Alternatively, a synthesizer
can incorporate a model of the vocal tract and other human voice
characteristics to create a completely “synthetic” voice output.
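The concatenation idea can be sketched in a few lines of Python. This is only a toy illustration, not how any particular synthesizer works: the unit names, the two databases, and the short lists of numbers standing in for recorded audio are all made up for the example.

```python
# Toy concatenative synthesizer. Each "recording" is a short list of
# made-up sample values standing in for real audio.
from typing import Dict, List

# Hypothetical database of small units (diphones). Small units can be
# recombined into many different words.
DIPHONE_DB: Dict[str, List[float]] = {
    "h-e": [0.1, 0.2],
    "e-l": [0.3, 0.1],
    "l-o": [0.2, 0.4],
}

# Hypothetical database of whole words for a narrow usage domain.
WORD_DB: Dict[str, List[float]] = {
    "hello": [0.1, 0.2, 0.25, 0.2, 0.4],
}


def synthesize(units: List[str], database: Dict[str, List[float]]) -> List[float]:
    """Concatenate the stored recording for each requested unit, in order."""
    samples: List[float] = []
    for unit in units:
        if unit not in database:
            raise KeyError(f"no recording stored for unit {unit!r}")
        samples.extend(database[unit])
    return samples


# The same word built from three small units, or from one stored word.
diphone_output = synthesize(["h-e", "e-l", "l-o"], DIPHONE_DB)
word_output = synthesize(["hello"], WORD_DB)
```

The diphone database can spell out words it was never recorded for, but every join between units is a place where the output can sound unnatural. The word database has no joins inside a word, yet it fails as soon as a word outside its domain is requested.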
The quality of a speech synthesizer is judged by its similarity to the human
voice and by its ability to be understood. An intelligible text-to-speech
program allows people with visual impairments or reading disabilities to listen
to written works on a home computer. Many computer operating systems have
included speech synthesizers since the early 1990s.
Mr. Karim mentioned that speech recognition is the translation of spoken words
into text. It is also known as “automatic speech recognition”, “ASR”, “computer
speech recognition”, “speech to text”, or just “STT”.
Some speech recognition (SR) systems use “training”, in which an individual
speaker reads sections of text into the system. These systems analyze the
person’s specific voice and use it to fine-tune the recognition of that
person’s speech, resulting in more accurate transcription. Systems that use
training are called “speaker-dependent” systems; systems that do not are
called “speaker-independent” systems.
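One common way to put a number on “more accurate transcription” is word error rate (WER): the word-level edit distance between what was said and what the system wrote down, divided by the number of words spoken. The post does not name this metric, so treat the sketch below as one possible yardstick. The two system outputs are invented for illustration and are not measurements from any real recognizer.

```python
from typing import List


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref: List[str] = reference.lower().split()
    hyp: List[str] = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # previous[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref)


reference = "i would like to make a collect call"
# Invented outputs: one before and one after training on the speaker.
untrained_output = "i would like to bake a collect hall"
trained_output = "i would like to make a collect call"

print(word_error_rate(reference, untrained_output))  # 0.25 (2 wrong words of 8)
print(word_error_rate(reference, trained_output))    # 0.0
```

A lower WER means fewer words were substituted, dropped, or inserted, which is the kind of improvement training on one speaker’s voice is meant to deliver.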
Speech recognition applications include voice user interfaces such as voice
dialing (e.g., “Call home”), call routing (e.g., “I would like to make a
collect call”), domestic appliance control, search (e.g., find a podcast where
particular words were spoken), simple data entry (e.g., entering a credit card
number), preparation of structured documents (e.g., a radiology report),
speech-to-text processing (e.g., word processors or emails), and aircraft
(usually termed direct voice input).
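Once a recognizer has turned speech into text, a voice user interface still has to decide what to do with that text. Here is a minimal sketch of that last step for the voice dialing and call routing examples above. The action names and patterns are hypothetical, and a real system would take the transcript from an actual recognizer rather than a hard-coded string.

```python
import re
from typing import Optional, Tuple

# Hypothetical command patterns, checked in order. Each maps a transcript
# to an action name; a "slot" group captures the variable part, if any.
COMMAND_PATTERNS = [
    ("collect_call", re.compile(r"\bcollect call\b")),
    ("dial_contact", re.compile(r"^call (?P<slot>[a-z ]+)$")),
]


def route(transcript: str) -> Tuple[str, Optional[str]]:
    """Return (action, slot) for a transcript, or ("unknown", None)."""
    text = transcript.lower().strip().rstrip(".!?")
    for action, pattern in COMMAND_PATTERNS:
        match = pattern.search(text)
        if match:
            return action, match.groupdict().get("slot")
    return "unknown", None


print(route("Call home"))                            # ('dial_contact', 'home')
print(route("I would like to make a collect call"))  # ('collect_call', None)
print(route("Play some music"))                      # ('unknown', None)
```

Keeping the patterns in a plain list makes it easy to add new commands, but fixed patterns only cover phrasings someone thought of in advance, which is why the examples above are short, predictable commands.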