Automatic Speech Recognition

The Development of the SPHINX System

Nonfiction, Computers, Advanced Computing, Artificial Intelligence, Science & Nature, Technology, Electronics
Cover of the book Automatic Speech Recognition by Kai-Fu Lee, Springer US
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Kai-Fu Lee ISBN: 9781461536505
Publisher: Springer US Publication: December 6, 2012
Imprint: Springer Language: English
Author: Kai-Fu Lee
ISBN: 9781461536505
Publisher: Springer US
Publication: December 6, 2012
Imprint: Springer
Language: English

Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates; slow response time (minutes to hours) to instantaneous response time. These characteristics taken together increase the computational complexity of the problem by several orders of magnitude. Further, speech provides a challenging task domain which embodies many of the requirements of intelligent behavior: operate in real time; exploit vast amounts of knowledge, tolerate errorful, unexpected unknown input; use symbols and abstractions; communicate in natural language and learn from the environment. Voice input to computers offers a number of advantages. It provides a natural, fast, hands free, eyes free, location free input medium. However, there are many as yet unsolved problems that prevent routine use of speech as an input device by non-experts. These include cost, real time response, speaker independence, robustness to variations such as noise, microphone, speech rate and loudness, and the ability to handle non-grammatical speech. Satisfactory solutions to each of these problems can be expected within the next decade. Recognition of unrestricted spontaneous continuous speech appears unsolvable at present. However, by the addition of simple constraints, such as clarification dialog to resolve ambiguity, we believe it will be possible to develop systems capable of accepting very large vocabulary continuous speechdictation.

View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates; slow response time (minutes to hours) to instantaneous response time. These characteristics taken together increase the computational complexity of the problem by several orders of magnitude. Further, speech provides a challenging task domain which embodies many of the requirements of intelligent behavior: operate in real time; exploit vast amounts of knowledge, tolerate errorful, unexpected unknown input; use symbols and abstractions; communicate in natural language and learn from the environment. Voice input to computers offers a number of advantages. It provides a natural, fast, hands free, eyes free, location free input medium. However, there are many as yet unsolved problems that prevent routine use of speech as an input device by non-experts. These include cost, real time response, speaker independence, robustness to variations such as noise, microphone, speech rate and loudness, and the ability to handle non-grammatical speech. Satisfactory solutions to each of these problems can be expected within the next decade. Recognition of unrestricted spontaneous continuous speech appears unsolvable at present. However, by the addition of simple constraints, such as clarification dialog to resolve ambiguity, we believe it will be possible to develop systems capable of accepting very large vocabulary continuous speechdictation.

More books from Springer US

Cover of the book Diagnostic and Interventional Catheterization in Congenital Heart Disease by Kai-Fu Lee
Cover of the book Endometrial Cancer by Kai-Fu Lee
Cover of the book Efficacy, Agency, and Self-Esteem by Kai-Fu Lee
Cover of the book Performance Benchmarking by Kai-Fu Lee
Cover of the book Research on Alcoholism Treatment by Kai-Fu Lee
Cover of the book Midwives, Research and Childbirth by Kai-Fu Lee
Cover of the book In-Vehicle Corpus and Signal Processing for Driver Behavior by Kai-Fu Lee
Cover of the book Helicobacter pylori Infection and Immunity by Kai-Fu Lee
Cover of the book Predictability of corporate failure by Kai-Fu Lee
Cover of the book HACCP by Kai-Fu Lee
Cover of the book Process Control Systems by Kai-Fu Lee
Cover of the book Ventilator-Associated Pneumonia by Kai-Fu Lee
Cover of the book Information Hiding: Steganography and Watermarking-Attacks and Countermeasures by Kai-Fu Lee
Cover of the book Financial Ethics by Kai-Fu Lee
Cover of the book Theory and Research in Behavioral Pediatrics by Kai-Fu Lee
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy