Shop Sourbellies
Enjoy fast, free delivery, exclusive deals, and award-winning movies & TV shows.
Buy new:
-13% $35.00
FREE delivery Monday, March 16
Ships from: Amazon.com
Sold by: Amazon.com
$35.00 with 13 percent savings
List Price: $40.00 Image
FREE delivery Monday, March 16
Or Prime members get FREE delivery Friday, March 13. Order within 7 hrs 20 mins. Join Prime
In Stock
$$35.00 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$35.00
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Shipper / Seller
Amazon.com
Amazon.com
Shipper / Seller
Amazon.com
Returns
FREE 30-day refund/replacement
FREE 30-day refund/replacement
This item can be returned in its original condition for a full refund or replacement within 30 days of receipt.
Read full return policy
Payment
Secure transaction
Your transaction is secure
We work hard to protect your security and privacy. Our payment security system encrypts your information during transmission. We don’t share your credit card details with third-party sellers, and we don’t sell your information to others. Learn more
$15.99
Connecting readers with great books since 1972! Used books may not include companion materials, and may have some shelf wear or limited writing. We ship orders daily and Customer Service is our top priority! Connecting readers with great books since 1972! Used books may not include companion materials, and may have some shelf wear or limited writing. We ship orders daily and Customer Service is our top priority! See less
$4.99 delivery March 20 - 25. Details
Or fastest delivery March 18 - 20. Details
Only 1 left in stock - order soon.
$$35.00 () Includes selected options. Includes initial monthly payment and selected options. Details
Price
Subtotal
$$35.00
Subtotal
Initial payment breakdown
Shipping cost, delivery date, and order total (including tax) shown at checkout.
Access codes and supplements are not guaranteed with used items.
Ships from and sold by HPB Inc..
Added to

Sorry, there was a problem.

There was an error retrieving your Wish Lists. Please try again.

Sorry, there was a problem.

List unavailable.
Kindle app logo image

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet, or computer - no Kindle device required.

Read instantly on your browser with Kindle for Web.

Using your mobile phone camera - scan the code below and download the Kindle app.

QR code to download the Kindle App

  • The Voice in the Machine: Building Computers That Understand Speech (Mit Press)

Follow the author

Get new release updates & improved recommendations
Something went wrong. Please try your request again later.

The Voice in the Machine: Building Computers That Understand Speech (Mit Press) Paperback – March 23, 2012

4.8 out of 5 stars (11)

{"desktop_buybox_group_1":[{"displayPrice":"$35.00","priceAmount":35.00,"currencySymbol":"$","integerValue":"35","decimalSeparator":".","fractionalValue":"00","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"cmACtoMM6ahNjDBYv3RNdOHBEACVVGYK6qIjbrN%2BlVj9aG1K5lJlB9Z4QOqIP2lLV%2FR9W6GYTVXm2RQxTtNPYt2bFjSTEado0MgTpUJ8lP4KUsP6l%2Bw5ISWO7kDfhF%2FWkADeMVF3YAH7bcmS193Fiw%3D%3D","locale":"en-US","buyingOptionType":"NEW","aapiBuyingOptionIndex":0}, {"displayPrice":"$15.99","priceAmount":15.99,"currencySymbol":"$","integerValue":"15","decimalSeparator":".","fractionalValue":"99","symbolPosition":"left","hasSpace":false,"showFractionalPartIfEmpty":true,"offerListingId":"cmACtoMM6ahNjDBYv3RNdOHBEACVVGYK4%2BZ5d8%2BivRKbnAmT1gHbXWLIj9uOl6wDUqdA6Ywbqai%2FxWOTZ2x5dk8APvF34U3NrLhL3shO5Kiv9Uq5uh1uftkQl9zSDTcComj5Z7IeZfmMO7j5BIxpDE86NGsgeXlV5LulpYkKz6lDkmy18Top8g%3D%3D","locale":"en-US","buyingOptionType":"USED","aapiBuyingOptionIndex":1}]}

Purchase options and add-ons

An examination of more than sixty years of successes and failures in developing technologies that allow computers to understand human spoken language.

Stanley Kubrick's 1968 film 2001: A Space Odyssey famously featured HAL, a computer with the ability to hold lengthy conversations with his fellow space travelers. More than forty years later, we have advanced computer technology that Kubrick never imagined, but we do not have computers that talk and understand speech as HAL did. Is it a failure of our technology that we have not gotten much further than an automated voice that tells us to “say or press 1”? Or is there something fundamental in human language and speech that we do not yet understand deeply enough to be able to replicate in a computer? In The Voice in the Machine, Roberto Pieraccini examines six decades of work in science and technology to develop computers that can interact with humans using speech and the industry that has arisen around the quest for these technologies. He shows that although the computers today that understand speech may not have HAL's capacity for conversation, they have capabilities that make them usable in many applications today and are on a fast track of improvement and innovation.

Pieraccini describes the evolution of speech recognition and speech understanding processes from waveform methods to artificial intelligence approaches to statistical learning and modeling of human speech based on a rigorous mathematical model—specifically, Hidden Markov Models (HMM). He details the development of dialog systems, the ability to produce speech, and the process of bringing talking machines to the market. Finally, he asks a question that only the future can answer: will we end up with HAL-like computers or something completely unexpected?

The%20Amazon%20Book%20Review
The Amazon Book Review
Book recommendations, author interviews, editors' picks, and more. Read it now

Editorial Reviews

Review

This is a fascinating tour of the development of modern speech technologies and applications…A wonderful historical account of the growth of speech technology.—Choice

About the Author

Roberto Pieraccini, Director of ICSI, the International Computer Science Institute in Berkeley, California, has been active for more than thirty years in speech research and technology.

Product details

  • Publisher ‏ : ‎ The MIT Press
  • Publication date ‏ : ‎ March 23, 2012
  • Language ‏ : ‎ English
  • Print length ‏ : ‎ 354 pages
  • ISBN-10 ‏ : ‎ 0262533294
  • ISBN-13 ‏ : ‎ 978-0262533294
  • Item Weight ‏ : ‎ 1.3 pounds
  • Dimensions ‏ : ‎ 7 x 0.8 x 9 inches
  • Best Sellers Rank: #5,159,662 in Books (See Top 100 in Books)
  • Customer Reviews:
    4.8 out of 5 stars (11)

About the author

Follow authors to get new release updates, plus improved recommendations.
Roberto Pieraccini
Brief content visible, double tap to read full content.
Full content visible, double tap to read brief content.

Since March 2018 I am a director of engineering at Google in Zurich, Switzerland.

I have been in the speech technology research and business for more than 30 years. Prior to joining Google, I led a team that build the conversational capability of Jibo, a startup aiming at the commercialization of the first consumer social robot. In 2012 I was the director of the International Computer Science Institute (ICSI) in Berkeley, CA, an independent research institution affiliated with the University of California at Berkeley. Before that I was the Chief Technology Officer of SpeechCycle, a company specialized in advanced spoken human-machine interaction systems for enterprise customer care (yes, those annoying "please tell me the reason you are calling about" computers that prevent you to talk to human operators when you need them). Trying to make those annoying computers better, I led an effort to develop new technology that tried to make those computers learn from their own mistakes and improve the quality of the interactions with customers.

Before SpeechCycle, between 2003 and 2005, I managed a speech research team at IBM T.J. Watson Research, in Yorktown Heights, NY, and prior to that, between 1999 and 2003, I was at SpeechWorks International, which is now known as Nuance, today's largest worldwide computer speech company.

The turning point in my computer speech research career was when, in 1988, I joined AT&T Bell Laboratories (later known as AT&T Laboratories). There I worked with some of the most influential scientists in computer speech, such as Larry Rabiner and Bishnu Atal. I arrived at Bell Laboratories from Italy, where in the 1980s I was a researcher at CSELT, the laboratories of the national Italian telephone company.

During all this time I wrote, as an author or co-author, about 150 scientific papers and articles in the fields of speech recognition, spoken language understanding and dialog, multimodal interaction, and machine learning. I am best known for my original contributions to statistical methods for spoken language understanding and reinforcement learning for spoken dialog systems.

My first book, "The Voice in the Machine", published by MIT Press in 2012, narrates the story of 60 years of computer speech technology evolution in a way that is accessible to general scientific readers. My second book, "AI Assistants", published by MIT Press in 2021, still for a general audience of readers, looks at the recent development of human-machine voice interaction after Siri, Alexa, and Google Assistant were introduced and new technologies, such as Deep learning, dramatically changed the way computers recognize and understand human speech.

Sponsored

Customer reviews

4.8 out of 5 stars
11 global ratings
Sponsored

Top reviews from the United States

  • Reviewed in the United States on March 21, 2015
    Format: HardcoverVerified Purchase
    It's a great book
  • Reviewed in the United States on August 1, 2012
    Format: Hardcover
    I enjoyed reading this book! It is a comprehensive description of the evolution of the speech technologies focused on the major results of research and the changes of directions that the technology had in the last decades. The last chapter is about the advent of Siri and what will happen in the next future. Reading the book you will encounters many and many protagonists with their anecdotes, ideas and achievements.

    I see two main categories of people that might gain great advantage by reading this book. The first are those not involved in the evolution of speech technologies, the second are the insiders, who were involved either in research or at any level, even non technical, in the speech industry. For the former the book explains how a complex technology evolves in reality with all the roadblocks, turns, and steep paths while the author puts all his effort in explaining very complex engineering problems without formulas or technicalities, but using simple and enlightening analogies and examples. The book will help them to understand what is behind Siri, Google Voice, or every other speaking machine. For the latter, the professionals of the voice science and industry, it is very interesting to see how the author assembles a map of the past and current technology, the motivations and the forces behind it, and shows how all the pieces fit together in a technological landscape of the area in which they are currently engaged. For them it is like stepping out for a minute to gain a vantage point perspective and different points of view.

    I belong to the second category because I spent 20 years in R&D in the research lab in Italy where Roberto Pieraccini moves his first steps and then I was deep involved in the newborn speech industry.

    A last little advice is for the readers who would like to move from the author's examples to more technical readings. I found the Notes section very interesting, like a book inside the book. You might read it from the top to the bottom and you will find there some formulas, pointers to literature and complementary thoughts.

    Now, I'll eagerly wait a continuation from Roberto Pieraccini to look forward instead of backward, but I strongly suggest to read this marvelous book now.
    2 people found this helpful
    Report
  • Reviewed in the United States on May 24, 2015
    A very good high-level overview. The earlier parts of the book provide more implementation details than the later parts, which tend to gloss over the details in favor of recounting history.
  • Reviewed in the United States on October 4, 2013
    Format: Hardcover
    Roberto Pieraccini's The Voice in the Machine is a phenomenal read. I found myself enjoying every single page. The writing is clear, precise, personal, folksy, with entertaining anecdotes. As I got closer to the end of the book, I became sadder and sadder, realizing that the time when I would be entertained and educated by Roberto was drawing to a close.

    Mostly about developments in the speech recognition field (for completeness, Pieraccini has one chapter on Text-to-Speech), it's a very well-written, comprehensive survey of the history and current developments in speech technology.

    It covers everything from the earliest attempts, through all the government-sponsored ARPA speech recognition challenges, to recent commercial deployments. The book would well serve as a reference for a college course or just for leisure reading: it's the best example I've ever seen of a book that explains concepts behind complex math, intuitively, without using a single equation. Roberto's writing style could almost be called poetic. It definitely conveys the passion behind the science. You must get this!
    4 people found this helpful
    Report
  • Reviewed in the United States on November 19, 2012
    Format: Hardcover
    This book deserves 6, 7, 8 stars. It takes a technical subject and does a really good job of showing the essence of the issues involved.
    Be aware that the target audience is NEITHER
    - people who already understand computer speech technology (unless perhaps they want to learn some history) OR
    - the intellectually lazy. This is a difficult subject, and to get the most out of it, you will occasionally have to close the book and think about what you have just read.

    But assuming you are in this target audience (you're an engineer in another field, a physicist, an astronomer, basically someone curious about the world around you) and want to learn the basic history, ideas, successes, and failures of computer speech understanding, I have never come across a book close to as good as this.

    I only wish there were a comparable book in similar fields like computer vision, or computer translation.
    One person found this helpful
    Report
  • Reviewed in the United States on June 15, 2012
    Format: Hardcover
    As someone who has been in the speech industry for quite some time, I can tell you this book is a terrific starting point for business people and students alike. Pierracini's great anecdotes are what makes this so enjoyable. Whether it's HAL 9000 or Victor Hugo he is employing to convey his point, the author makes learning enjoyable.
  • Reviewed in the United States on June 9, 2012
    Format: Hardcover
    Having just completed a course in NLP, I was looking for an introduction to speech processing in order to prep for more advanced reading on the subject. Pieraccini's book was just what I needed.

    The author starts out by describing in convincing detail why human speech is so complex and difficult to understand, and to recreate in a lab or a commercial setting. He then goes on to describe early attempts inspired by AI, eventually arriving at statistical approaches that are the basis of most modern speech processing systems.

    I like the book in its broad coverage, and while I do realize that the book is not aimed at techies, I'd have appreciated a little more coverage of HMMs and EM.

    At a handful of places, there are some editing oversights that are simply disappointing for a book from a writer of this caliber (Ch. 5: "...De Mori, who pursued a brilliant carrier first at McGill..." -- career, not carrier).

    Nonetheless, the book is a good read for someone interested in this technology.
    2 people found this helpful
    Report

Top reviews from other countries

  • RODI
    5.0 out of 5 stars Excellent reading , gets inside the abilities of machine recognition and the basics behind the technology
    Reviewed in the United Kingdom on February 16, 2013
    Format: HardcoverVerified Purchase
    Some basic knowledge of speech recognition principles to get the maximum from the book.
    However, even a novice will learn much from the excellent and structured presentation of such a complex technology.