What is "OKAY GOOGLE?
It is Voice search is a speech recognition technology that allows users to search by saying terms aloud rather than typing them into a search field. The proliferation of smartphones and other small, Web-enabled mobile devices has spurred interest in voice search. Applications of voice search include: Making search engine queries. Clarifying specifics of the request. Requesting specific information, such as a stock quote or sports score. Launching programs and selecting options. Searching for content in audio or video files. Voice dialing. There are many services as such from the likes of Googles Assistant embedded into every android phone (Okay Google) to that of Apple’s SIRI. Even Amazon has jumped on the bandwagon with the introduction of Alexa, a smart home assistant activated by voice search. How Speech OKAY GOOGLE Works? To convert speech to on-screen text or a computer command, a computer has to go through several complex steps. When you speak, you create vibrations in the air. The analog-to-digital converter (ADC) translates this analog wave into digital data that the computer can understand. To do this, it samples or digitizes, the sound by taking precise measurements of the wave at frequent intervals. The system filters the digitized sound to remove unwanted noise, and sometimes to separate it into different bands of frequency (frequency is the wavelength of the sound waves, heard by humans as differences in pitch). It also normalizes the sound or adjusts it to a constant volume level. It may also have to be temporally aligned. People don't always speak at the same speed, so the sound must be adjusted to match the speed of the template sound samples already stored in the system's memory. Next, the signal is divided into small segments as short as a few hundredths of a second, or even thousandths in the case of plosive consonant sounds -- consonant stops produced by obstructing airflow in the vocal track -- like "p" or "t." The program then matches these segments to known phonemes in the appropriate language. A phoneme is the smallest element of a language -- a representation of the sounds we make and put together to form meaningful expressions. There are roughly 40 phonemes in the English language (different linguists have different opinions on the exact number), while other languages have more or fewer phonemes.
0 Comments
Leave a Reply. |
Archives
January 2019
Categories |