News Column

Patent Issued for System and Method for the Secure, Real-Time, High Accuracy Conversion of General Quality Speech into Text

June 11, 2014



By a News Reporter-Staff News Editor at Telecommunications Weekly -- According to news reporting originating from Alexandria, Virginia, by VerticalNews journalists, a patent by the inventor Jaroker, Jon (New York, NY), filed on May 22, 2009, was published online on May 27, 2014.

The assignee for this patent, patent number 8738374, is j2 Global Communications, Inc. (Los Angeles, CA).

Reporters obtained the following quote from the background information supplied by the inventors: "Converting speech to text can be accomplished currently by several methods. Each method has different levels of accuracy, security, speed, tolerance of poor audio quality and price. Court reporters, or stenographers, for example, provide verbatim transcription but at a high price and with a time delay. Computer-based speech recognition is much less accurate, but is less expensive and instantaneous. Transcription of stored messages (such as voice mail) is more difficult for computer-based speech recognition technology to perform accurately due to poor audio quality. No current speech to text method provides the ideal combination of accuracy, security, speed and tolerance when transcribing general-quality speech.

"Linking the speech to text conversion process to an existing communications service, such as a telephone system, makes the conversion simpler to achieve. Local telephone companies offer Call Forward on Busy ('CFB'), Call Forward on No Answer ('CFNA'), Call Forwarding ('CF'), Distinctive Ring and other services. FIG. 1 shows a traditional phone system 1 which may offer the services described above. When a user of the traditional phone system 1 places a call, the system has an Automatic Number Identification ('ANI') service 10 that identifies the number from which the call has been placed. Similarly, the traditional phone system 1 has a Dialed Number Identification Service ('DNIS') service 20 which identifies the number that the caller dialed. This information is received by the local phone company 30 and the call is directed to the receiving phone which is termed a Plain Old Telephone Service ('POTS') device 40."

In addition to obtaining background information on this patent, VerticalNews editors also obtained the inventor's summary information for this patent: "A system, comprising an audio shredder receiving an audio segment, the audio segment being a portion of an audio stream, the audio shredder creating an audio shred from the audio segment, an audio mixer receiving the audio shred and randomizing the audio shred with other audio shreds from other audio streams and a plurality of transcribers, wherein one of the transcribers receives the audio shred and transcribes the audio shred into text.

"In addition, a method, comprising the steps of receiving an audio stream, filtering the audio stream to separate identifiable words in the audio stream from unidentifiable words, creating a word text file for the identifiable words and storing the word text file in a database, the word text file including word indexing information. Creating audio segments from the audio stream, the audio segments including portions of the audio stream having unidentifiable words, creating audio shreds from the audio segments, the audio shreds including audio shred indexing information to identify each of the audio shreds and storing the audio shred indexing information in the database. Mixing the audio shreds with other audio shreds from other audio streams, delivering the audio shreds to a plurality of transcribers, transcribing each of the audio shreds into a corresponding audio shred text file, the audio shred text file including the audio shred indexing information corresponding to the audio shred from which the audio shred text file was created and reassembling the audio shred text files and the word text files into a conversation text file corresponding to the audio stream.

"Furthermore, a system, comprising a service platform for receiving, processing and directing streaming audio and a user device connected to the service platform and configured to receive streaming audio from the service platform and transmit streaming audio to the service platform, the user device further configured to signal the service platform to begin a transcription of the streaming audio transmitted and received by the user device. The service platform including a filter receiving the streaming audio, identifying words within the streaming audio and creating a word text file corresponding to each of the identified words, the filter further creating audio segments from the streaming audio, the audio segments including portions of the audio stream having unidentifiable words, an audio shredder creating a plurality of audio shreds from each of the audio segments, an audio mixer randomizing the audio shreds with other audio shreds from other streaming audio, wherein the service platform delivers the randomized audio shreds to a plurality of transcribers which transcribe the audio shreds into audio shred text files corresponding to the audio shreds, and a reassembler creating a conversation text file corresponding to the streaming audio from the audio shred text files and the word text files.

"A system, comprising an audio stream element including information corresponding to an audio stream, the information including a begin time of the audio stream, an end time of the audio stream, a conversation identification of the audio stream and the audio stream file, a word element including information corresponding to a word identified in the audio stream by a speech recognition filter, the information including an identification of the audio stream from which the word was identified, a begin time of the word, an end time of the word, an audio file of the word and text corresponding to the word, an audio segment element including information corresponding to an audio segment of the audio stream, the audio segment being a portion of the audio stream without identifiable words, the information including the identification of the audio stream from which the audio segment originates, the begin time of the audio segment, the end time of the audio segment and the audio file of the audio segment, an audio shred element including information corresponding to an audio shred of the audio segment, the information including an identification of the audio segment from which the audio shred originates, the begin time of the audio shred, the end time of the audio shred and the audio file of the audio shred and a text token element including information corresponding to a textual representation of the audio shred, the information including an identification of the audio shred from which the textual representation originates and the textual representation. The information included in each of the audio stream element, the word element, the audio segment element, the audio shred element and the text token element is processed to generate a text transcription of the audio stream.

"A system for the management of a distributed workforce of speech to text transcribers, the testing and monitoring of these transcribers, and financial accounting system to pay these transcribers and set the equilibrium price at which demand for services matches supply.

"A system for the capture of spoken conversations or retrieval of stored audio that is then processed by the platform for conversion to text."

For more information, see this patent: Jaroker, Jon. System and Method for the Secure, Real-Time, High Accuracy Conversion of General Quality Speech into Text. U.S. Patent Number 8738374, filed May 22, 2009, and published online on May 27, 2014. Patent URL: http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&l=50&s1=8738374.PN.&OS=PN/8738374RS=PN/8738374

Keywords for this news article include: Telecommunications.

Our reports deliver fact-based news of research and discoveries from around the world. Copyright 2014, NewsRx LLC


For more stories covering the world of technology, please see HispanicBusiness' Tech Channel



Source: Telecommunications Weekly


Story Tools






HispanicBusiness.com Facebook Linkedin Twitter RSS Feed Email Alerts & Newsletters