<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>insideCTI &#187; speech</title>
	<atom:link href="http://insidecti.com/wordpress/tag/speech/feed/" rel="self" type="application/rss+xml" />
	<link>http://insidecti.com/wordpress</link>
	<description>Things could get ugly when computing and telecom collide.</description>
	<lastBuildDate>Fri, 03 Feb 2012 03:47:48 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Nuance to acquire Vlingo, may run out of competitors to buy (sue) in 2012</title>
		<link>http://insidecti.com/wordpress/news/nuance-to-acquire-vlingo-may-run-out-of-competitors-to-buy-in-2012/</link>
		<comments>http://insidecti.com/wordpress/news/nuance-to-acquire-vlingo-may-run-out-of-competitors-to-buy-in-2012/#comments</comments>
		<pubDate>Tue, 20 Dec 2011 15:52:32 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[acquisition]]></category>
		<category><![CDATA[nuance]]></category>
		<category><![CDATA[speech]]></category>
		<category><![CDATA[vlingo]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4296</guid>
		<description><![CDATA[Major breaking news in speech tech: Nuance announced today that it will scoop up Vlingo. Oh, and happy Hanukkah (not in the press release). Supposedly Vlingo was the other company that Apple was evaluating when developing Siri on the iPhone 4S. Here&#8217;s a bit of backstory between Vlingo and Nuance. It&#8217;s not pretty, but now [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Major breaking news in speech tech: Nuance announced today that it will scoop up Vlingo. Oh, and happy Hanukkah (not in the press release).</p>
<p>Supposedly Vlingo was the <a href="http://insidecti.com/wordpress/internet/siri-did-apple-buy-nuances-silence/">other company</a> that Apple was evaluating when developing Siri on the iPhone 4S. Here&#8217;s a bit of <a href="http://insidecti.com/wordpress/news/nuance-nuisance-to-some-competitors/">backstory</a> between Vlingo and Nuance. It&#8217;s not pretty, but now it&#8217;s over. TechCrunch also <a href="http://techcrunch.com/2011/12/20/after-years-of-patent-litigation-nuance-acquires-vlingo/">chimes in</a>.</p>
<p>This would be the gazillionth company that Nuance has acquired in recent years. In fact, there are hardly any other competitors left, so I predict that the speech giant may enter a new market soon. I&#8217;m thinking a wearable device for pets which will translate barks and meows into synthesized human speech. It&#8217;ll need a lot of animal test subjects in such case, therefore don&#8217;t be surprised if it starts buying pet stores (chains and mom-and-pops) as well as operate zoos.</p>
<p>General consumers and techies alike have raised concerns over AT&amp;T buying T-Mobile (deal is now off) for fear of an anti-competitive carrier marketplace. Yet nobody thinks about the speech marketplace whenever they ask Siri to perform a task.</p>
<p>The <a href="http://www.nuance.com/company/news-room/press-releases/vlwebrelease.doc">press release</a>:</p>
<blockquote><p>Burlington, Mass. – December 20, 2011 – Nuance Communications, Inc. (NASDAQ: NUAN) announced it has signed an agreement to acquire Vlingo, Inc. Fueled by unprecedented demand for intelligent voice interfaces that combine voice, language understanding and semantic processing, Nuance and Vlingo will combine their deep innovation and R&amp;D expertise to deliver next-generation natural language interfaces across numerous markets and industries.</p>
<p>Consumer interest and demand for virtual assistant and voice-enabled capabilities have exploded in recent months, creating a $5 billion market opportunity that spans phones, tablets, cars, televisions, navigation devices, music players, PCs and more. Both Nuance and Vlingo see an unprecedented appetite for intelligent devices that understand the spoken word and deliver outcomes for consumers and professionals.</p>
<p>“Inspired by the introduction of services such as Apple’s Siri and our own Dragon Go!, virtually every mobile and consumer electronics company on the planet is looking for ways to integrate natural, conversational voice interactions into their mobile products, applications, and services,” said Mike Thompson, Senior Vice President and General Manager, Nuance Mobile.  “By acquiring Vlingo, we are able to accelerate the pace of innovation to meet this demand.”</p>
<p>“Vlingo and Nuance have long shared a similar vision for the power and global proliferation of mobile voice and language understanding. As a result of our complementary research and development efforts, our companies are stronger together than alone. Our combined resources afford us the opportunity to better compete, and offer a powerful proposition to customers, partners and developers,” said Dave Grannan, CEO, Vlingo.</p>
<p>By harnessing the combined expertise in voice, language and multilingual capabilities, Nuance will be able to take advantage of the adoption of intelligent mobile assistants, where consumers, businesses, doctors and patients can engage in more human, natural interactions with devices and systems all over the world.</p>
<h3 id="pressReleases"></h3>
<p>Vlingo is a Virtual Assistant that turns your words into action by combining voice to text technology, natural language processing, and Vlingo’s Intent Engine to understand the user’s intent and take the appropriate action. Founded in 2006, Vlingo is backed by Charles River Ventures, Sigma Partners, Yahoo! and AT&amp;T and headquartered in Cambridge, Massachusetts. For more information, go to <a href="http://www.vlingo.com/" target="_blank">www.vlingo.com</a>.</p>
<h3 id="pressReleases"></h3>
<p>Nuance is a leading provider of voice and language solutions for businesses and consumers around the world. Its technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses experience Nuance’s proven applications and professional services. For more information, please visit: <a href="http://www.nuance.com/" target="_blank">nuance.com</a>.</p>
<p><em>Nuance and the Nuance logo are trademarks or registered trademarks of Nuance Communications, Inc. or its subsidiaries in the United States of America and/or other countries. All other company names or product names may be the trademarks of their respective owners.</em></p>
<p><em>Statements in this press release regarding the proposed transaction between Nuance and Vlingo, the market opportunity, the pace of innovation, increased customer demand in the mobile market, future product offerings by the combined company, and any other statements about Nuance managements’ future expectations, beliefs, goals, plans or prospects constitute forward looking statements within the meaning of the Private Securities Litigation Reform Act of 1995. Any statements that are not statements of historical fact (including statements containing the words &#8220;believes,&#8221; &#8220;plans,&#8221; &#8220;anticipates,&#8221; &#8220;expects,&#8221; estimates and similar expressions) should also be considered to be forward looking statements. There are a number of important factors that could cause actual results or events to differ materially from those indicated by such forward looking statements, including the ability to consummate the transaction; the ability of Nuance to successfully integrate Vlingo’s operations and employees; the ability to realize anticipated synergies and cost savings; the failure to retain customers; and the other factors described in the ability of Nuance to integrate the product offerings of the combined companies and other the factors described in Nuance&#8217;s Annual Report on Form 10 K for the fiscal year ended September 30, 2011 and other filings with the U.S. Securities and Exchange Commission. Nuance disclaims any intention or obligation to update any forward looking statements as a result of developments occurring after the date of this press release.</em></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/news/nuance-to-acquire-vlingo-may-run-out-of-competitors-to-buy-in-2012/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Microsoft Tellme talks itself into embarrassment against Siri</title>
		<link>http://insidecti.com/wordpress/implementation/microsoft-tellme-talks-itself-into-embarrassment-against-siri/</link>
		<comments>http://insidecti.com/wordpress/implementation/microsoft-tellme-talks-itself-into-embarrassment-against-siri/#comments</comments>
		<pubDate>Tue, 29 Nov 2011 12:56:33 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[Implementation]]></category>
		<category><![CDATA[craig mundie]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[speech]]></category>
		<category><![CDATA[tellme]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4268</guid>
		<description><![CDATA[The Xbox 360 is great. And with Kinect it&#8217;s phenomenal. But for Microsoft Chief Research and Strategy Officer Craig Mundie to dismiss Apple Siri and brag about Tellme in Windows smartphones? Utter embarrassment, especially coming from an executive who&#8217;s supposed to be the technology visionary of the company. (Heck, we all know Ballmer isn&#8217;t the guy.) [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>The Xbox 360 is great. And with Kinect it&#8217;s phenomenal. But for Microsoft Chief Research and Strategy Officer Craig Mundie to dismiss Apple Siri and brag about Tellme in Windows smartphones?</p>
<p><iframe width="500" height="281" src="http://www.youtube.com/embed/8akOCfqe-v0?fs=1&#038;feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>Utter embarrassment, especially coming from an executive who&#8217;s supposed to be the technology visionary of the company. (Heck, we all know Ballmer isn&#8217;t the guy.)</p>
<p>Tellme was rumored to have cost $800 million in 2007, but has the technology been <a href="http://insidecti.com/wordpress/development/what-tellme-microsoft-is-up-to/">stuck in the past</a>?</p>
<p>It&#8217;s now evident that the software giant does not consider speech technology important and will likely pay a price for it, much like when it played catch-up during the Web boom. While Microsoft wasn&#8217;t looking, Nuance had scooped up dozens of other companies and partnered with Apple to give Siri a voice in the cloud.</p>
<p>And now the blogosphere is abuzz with this:</p>
<p><iframe width="500" height="281" src="http://www.youtube.com/embed/SHoukZpMhDE?fs=1&#038;feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>What does Tellme tell you?</p>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/implementation/microsoft-tellme-talks-itself-into-embarrassment-against-siri/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hacking Siri into bits (and bytes)</title>
		<link>http://insidecti.com/wordpress/implementation/hacking-siri-into-bits-and-bytes/</link>
		<comments>http://insidecti.com/wordpress/implementation/hacking-siri-into-bits-and-bytes/#comments</comments>
		<pubDate>Wed, 16 Nov 2011 01:18:22 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[Implementation]]></category>
		<category><![CDATA[apple]]></category>
		<category><![CDATA[iphone]]></category>
		<category><![CDATA[nuance]]></category>
		<category><![CDATA[siri]]></category>
		<category><![CDATA[speech]]></category>
		<category><![CDATA[speex]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4257</guid>
		<description><![CDATA[Besides the significant hardware upgrades, the other selling point of the Apple iPhone 4S is Siri, the speech-enabled assistant that can help you schedule appointments, set reminders, read and compose SMS, and many other tasks. The technology is essentially a massive (considering how many iPhone 4S users are there in the U.S.) cloud-based speech recognition [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Besides the significant hardware upgrades, the other selling point of the Apple iPhone 4S is Siri, the speech-enabled assistant that can help you schedule appointments, set reminders, read and compose SMS, and many other tasks. The technology is essentially a massive (considering how many iPhone 4S users are there in the U.S.) cloud-based speech recognition service <a href="http://insidecti.com/wordpress/development/apple-nuance-deal-could-usher-in-new-era-of-smart-customer-service/">reportedly provided by Nuance software</a>.</p>
<p>Unfortunately Apple decided to make Siri only available on the iPhone 4S even though various resourceful hackers have proven that it can be run on the older 3GS and 4 models. Even the iPad 2 didn&#8217;t get Siri although its hardware is similar with the 4S.</p>
<p>Well, the folks at Applidium have cracked Siri&#8217;s protocol and published very informative technical details. Among them:</p>
<ul>
<li>Siri uses TCP port 443, over HTTPS (Secure HTTP)</li>
<li>The request is a custom &#8220;ACE&#8221; method (as opposed to commonly used &#8220;GET&#8221;)</li>
<li>Raw audio data is sent but compressed using <a href="http://www.speex.org/">Speex</a> codec</li>
<li>An iPhone 4S identifier is required with Siri</li>
</ul>
<div>Don&#8217;t be shy, <a href="http://applidium.com/en/news/cracking_siri/">read the whole thing</a> to get intimate with Siri.</div>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/implementation/hacking-siri-into-bits-and-bytes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance starts new developer program, opens floodgates for more voice apps</title>
		<link>http://insidecti.com/wordpress/development/nuance-starts-new-developer-program-opens-floodgates-for-more-voice-apps/</link>
		<comments>http://insidecti.com/wordpress/development/nuance-starts-new-developer-program-opens-floodgates-for-more-voice-apps/#comments</comments>
		<pubDate>Wed, 28 Sep 2011 11:09:43 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[android]]></category>
		<category><![CDATA[apple]]></category>
		<category><![CDATA[ios]]></category>
		<category><![CDATA[ndev mobile]]></category>
		<category><![CDATA[nuance]]></category>
		<category><![CDATA[speech]]></category>
		<category><![CDATA[windows phone 7]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4233</guid>
		<description><![CDATA[Is it game over for other speech tech companies? Nuance is truly the dominating force in this area and with a partner like Apple, it&#8217;s reminiscent of the famed Wintel alliance that still exists today&#8230; Today the company announced its NDEV Mobile program which offers its popular Dragon SDK to multiple platforms: Web-based, iOS, Android, [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Is it game over for other speech tech companies?</p>
<p>Nuance is truly the dominating force in this area and <a href="http://insidecti.com/wordpress/development/apple-nuance-deal-could-usher-in-new-era-of-smart-customer-service/">with a partner like Apple</a>, it&#8217;s reminiscent of the famed Wintel alliance that still exists today&#8230;</p>
<p>Today the company announced its NDEV Mobile program which offers its popular Dragon SDK to multiple platforms: Web-based, iOS, Android, and Windows Phone 7. Even the lowest tiered NDEV Silver membership will give a developer plenty of access to Nuance&#8217;s popular speech technologies.</p>
<p>And NDEV Silver is <em>free</em>. Yup, get ready to see many, many occurrences of people talking <em>to</em> their phones in the coming months&#8230;</p>
<p>See Nuance&#8217;s <a href="http://nuance.com/company/news-room/press-releases/ndev.doc">press release</a>:</p>
<blockquote><p><strong>Burlington, Mass. – September 27, 2011</strong> – Nuance Communications, Inc. (NASDAQ: NUAN) today announced that its Nuance Mobile Developer Program, now <a href="http://dragonmobile.nuancemobiledeveloper.com/public/index.php" target="_blank"><span style="text-decoration: underline;">NDEV Mobile</span></a>, features new services that make it easier than ever for developers to voice-enable apps with Nuance’s industry leading speech technology. NDEV Mobile now features three new service tiers – Silver, Gold, and Emerald – that give developers more choice and flexibility in how they bring Dragon-powered innovations to market. The Dragon Mobile SDK made available through the NDEV Mobile program now also supports Windows Phone 7 in addition to Android and iOS, as well as an HTTP web services interface, and, brings developers eight new languages for voice recognition.</p>
<p>NDEV Mobile has already experienced great success since its launch in January, 2011, with more than 4,000 iOS and Android developers signing on to access the Dragon Mobile SDK across the US, Europe, Asia, and Australia. This success has led to demand for even more flexibility and support in bringing voice-enabled applications to market. As such, NDEV Mobile now features three new tiers of service to better meet the diverse needs of the broader mobile developer community:</p>
<ul>
<li>NDEV Silver: NDEV Silver Services provide Android, Windows Phone 7 and iOS app developers with the ability to deploy their apps for free with the accuracy and reliability that only Nuance can provide. Developers will also have free access to Nuance’s connected text-to-speech (TTS) capabilities in over 30 languages, bringing natural sounding read back of text in the cloud. Further, NDEV Silver members get access to Bluetooth use cases and hands-free voice applications.NDEV Silver Services members have access to the world’s highest quality speech recognition technology for both general dictation and search. Additionally, the Dragon Mobile Developer Center provides a centralized resource for speech information and expertise including on-line forums, code samples, and full documentation.</li>
<li>NDEV Gold: NDEV Gold Services offer access to SSL data encryption, more deployment flexibility and enhanced support. In addition to iOS, Android, and Windows Phone 7, Gold Services members can take advantage of Nuance’s HTTP web service interface, which extends the power of Dragon to any mobile device while giving developers greater control over the user interface.NDEV Gold Services members’ applications can take advantage of SSL for encrypted data transmission, as well as open access to analytics tools that will help them maximize the impact and effectiveness of the voice experience their app delivers. Further, Gold Services members have access to one-on-one Nuance assistance in addition to the NDEV Mobile online community resources.</li>
<li>NDEV Emerald: NDEV Emerald Services are designed to meet the unique needs of NDEV Mobile community members looking for a truly differentiated speech experience on their apps. NDEV Emerald members have an array of customization and service options that bring Nuance’s extensive speech expertise and service capabilities to a diverse developer base – ranging from start-ups to wireless carriers, OEMs, and large enterprises. Emerald Services include custom vocabularies and grammars, user experience consulting, speech integration, advanced text-to-speech, high-availability deployment, and unmatched Nuance support – pre and post deployment, including dedicated account management resources.</li>
</ul>
<p>Available to all NDEV Silver, Gold, and Emerald members, are eight new speech-to-text languages supported by the Dragon Mobile SDK – Canadian French, Americas Spanish, Dutch, Norwegian, Swedish, Korean, Taiwan Mandarin, and Cantonese – in addition to already supporting US, Australian and UK English, European Spanish, European French, German, Italian, Mandarin Chinese, and Japanese for dictation and search. Additional languages will continue to be added through the end of the year, offering developers broader geographic expansion and availability of their innovative voice-enabled apps across the globe.</p>
<p>“The success of our NDEV Mobile developer program has proven that our market-leading voice capabilities drive incredible value for developers across a variety of markets,” said Michael Thompson, senior vice president and general manager, Nuance Mobile. “Our new multi-tiered services options deliver developers the options and services they’ve been asking for. It’s never been easier to deliver the power of voice-enabled apps to the market, tailored and customized to meet the needs of each app’s unique consumer base.”</p>
<p>NDEV Mobile has voice-enabled some of the market’s most popular apps, including Siri, Price Check by Amazon, Ask for iPhone, Merriam-Webster, Dictionary.com, RemoteLink from OnStar, SpeechTrans, Yellow Pages and AirYell from Avantar, iTranslate, Taskmind, SayHi Translate, Vocre, Bon’App, and many others.</p>
<h3 id="pressReleases"> <span class="Apple-style-span" style="font-size: 13px; font-weight: normal;">The Dragon Mobile SDK is available for iOS 4.0 (iPhone/iPad/iPod touch), Android 2.1 and higher, Windows Phone 7.1 and via an HTTP web services interface. For more information about the NDEV Silver, Gold, and Emerald Services, please visit the NDEV Mobile developer portal at dragonmobile.nuancemobiledeveloper.com</span></h3>
<p>Dragon has long been a leading speech recognition application on the desktop, used by millions of people around the world. Nuance has launched an array of Dragon applications across a variety of mobile platforms including iOS, Android, BlackBerry, Windows Phone 7, and more.</p>
<p>To learn more, visit <a href="http://www.nuancemobilelife.com/" target="_blank"><span style="text-decoration: underline;">www.nuancemobilelife.com</span></a>.</p>
<p>Nuance Communications, Inc. (NASDAQ: NUAN) is a leading provider of voice and language solutions for businesses and consumers around the world.  Its technologies, applications and services make the user experience more compelling by transforming the way people interact with devices and systems. Every day, millions of users and thousands of businesses experience Nuance’s proven applications.  For more information, please visit<a href="http://www.nuance.com/" target="_blank"><span style="text-decoration: underline;">www.nuance.com</span></a>.</p>
<p><em>Nuance and the Nuance logo are trademarks or registered trademarks of Nuance Communications, Inc. or its subsidiaries in the United States of America and/or other countries. All other company names or product names may be the trademarks of their respective owners.</em></p>
<p><em>The statements in this press release relating to future plans, events or services, are forward-looking statements which are subject to specific risks and uncertainties. There are a number of factors which could cause actual events or results to differ materially from those indicated in such forward looking statements, including fluctuations in demand for the Nuance products, and the continued development of Nuance products. The reader is warned not to rely on these forward-looking statements without reservation, since these are simply reflections of the current situation. Nuance disclaims any obligation to update any forward-looking statements as a result of developments occurring after the date of this document.</em></p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/development/nuance-starts-new-developer-program-opens-floodgates-for-more-voice-apps/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Microsoft presents speech recognition breakthrough</title>
		<link>http://insidecti.com/wordpress/news/microsoft-presents-speech-recognition-breakthrough/</link>
		<comments>http://insidecti.com/wordpress/news/microsoft-presents-speech-recognition-breakthrough/#comments</comments>
		<pubDate>Tue, 30 Aug 2011 23:59:21 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[international speech communication association]]></category>
		<category><![CDATA[interspeech]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[neural network]]></category>
		<category><![CDATA[speech]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4197</guid>
		<description><![CDATA[Have you ever been to Interspeech, the annual Conference of the International Speech Communication Association, held in beautiful Florence, Italy? And you call yourself a hardcore speech technology nut?! Tell me if these PDFs make sense to you. Well, it&#8217;s good to know that while some companies are busy buying up smaller competitors, there are [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Have you ever been to <a href="http://interspeech2011.org/">Interspeech</a>, the annual Conference of the International Speech Communication Association, held in beautiful Florence, Italy?</p>
<p>And you call yourself a hardcore speech technology nut?! Tell me if these PDFs <a href="http://research.microsoft.com/pubs/144412/DBN4LVCSR-TransASLP.pdf">make</a> <a href="http://research.microsoft.com/pubs/153169/CD-DNN-HMM-SWB-Interspeech2011-Pub.pdf">sense</a> to you.</p>
<p>Well, it&#8217;s good to know that while some companies are busy buying up smaller competitors, there are brainiacs all over the world who actually fawn over speech recognition. Let&#8217;s thank Microsoft for pouring <em>billions</em> of dollars into its Research arms (yes, it has R&amp;D facilities around the globe) so that we may one day face a Terminator who won&#8217;t mix up &#8220;Go fetch me a beer!&#8221; with &#8220;Gopher meets a deer!&#8221; and blow us into pieces because it thinks we are too dumb.</p>
<p><a href="http://research.microsoft.com/en-us/news/features/speechrecognition-082911.aspx">Microsoft researchers</a> are improving large vocabulary speech recognition by enhancing neural network models of &#8220;senones&#8221; (so cutting edge that even Wikipedia doesn&#8217;t offer an explanation):</p>
<blockquote><p>Earlier work on DNNs had used phonemes. The research took a leap forward when Yu, after discussions with principal researcher <a href="http://research.microsoft.com/en-us/people/deng/">Li Deng</a> and <a href="http://research.microsoft.com/en-us/people/alexac/">Alex Acero</a>, principal researcher and manager of the Speech group, proposed modeling the thousands of senones, much smaller acoustic-model building blocks, directly with DNNs. The resulting paper<em>, <a href="http://research.microsoft.com/pubs/144412/DBN4LVCSR-TransASLP.pdf">Context-Dependent Pre-trained Deep Neural Networks for Large Vocabulary Speech Recognition</a></em> by Dahl, Yu, Deng, and Acero, describes the first hybrid context-dependent DNN-HMM (CD-DNN-HMM) model applied successfully to large-vocabulary speech-recognition problems.</p>
<p>“Others have tried context-dependent ANN models,” Yu observes, “using different architectural approaches that did not perform as well. It was an amazing moment when we suddenly saw a big jump in accuracy when working on voice-based Internet search. We realized that by modeling senones directly using DNNs, we had managed to outperform state-of-the-art conventional CD-GMM-HMM large-vocabulary speech-recognition systems by a relative error reduction of more than 16 percent. This is extremely significant when you consider that speech recognition has been an active research area for more than five decades.”</p></blockquote>
<p>Kudos, Microsoft. We cannot wait to see this breakthrough commercialized because, Lord knows, you could use some help:</p>
<p><object width="500" height="400"><param name="movie" value="http://www.youtube.com/v/KyLqUf4cdwc?version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/KyLqUf4cdwc?version=3" type="application/x-shockwave-flash" width="500" height="400" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p><em>(h/t <a href="http://www.linkedin.com/in/jasonpaulhersh">Jason Hersh</a>)</em></p>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/news/microsoft-presents-speech-recognition-breakthrough/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance picks up Loquendo</title>
		<link>http://insidecti.com/wordpress/news/nuance-picks-up-loquendo/</link>
		<comments>http://insidecti.com/wordpress/news/nuance-picks-up-loquendo/#comments</comments>
		<pubDate>Mon, 15 Aug 2011 02:32:21 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[acquisition]]></category>
		<category><![CDATA[loquendo]]></category>
		<category><![CDATA[nuance]]></category>
		<category><![CDATA[speech]]></category>
		<category><![CDATA[telecom italia]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4185</guid>
		<description><![CDATA[Earlier this month, there was the rumor about Telecom Italia offloading its speech services arm Loquendo. Then on August 13, The Washington Post reported Loquendo being sold to Nuance for $75.5 million. However, as of the date and time of this post, neither Loquendo or Nuance has this information on their websites. The press release [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Earlier this month, there was the <a href="http://opusresearch.net/wordpress/2011/08/05/telecom-italia-set-to-sell-loquendo-to-nuance/">rumor</a> about Telecom Italia offloading its speech services arm Loquendo. Then on August 13, <em>The Washington Post</em> <a href="http://www.washingtonpost.com/business/industries/telecom-italia-agrees-to-sell-voice-technology-unit-to-nuance-communications/2011/08/13/gIQAMr3IDJ_story.html">reported</a> Loquendo being sold to Nuance for $75.5 million.</p>
<p>However, as of the date and time of this post, neither Loquendo or Nuance has this information on their websites. The <a href="http://www.telecomitalia.com/tit/en/archivio/media/comunicati-stampa/telecom-italia/corporate/economico-finanziario/2011/08-13.html">press release</a> came from Telecom Italia:</p>
<blockquote>
<div>
<h2><span class="Apple-style-span" style="font-size: 13px; font-weight: normal;">Telecom Italia has announced the sale of its 99.98% stake in Loquendo to U.S. company Nuance Communications, Inc. on the basis of an enterprise value of €53 million.</span></h2>
</div>
<div>
<div>
<p>The sale of Loquendo, a 2001 voice technology spin-off from Telecom Italia&#8217;s research labs with a workforce of around 100, is part of a process of rationalization of the Group&#8217;s shareholdings and a shift of focus toward its core business.</p>
<p>Nuance is committed to keeping the company&#8217;s headquarters in Turin, and creating a global centre of excellence in voice technology R&amp;D and reinforcing its collaboration with Italian universities.</p>
<p>The deal is expected to close around the end of September.</p>
<p><em>Rome, 13 August 2011</em></p>
</div>
</div>
</blockquote>
<div>
<div>
<p>So there you have it, the almighty Nuance becoming even stronger. One less player in the speech industry. Is this good or bad for the industry?</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/news/nuance-picks-up-loquendo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why speech tech still has potential</title>
		<link>http://insidecti.com/wordpress/development/why-speech-tech-still-has-potential/</link>
		<comments>http://insidecti.com/wordpress/development/why-speech-tech-still-has-potential/#comments</comments>
		<pubDate>Tue, 19 Jul 2011 02:09:28 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[browser]]></category>
		<category><![CDATA[jennifer boriss]]></category>
		<category><![CDATA[speech]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4120</guid>
		<description><![CDATA[Firefox developer Jennifer Boriss went to a local mall to find test subjects for field research into browser usage, especially in hopes of improving Mozilla Firefox. Then she stumbled upon &#8220;Joe,&#8221; a 60-year-old gentleman with no knowledge of how to use a computer, let a lone a Web browser. She opened a browser and asked [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Firefox developer Jennifer Boriss went to a local mall to find test subjects for <a href="http://jboriss.wordpress.com/2011/07/06/user-testing-in-the-wild-joes-first-computer-encounter/">field research</a> into browser usage, especially in hopes of improving Mozilla Firefox.</p>
<p>Then she stumbled upon &#8220;Joe,&#8221; a 60-year-old gentleman with no knowledge of how to use a computer, let a lone a Web browser. She opened a browser and asked Joe to perform tasks with the goal of &#8220;finding a local restaurant to eat at.&#8221;</p>
<p>First, Microsoft Internet Explorer. Disastrous results:</p>
<blockquote><p><strong>Joe:</strong> “I don’t know what anything means.”</p>
<p>(Joe reads the text on IE and clicks on “Suggested Sites”)</p>
<p><a href="http://jboriss.files.wordpress.com/2011/07/2_suggested_sites.png"><img title="2_suggested_sites" src="http://jboriss.files.wordpress.com/2011/07/2_suggested_sites.png?w=168&amp;h=50" alt="" width="168" height="50" /></a></p>
<p><strong>Me:</strong> “Why did you click on that?”</p>
<p><strong>Joe:</strong> “I don’t really know what to do, so I thought this would suggest something to me.”</p></blockquote>
<p>Next, Firefox. No better, except the <em>Help</em> pull-down menu attracted Joe&#8217;s attention which, of course, was no help to the task at hand:</p>
<blockquote><p>“Help, that’s what I need!” says Joe. He clicks on <em>Help</em>, but looks disappointed at what he sees in the menu.</p>
<p><a href="http://jboriss.files.wordpress.com/2011/07/6_expanded_help.png"><img title="6_expanded_help" src="http://jboriss.files.wordpress.com/2011/07/6_expanded_help.png?w=304&amp;h=150" alt="" width="304" height="150" /></a></p>
<p>“None of these can help me,” he says.</p></blockquote>
<p>Last, Google Chrome. Score! &#8212; but only because of luck:</p>
<blockquote><p>He proceeds to read all of the words on Chrome’s new tab page, looking for any that may offer guidance. Luckily for Joe, he spies a link to Yelp which is marked <em>San Francisco</em> in Chrome’s new tab page. He clicks it, and, seeing restaurants, declares he’s won.</p></blockquote>
<p>Joe had absolutely no idea on even the very basics of computer usage&#8230; Mouse navigation, clicking, UI elements (e.g. text input field, scroll bar, etc.), but he knew that the computer was capable of finding information.</p>
<p>This is a good example of why speech technology will thrive in the years to come. For tech savvy folks speech tech may offer an alternative input method &#8212; a speedy one at that &#8212; but for many others speech recognition is the difference between being computer literate or not.</p>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/development/why-speech-tech-still-has-potential/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nuance, nuisance to some competitors</title>
		<link>http://insidecti.com/wordpress/news/nuance-nuisance-to-some-competitors/</link>
		<comments>http://insidecti.com/wordpress/news/nuance-nuisance-to-some-competitors/#comments</comments>
		<pubDate>Mon, 13 Jun 2011 11:56:43 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[nuance]]></category>
		<category><![CDATA[paul ricci]]></category>
		<category><![CDATA[speech]]></category>
		<category><![CDATA[tellme]]></category>
		<category><![CDATA[vlingo]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4074</guid>
		<description><![CDATA[Speech tech leader Nuance is basking in some positively stellar publicity lately, mostly riding on the buzz about Apple&#8217;s upcoming iOS 5, Mac OS X &#8220;Lion&#8221; and the mega data center situated in a remote North Carolina town. The latest Apple mobile and desktop operating systems are said to have some deep integration to speech [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Speech tech leader Nuance is basking in some positively stellar publicity lately, mostly riding on the buzz about Apple&#8217;s upcoming iOS 5, Mac OS X &#8220;Lion&#8221; and the mega data center situated in a remote North Carolina town. The latest Apple mobile and desktop operating systems are said to have some deep integration to speech services from Nuance.</p>
<p>After years of mergers, acquisitions, and lawsuits, Nuance has finally <a href="http://insidecti.com/wordpress/news/apple-and-nuance-said-to-be-in-negotiations/">struck gold</a> with the Apple partnership. Forget telecom and IVRs &#8212; mobile consumer technology is where the bling is. Smartphone and tablet sales will no doubt exceed that of IVRs and speech servers.</p>
<p>Nuance and the speech industry in general have an intriguing history.</p>
<p>Things really took shape when four major players emerged in the 1990s: Lernout &amp; Hauspie, Nuance, ScanSoft, and SpeechWorks. Back then the Internet was just blossoming, CPU, RAM, and storage were still expensive, and nobody&#8217;s heard of cloud computing yet. Speech innovation was highly dependent on the Ph.D. talents and R&amp;D money. So naturally, companies with the thicker cash stash gained an advantage.</p>
<p>ScanSoft picked up some notable deals: Lernout &amp; Hauspie (December 2001, filed for bankruptcy; Dragon Systems was acquired previously by L&amp;H), SpeechWorks (August 2003), and LocusDialog (January 2004). In the span of three years the industry consolidated to just <em>two </em>big players: Nuance and ScanSoft.</p>
<p>Then in September 2005 a merger was announced between them, and the new entity to be called Nuance Communications.</p>
<p>Now came an acquisition binge, thanks to CEO Paul Ricci. Since 2000 there were 43 acquisitions. Some of the better-known buys included: Dictaphone (March 2006), Tegic Communications (August 2007), Jott Networks (July 2009), Spinvox (December 2009), and MacSpeech (February 2010).</p>
<p>Growth through M&amp;A was just part of the story. Nuance was also busy in the courts fighting its up-and-coming, lesser-known competitors. One such unlucky competitor was Vlingo. According to a <em>BusinessWeek</em> <a href="http://www.businessweek.com/magazine/content/11_22/b4230037736600.htm">article</a> in May by author <a href="http://www.businessweek.com/bios/Peter_Burrows.htm">Peter Burrows</a>:</p>
<blockquote><p>&#8220;Competing with Nuance is like having a venereal disease that&#8217;s in remission,&#8221; says Dave Grannan, CEO of Vlingo, a speech-recognition startup that&#8217;s involved in five Ricci-related lawsuits. (Nuance has four suits against Vlingo; Vlingo has one against Nuance.) &#8220;We crush them whenever we go head-to-head with them. But just when you&#8217;re thinking life is great—boom, there&#8217;s a sore on your lip.&#8221;</p>
<p>Vlingo&#8217;s adventures with Ricci began in 2008, soon after Yahoo! (<a href="http://investing.businessweek.com/research/stocks/snapshot/snapshot.asp?symbol=YHOO">YHOO</a>) chose Vlingo software over Nuance. Three months later, Grannan learned from a <cite>Boston Globe</cite> reporter that Nuance had filed a patent suit—without contacting the company to discuss royalties. &#8220;It was clearly an effort to hurt our business,&#8221; says Grannan, who expects to spend $15 million on legal fees. Nuance spokesperson Rebecca Paquette said neither Ricci nor the company would comment on specific lawsuits against Vlingo or others. &#8220;In these highly technical fields, many companies attempt to gain advantage by simply using Nuance&#8217;s inventions rather than developing their own,&#8221; she wrote in an e-mail. &#8220;We have a duty to our stockholders to preserve the value of the company and its assets.&#8221;</p>
<p>By summer 2009, with Vlingo running out of cash, according to Grannan, Ricci approached him about an acquisition. On Sept. 21, they met in San Francisco for a 14-hour negotiation. No agreement. Two days later, Ricci surprised Grannan and Vlingo co-founder Mike Phillips by calling to offer two more alternatives. First, Ricci promised to pay them and co-founder John Nguyen $5 million each if they could persuade their board to sell at his price. If that failed, and the three execs agreed to jump ship to Nuance, he&#8217;d pay them the amount they would have received in an acquisition—plus another $5 million if they stayed with the company for two years. As Ricci talked over speakerphone, Grannan says he looked at Phillips, mouthed &#8220;What the f&#8212;?&#8221;, and asked Ricci to repeat. Ricci, who speaks in the measured tones of an academic, obliged. After notifying Vlingo&#8217;s board of the offer, Grannan called Ricci back to express the board&#8217;s displeasure. &#8220;I was flabbergasted,&#8221; says Vlingo board member Bob Davoli. &#8220;I&#8217;ve been on 55 boards in my career and been a CEO twice—but I&#8217;ve never heard of anything like this.&#8221;</p>
<p>Vlingo&#8217;s board later accepted a $15 million investment in the company, after Ricci suggested that such a deal would align their interests and lead to a cessation of hostilities, says Davoli. &#8220;That was wishful thinking,&#8221; he says. Rather than drop the lawsuits, Ricci stopped taking Grannan&#8217;s phone calls. When Vlingo&#8217;s board stopped admitting a Nuance-appointed director to its board meetings, Nuance sued for the right to attend.</p></blockquote>
<p>And guess what? Nuance is still knocking at Vlingo&#8217;s door: TechCrunch recently <a href="http://techcrunch.com/2011/06/09/nuance-sues-vlingo-again-over-voice-recognition-patents/">reported</a> a new patent infringement lawsuit.</p>
<p>Tellme Networks also found itself enduring the wrath of Ricci in 2006:</p>
<blockquote><p>In late 2006, Ricci took a run at a customer—Tellme Networks, which made an automated telephone-response system. Ricci had just purchased a company that made speech software used by Tellme. Mike McCue, Tellme&#8217;s then-CEO, says he was contacted by Ricci, who declared he&#8217;d sue Tellme, introduce a competing product, and refuse to sell it more software unless Tellme&#8217;s board agreed to sell to Nuance at Ricci&#8217;s price. &#8220;It was an all-out attack on every front,&#8221; says McCue, who now runs Flipboard, maker of a popular news app. McCue did sell the company in 2007—to Microsoft (<a href="http://investing.businessweek.com/research/stocks/snapshot/snapshot.asp?symbol=MSFT">MSFT</a>), for a far higher price. (Press reports had it at around $800 million.) A court later dismissed Nuance&#8217;s patent claims as invalid. &#8220;We were able to outlast Nuance,&#8221; says McCue. &#8220;But a lot of companies can&#8217;t handle the pressure and give in. It&#8217;s happened time and again.&#8221; Ricci declined to comment on dealings with Tellme or Vlingo.</p></blockquote>
<p>Nuance&#8217;s aggressive tactics aren&#8217;t just reserved for U.S. companies, either. The founder of an Indian speech company told me a story about his ordeal with Nuance. After being contracted to develop an application for Nuance, it poached the co-founder to work as an employee. (He left for Nuance within two months after being his business partner for nine years.) To add insult to injury, Nuance pre-emptively served legal notice in fear of a lawsuit going after this co-founder and his new employer. Efforts failed to negotiate a more reasonable, smoother transition for the co-founder to jump ship.</p>
<p>David vs. Goliath in the speech industry. David usually loses. From bleeding money and resources.</p>
<p>Indeed, from the BusinessWeek article:</p>
<blockquote><p><em>Nuance&#8217;s Paul Ricci built the dominant speech-recognition company with engineering, acquisitions—and a lot of lawsuits.</em></p></blockquote>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/news/nuance-nuisance-to-some-competitors/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Voice sentiment analysis gains traction</title>
		<link>http://insidecti.com/wordpress/development/voice-sentiment-analysis-gains-traction/</link>
		<comments>http://insidecti.com/wordpress/development/voice-sentiment-analysis-gains-traction/#comments</comments>
		<pubDate>Fri, 27 May 2011 13:29:37 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[saygent]]></category>
		<category><![CDATA[sentiment analysis]]></category>
		<category><![CDATA[speech]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4051</guid>
		<description><![CDATA[What if a machine can tell whether a human is angry, sad, happy, or indifferent? Facial recognition research has also enabled studies on facial sentiment analysis, determining the emotion of a human by analyzing the dynamics of facial muscles. But video and image analysis take a lot more horsepower and dollars to be effective, and [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>What if a machine can tell whether a human is angry, sad, happy, or indifferent? Facial recognition research has also enabled studies on facial sentiment analysis, determining the emotion of a human by analyzing the dynamics of facial muscles. But video and image analysis take a lot more horsepower and dollars to be effective, and facial recognition applications are still mainly affordable only to research institutions, law enforcement agencies, military facilities, and large casinos (according to Hollywood).</p>
<p>But there&#8217;s a good chance that voice sentiment analysis becomes mainstream before its facial counterpart. After all, in many instances where we cannot see each other during a communication, voice is still the next best thing. Voice sentiment analysis is a branch of speech analytics, and some speech tech vendors may already claim their software&#8217;s sentiment analysis capabilities. However, a lot of the claim is based on searching for words or phrases from a person&#8217;s speech input (for example, looking for curse words).</p>
<p>Obviously this isn&#8217;t the ideal approach. As we all know, these four-letter words can also be dropped into expressing excitement and positive sentiment. You and I know clearly that &#8220;I&#8217;m gonna f&#8211;king cancel the service&#8221; and &#8220;I&#8217;m so f&#8211;king happy with this product&#8221; have completely opposite sentiments. But to a machine analyzing based on keywords &#8212; both could be flagged as negative responses.</p>
<p>A somewhat advanced algorithm would also take into account other words in the response. For example, scoring the &#8220;cancel&#8221; as negative and &#8220;happy&#8221; as positive following the f-word.</p>
<p>Even better is to factor in the voice acoustics. The pitch, speed, intensity, etc. of a vocal response could offer insight into the sentiment. Pay attention to your own voice next time when you&#8217;re happy or upset, and you may notice the difference.</p>
<p>The benefits of good voice sentiment analysis is apparent in the contact center environment. And there appears to be traction in this niche space&#8230;</p>
<p>Recently, startup Saygent <a href="http://techcrunch.com/2011/05/19/voice-sentiment-analysis-startup-saygent-raises-1-million/">received $1 million</a> in funding to take voice sentiment analysis to the cloud. The founders/makers of Saygent also operate SayHired, an automated phone screening service with some high-profile customers like Hertz, esurance, and Great Clips. Saygent would be the new kid on the block to compete with some of the more established speech analytics vendors like Autonomy, CallMiner, Nexidia, NICE, and Verint.</p>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/development/voice-sentiment-analysis-gains-traction/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Apple, Nuance deal could usher in new era of smart customer service</title>
		<link>http://insidecti.com/wordpress/development/apple-nuance-deal-could-usher-in-new-era-of-smart-customer-service/</link>
		<comments>http://insidecti.com/wordpress/development/apple-nuance-deal-could-usher-in-new-era-of-smart-customer-service/#comments</comments>
		<pubDate>Fri, 20 May 2011 11:50:31 +0000</pubDate>
		<dc:creator>Eugene Liu</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[apple]]></category>
		<category><![CDATA[customer service]]></category>
		<category><![CDATA[speech]]></category>

		<guid isPermaLink="false">http://insidecti.com/wordpress/?p=4046</guid>
		<description><![CDATA[Recently there&#8217;s quite a buzz &#8212; and new evidence &#8212; about a close partnership between Apple and Nuance. Speech technology is not new and neither is the fact that computer operating systems have incorporated it to a certain extent (mainly text-to-speech, or TTS). Remember when Steve Jobs introduced the first ever Mac on January 24, [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Recently there&#8217;s quite a buzz &#8212; and <a href="http://www.engadget.com/2011/05/16/nuance-voices-found-in-os-x-lion-patent-application-suggests-ne/">new evidence</a> &#8212; about a close partnership between Apple and Nuance. Speech technology is not new and neither is the fact that computer operating systems have incorporated it to a certain extent (mainly text-to-speech, or TTS). Remember when Steve Jobs introduced the <em>first ever</em> Mac on January 24, 1984? He actually allowed Mac to greet the audience (see 3:30 mark):</p>
<p><object width="500" height="400"><param name="movie" value="http://www.youtube.com/v/2B-XwPjn9YY?version=3"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/2B-XwPjn9YY?version=3" type="application/x-shockwave-flash" width="500" height="400" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>The robotic TTS doesn&#8217;t sound as sexy as Mac OS X&#8217;s Agnes, Kathy, Vicki, or Victoria, but the crowd went wild nonetheless. Back then it was pretty impressive for such a tiny computer to be able to speak. Even until today, Apple stayed true to its core philosophy of delivering the best user-friendly products, and part of that means adopting I/O interfaces that are natural to a human: speech, handwriting, and gestures.</p>
<p>The Mac introduction footage showed speech in Mac OS. It also exists in Mac OS X and iOS (as Voice Control, to a limited extent).</p>
<p>Handwriting recognition (or, as some may <a href="http://images.ucomics.com/comics/db/1993/db930827.gif">ridicule</a>, the lack thereof) was one of the highlighted features of the Apple Newton, the grandfather of PDAs introduced in 1993. The technology is present in Mac OS X and iOS as well &#8212; for example, I&#8217;m able to input Chinese characters by writing on the trackpad or iPhone screen.</p>
<p>Starting with the iPhone came the prevalence of gesture input. We&#8217;re all quite familiar with the tapping, swiping, and pinching gestures in using the iPhone and other smartphones as well. Apple even made this available on devices with trackpads. This input method is by far the most natural &#8212; even toddlers &#8220;get it.&#8221;</p>
<p>But I digressed. This article is about how the Apple-Nuance partnership could impact customer service technologies.</p>
<p>So, what really caught my attention was a <a href="http://techcrunch.com/2011/05/09/apple-nuance-data-center-deal/">piece</a> from TechCrunch:</p>
<blockquote><p>In digging into the information about the relationship between the two companies, we had heard that Apple might actually <em>already</em> be using Nuance technology in their new (but yet to be officially opened) <a href="http://techcrunch.com/2010/07/20/apple-data-center/">massive data center in North Carolina</a>. Since then, we’ve gotten multiple independent confirmations that this is indeed the case. And yes, this is said to be the keystone of a partnership that Apple is likely to announce with Nuance at WWDC next month.</p>
<p>More specifically, we’re hearing that Apple is running Nuance software — and possibly some of their hardware — in this new data center. Why? A few reasons. First, Apple will be able to process this voice information for iOS users faster. Second, it will prevent this data from going through third-party servers. And third, by running it on their own stack, Apple can build on top of the technology, and improve upon it as they see fit.</p></blockquote>
<p>We already know Apple&#8217;s interested in better speech tech. It bought Siri and has been <a href="http://insidecti.com/wordpress/development/can-apple-re-invigorate-speech-industry/">soliciting speech-related engineers</a>. If TechCrunch is right, it would be Apple&#8217;s foray into cloud-based speech tech. The company certainly has the money and know-how to grow <em>another</em> massive online service besides iTunes, App Stores, and MobileMe.</p>
<p>Proven Nuance speech technology hosted on Apple&#8217;s massive infrastructure? That&#8217;s a dream come true for a customer service app developer!</p>
<p>A company with a speech-enabled IVR can look into developing an iOS app that&#8217;s also speech-enabled, without having to burden the company&#8217;s speech servers. If the app can deliver on a better user experience with speech capabilities, can you imagine the number of calls reduced to the IVR? Phone calls and speech licenses are expensive, so divert these interactions to the app and through the user&#8217;s own monthly data subscription and Apple&#8217;s servers in N.C.</p>
<p>Plenty of companies already have their own iOS apps, but not many are focused on customer service. It&#8217;s time they think about the next version of their Company App &#8212; with speech.</p>
<p>Of course, this all hinges on Apple making the speech services unrestricted to developers. More reason to keep an eye on this year&#8217;s WWDC announcements. That is, if you&#8217;re passionate about better customer service, speech tech, and mobile apps.</p>
]]></content:encoded>
			<wfw:commentRss>http://insidecti.com/wordpress/development/apple-nuance-deal-could-usher-in-new-era-of-smart-customer-service/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

