Speech and Sound: The Next "Killer Paradigm Shift"..?

Speech recognition could impact the business at a variety of levels

There was a time, not so very long ago, when IT directors and chief information officers dismissed the Internet as something of a passing fad. Somehow, though, things took off pretty well with the whole web thing, didn't they? Mobile telephony has also grown to a level of dominance that we could never have predicted when it first started appearing around 30 years ago.

Then came the tablet... just another fad, right? Well, the first few were, but then "Magic Steve" produced the tablet we all love and cherish, didn't he? (OK, yes - I know Android is doing well in this space too, you don't need to write in)... so what's coming next?

What Is Our Next Killer Paradigm?
Many believe that "sound" will be the next killer element of "social computing" when it comes to sharing information. After all, we share text in various forms, images and video all the time. Shouldn't this mean that "sound" is our next most logically interesting data-share element?

What kind of sound? Our own spoken voice, recorded speech, random commentary, music, environmental recordings -- it's a long list and you can certainly add at least one of your own if you give it a moment's thought. Yes, we can link to each other's podcasts already, but we are talking about a level beyond that.

The next tier for sound is allied to its close first cousin "speech" and both could (arguably) be about to move from the playground to the boardroom and therefore potentially move into the CIO's line of sight.

The Speech Steeplechase
The problem is that in its early years, speech/voice recognition technology was something of a novelty. But look at the facts: fingerprint recognition biometrics only surfaced toward the end of the last millennium, and now we have "secure USB flash drives" that work by a finger-swipe. The development curve for surface-level, extremely user-facing technologies has been in overdrive for the last decade, if not more.

Speech recognition companies such as Nuance, which produces the off-the-shelf Dragon NaturallySpeaking product, see a future for their technology in several corporate deployment scenarios, grounded in its suitability for individual users. The company is something of a market leader, with manufacturers from HP to Apple to IBM all working with its technology.

Nuance describes the human voice as an "incredibly rich, natural and efficient means of communication" - and the industry is now working to build solutions that enable computers, phones, tablets, automobiles, TVs and consumer electronics to understand the human voice, providing a "natural interface" between man and machine.

Speech recognition could impact the business at a variety of levels:

  • Speech is used in CRM analytics inside call center deployments so that customer conversations can be analyzed and filtered to discover which keywords customers are using.
  • Healthcare CIOs will already know that CLU (Clinical Language Understanding) technology has a huge role to play in helping healthcare enterprises overcome their "Big Data" challenges, namely the ability to collect, process, interpret and then utilise information.
  • Nuance is not alone... Google is also said to be attempting to "pioneer" technology that will ultimately enable users to search by the spoken word. Microsoft has similar plans with Bing.
  • Mobile applications (at both the consumer and enterprise level) offer a large number of opportunities for speech recognition. From simple voice commands used to control smartphones, to more powerful voice-driven in-car entertainment and/or so-called "infotainment systems," speech arguably has a strong new role to play.
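
To make the call center scenario in the first bullet a little more concrete, here is a minimal Python sketch of keyword discovery over conversations that have already been transcribed to text. It is an illustration only, not any vendor's method: the function name `top_keywords`, the `STOPWORDS` list and the sample transcripts are all assumptions made up for this example.

```python
import re
from collections import Counter

# Hypothetical list of filler words to ignore (an assumption for this sketch).
STOPWORDS = {"i", "a", "for", "my", "the", "has", "not", "was", "want"}


def top_keywords(transcripts, n=3):
    """Count non-stopword tokens across transcripts; return the n most frequent."""
    counts = Counter()
    for text in transcripts:
        # Lowercase the text and split it into simple word tokens.
        tokens = re.findall(r"[a-z']+", text.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    return counts.most_common(n)


if __name__ == "__main__":
    # Made-up transcripts standing in for speech-to-text output.
    calls = [
        "I want a refund for my order",
        "The refund has not arrived",
        "My order was late",
    ]
    print(top_keywords(calls, n=2))
```

A real deployment would sit downstream of a speech recognition engine and would likely use far richer filtering than a word count, but the shape of the task is the same: turn conversations into text, then surface the terms customers keep repeating.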

How Does It Work? Nuance Explains...

  1. A user speaks a command into a microphone.
  2. The system converts the sound input into a digital signal.
  3. The signal is analyzed and chopped into component speech sounds called "phonemes."
  4. Each phoneme is examined in context with those around it, and statistical probability algorithms are used to determine the intended word from a stored list. This happens for each word.
  5. Each word is examined in context with those around it, and statistical probability algorithms are used to determine the intended command.
  6. The appropriate response for the command is triggered.
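
Steps 3 to 6 above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not Nuance's implementation: the lexicon, the word-pair probabilities, the command table and every function name are hypothetical, and the phonemes are assumed to have been extracted from the audio already (steps 1 to 3).

```python
# Hypothetical stored word list: word -> phoneme sequence (assumed labels).
LEXICON = {
    "call": ("K", "AO", "L"),
    "tall": ("T", "AO", "L"),
    "home": ("HH", "OW", "M"),
    "foam": ("F", "OW", "M"),
    "go": ("G", "OW"),
}

# Hypothetical probabilities of a word given the previous word.
BIGRAM = {
    ("<s>", "call"): 0.6,
    ("<s>", "tall"): 0.1,
    ("call", "home"): 0.5,
}
DEFAULT_PRIOR = 0.01  # assumed fallback for unseen word pairs

# Hypothetical stored commands and the responses they trigger.
COMMANDS = {("call", "home"): "DIAL_HOME", ("go", "home"): "NAVIGATE_HOME"}


def acoustic_score(observed, expected):
    """Fraction of phoneme positions that match (0.0 if lengths differ)."""
    if len(observed) != len(expected):
        return 0.0
    return sum(o == e for o, e in zip(observed, expected)) / len(expected)


def decode_word(observed, previous_word):
    """Step 4: choose the most probable word from the stored list."""
    def score(word):
        prior = BIGRAM.get((previous_word, word), DEFAULT_PRIOR)
        return acoustic_score(observed, LEXICON[word]) * prior
    return max(LEXICON, key=score)


def decode_command(words):
    """Step 5: choose the stored command that best overlaps the words."""
    def score(phrase):
        return len(set(phrase) & set(words)) / len(phrase)
    return COMMANDS[max(COMMANDS, key=score)]


def recognize(phoneme_groups):
    """Run steps 4 to 6 on phoneme groups, one group per spoken word."""
    words, previous = [], "<s>"
    for group in phoneme_groups:
        previous = decode_word(group, previous)
        words.append(previous)
    return words, decode_command(words)


if __name__ == "__main__":
    # Both words are "misheard" in their first phoneme; context fixes them.
    print(recognize([("P", "AO", "L"), ("V", "OW", "M")]))
```

In the usage example, the first sound of each word is deliberately wrong, so "call" and "tall" match the audio equally well; the word-pair probabilities are what tip the decision. Production systems rely on much more sophisticated statistical models, but the principle of combining what was heard with what is likely in context is the one the steps describe.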

The CIO's Central Message
It seems that many real-world scenarios could use not only speech recognition technologies but also their sister disciplines, i.e., text-to-speech technology, document imaging and electronic dictation services, which do of course throw up their own data storage challenges.

Nuance VP Peter Mahoney has suggested that really robust industrial-grade speech recognition in the space-age style as depicted in Hollywood movies (or to give it its proper name - "robust natural language" technology) is not far off at all - and that we should see six to ten languages fully supported by this technology as soon as the end of this year.

It's not Star Trek quite yet, but we're close!

•   •   •

This post was first published on the Enterprise CIO Forum.

More Stories By Adrian Bridgwater

Adrian Bridgwater is a freelance journalist and corporate content creation specialist focusing on cross-platform software application development as well as all related aspects of software engineering, project management and technology as a whole.
