Skip to content

GAVS – Global IT Consulting

Menu
  • Industries
    • Industries

      GAVS Technologies focuses on serving various industry verticals in their digital transformation through infrastructure solutions, adopting innovation and technologies in different domains. We offer services and solutions aligned with technology trends to enable enterprises to take advantage of futuristic technologies like DevOps, Smart Machines, Cloud, IoT, Predictive Analytics, Managed Infrastructure Services, and Security services.

      • Healthcare
      • Life Sciences
      • Banking & Financial Services
      • Manufacturing
      • Hi-Tech & Software
      • Telecom
    Close
  • Services
    • Services & Technologies

      GAVS is a global IT services provider with focus on AI-led Managed Services and Digital Transformation. GAVS’ AIOps platform, Zero Incident Framework ™ (ZIF), enables proactive detection and remediation of incidents and increases uptime, helping organizations drive towards a Zero Incident Enterprise™ . GAVS has transformed IT Enterprise delivery through ZIF’s Discover, Monitor, Analyze, Predict, and Remediate modules, to optimize business services continuity.

      • Digital Product Engineering
      • Application services & modernization
        • Application Development
        • Application Modernization
        • Application Management
        • Close
      • Cloud Enablement
        • Cloud Consulting
        • Cloud Operations
        • Cloud Native Engineering
        • Cloud Data
        • Cloud Transformation
        • Cloud Consulting and Advisory
        • Cloud Managed Services
        • Close
      • Generative AI
      • Data Strategy and Modernization
        • Data Privacy
        • Close
      • Cyber Security
        • Governance, risk and compliance
        • Digital Identity Management
        • Infrastructure Security
        • Digital IDM
        • Data Privacy
        • Governance, Risk and Complaince
        • Vulnerability Management
        • Business Continuity Management System
        • Close
      • User Experience Design
      • Enterprise Applications
        • Managed Infrastructure Support
        • Remote Infrastructure Monitoring
        • Microsoft
        • Close
    • Services &Technologies
      • Reinforcement Learning- The Art of Teaching Machines

        Read more
    Close
  • Platforms & Products
    • Platforms & Products

      GAVS’ products will help change how you organize your IT Operations, bring meaningful and actionable insights to speed up network fixes, provide real data as quantifiable justification to adopt strategies that foster business improvements.

      • Products
        • ZIF
        • zIrrus
        • zDesk
        • Close
      • IP Accelerators
        • CloudGain
        • vKYC
        • ENWAT
        • IdentityDesk
        • Close
    • Reimagining your Digital Infrastructure with Zero Incident FrameworkTM

      Read more
    Close
  • Inside GAVS
    • Inside GAVS

      GAVS is a global IT services provider with focus on AI-led Managed Services and Digital Transformation. GAVS’ AIOps platform, Zero Incident Framework™ (ZIF), enables proactive detection and remediation of incidents and increases uptime, helping organizations drive towards a Zero Incident Enterprise™ . GAVS has transformed IT Enterprise delivery through ZIF’s Discover, Monitor, Analyze, Predict, and Remediate modules, to optimize business services continuity.

      • About Us
      • Client Speak
      • Alliances & Partnerships
      • Leadership Team
      • Social Responsibility
      • Events
      • Locations
      • Contact Us
      • Press Releases
      • Media Mentions
      • Awards and Recognitions
      • In Memoriam
      • Covid Care
    Close
  • Insights
    • Insights

      We bring you discerning insights on technology trends, innovation and organization culture, thru our collection of articles, blogs and more. Insights reflects our passion in driving advancements as we move forward creating new paradigms in business and work culture. You would find our thoughts on a variety of topics ranging from evolving technologies and ways it affects businesses and lives, transformational leadership, high impact teams, diversity, inclusion and much more.

      • Blogs
      • Articles
      • White Papers
      • Brochures
      • Videos
      • Case Studies
      • enGAge Magazine
    • insights
      • Seven Tips for Leading IT Modernization and Digital Transformation

        Read more

    Close
  • Work with Us
    • Work with us

      What it means to be a GAVSian?

      If you rate high on our SWAT test (Smart, Hardworking, Articulate, Technologically curious), GAVS’ hiring profile, we promise you excitement, inspiration and the freedom to succeed in our flat organization. Being a GAVSian, you would represent our cutting edge in technological advancement while we help you hone yourself into the person you aspire to be. That’s the level of personal interest we invest in you.

      • Career with GAVS
      • Company Culture
      • Diversity @ GAVS
      • Building a respectful workplace
    Close
    • Close
Back to blogs

Evolution of speech recognition

Jan 14, 2020
SHARE

In this blog post

  • Early Days
  • Current Landscape
  • Application of Speech Recognition Technology
  • Benefits include:
  • References:
  • About the Author:

Naveen KT

Speaking with inanimate objects and getting work done through them has transitioned from being a figment of our imagination to a reality. Case in point, personal assistant devices like Alexa can recognize our words, interpret the meaning and carry out commands.

The journey of speech recognition technology has been nothing short of a rollercoaster ride. Let us look at the developments that enabled commercialization of ASR and what these systems could accomplish, long before any of us had heard of Siri or Google Assistant.

The speech recognition field was propelled by both the application of different approaches and the advancement of technology. Over a decade, researchers would conceive of myriad ways to dissect language: by sounds, by structure and with statistics.

Early Days

Even though human interest in recognizing and synthesizing speech goes back centuries, it was only in the last century that something recognizable as ASR was built. The ‘digit recognizer’ named Audrey, by Bell Laboratories was among the first projects. It could identify spoken numbers by looking for audio fingerprints called formants, the distilled essences of sounds.

Even though human interest in recognizing and synthesizing speech goes back centuries, it was only in the last century that something recognizable as ASR was built. The ‘digit recognizer’ named Audrey, by Bell Laboratories was among the first projects. It could identify spoken numbers by looking for audio fingerprints called formants, the distilled essences of sounds.

Next came the Shoebox in the 1960s. Developed by IBM, the Shoebox could recognize numbers and arithmetic commands (like ‘plus’ and ‘total’). Shoebox could also pass on the math problem to an adding machine, to calculate and print the answer.

Half way across the world, in Japan, hardware was being built that could recognize the constituent parts of speech like vowels. Systems were also being built to evaluate the structure of speech to figure out where a word might end.

A team at University College in England had devised a system that could recognize 4 vowels and 9 consonants by analysing phonemes, the discrete sounds of a language.

However, these were all disjointed efforts and were lacking direction.

In a surprising turn of events, the funding for ASR programs in Bell Laboratories were stopped in 1969. The reasons cited were “lack of scientific rigor” in the field and “too much wild experimentation”. It was reinstated in 1971.

In the early 1970s, the U.S. Department of Defence’s ARPA (the agency now known as DARPA) funded a five-year program called Speech Understanding Research. Several ASR systems were created and the most successful one Harpy (by Carnegie Mellon University), could recognize over 1000 words. Efforts to commercialize the technology had also picked up speed. IBM was working on speech transcription in the context of office correspondence, and Bell Laboratories on ‘command and control’ scenarios.

The key turning point was the popularization of Hidden Markov Models (HMMs). These models used a statistical approach that translated to a leap forward in accuracy. Soon, ASR field began coalescing around a set of tests that provided a benchmark to compare to. This was further encouraged by the release of shared data sets that researchers could use to train and test their models on.

ASR as we know it today, was introduced in the 1990s. Dragon Dictate launched in 1990 for a staggering $9,000, with a dictionary of 80,000 words and features like natural language processing.

These tools were time-consuming and it required that users speak in a tilted manner; Dragon could initially recognize only 30–40 words a minute; people typically talk around four times faster than that. By 1997, they introduced Dragon NaturallySpeaking, which could capture words at a more fluid pace and at a much lower price tag of $150.

Current Landscape

Voice has been touted as the future. Tech giants are investing in it and placing voice-enabled devices at the core of their business strategy.

Machine learning has been behind major breakthroughs in the field of speech recognition. Google’s efforts in this field culminated in the introduction of Google Voice Search app in 2008. They further refined this technology, with the help of huge volumes of training data and finally launched the Google Assistant.

Digital assistants like Google Assistant, Siri, Alexa and others, are changing the way people interact with their devices. Digital assistants are intended to assist individuals with performing or completing fundamental assignments and react to enquiries.

With the capacity to retrieve data from a wide variety of sources, these robots help take care of issues progressively, upgrading the UX and human productivity.

Popular Voice assistants include:

  • Amazon’s Alexa
  • Apple’s Siri
  • Google’s Google Assistant
  • Microsoft’s Cortana

Application of Speech Recognition Technology

Speech recognition technology and the use of digital has moved rapidly from our phones to our homes, and its application in ventures, for example, business, banking, advertising, and health care is rapidly becoming obvious.

In Workplace: Speech recognition technology in the work environment has been a push to increase productivity and efficiency. Examples of office tasks digital assistants are, or will be, able to perform:

  • Search for documents or reports on computer
  • Create tables or graphs using data
  • Answer queries
  • On-request document printing
  • Record minutes
  • Perform other routine tasks like scheduling meetings and making travel arrangements

In Banking: Theaim of Speech Recognition, in Banking

  • Financial industries is to reduce friction for the customer. Voice-enacted banking could diminish the requirement for human client assistance and lower employee costs. A customized financial partner could consequently help consumer loyalty and satisfaction.

How speech recognition can improve banking:

  • Request financial information
  • Make payments
  • Receive information about your transaction history

In Marketing: Voice-search can and will cause shifts in consumer behaviour. It is essential to understand such shifts and tweak the marketing activities to keep up with the times.

  • With speech recognition, there will be another type of information accessible for advertisers to examine. People’s accents, speech patterns, and vocabulary can be utilized to translate a purchaser’s area, age, and other data with respect to their socioeconomics, for example, their social alliance.
  • Speaking allows for longer, more conversational searches. Advertisers and optimisers may need to concentrate on long-tail keywords and on creating conversational substances to remain in front of these patterns.

In HealthCare: In a situation where seconds are critical and clean working conditions are essential, hands-free, prompt access to data can have a positive effect on medical efficiency.

Benefits include:

  • Quick looking up of information from medical records
  • Less paperwork
  • Reduced time on inputting data
  • Improved workflow

This is just scratching the surface of the applications of this technology. The future of speech recognition technology holds a lot of promise across various industries.

References:

  1. https://www.getsmarter.com/blog/market-trends/applications-of-speech-recognition
  2. https://medium.com/swlh/the-past-present-and-future-of-speech-recognition-technology-cf13c179aaf
  3. https://www.globalme.net/blog/the-present-future-of-speech-recognition
  4. https://bit.ly/347MAYw
  5. https://bit.ly/2Oq9VOC
  6. https://bit.ly/2OtFp6t
  7. https://bit.ly/2pB6hcr
  8. https://bit.ly/2QAYR43

About the Author:

Naveen is a software developer at GAVS. He teaches underprivileged children and is interested in giving back to society in as many ways as he can. He is also interested in dancing, painting, playing keyboard and is a district-level handball player.



aiops providers
Understanding the Role of Automation in SRE and Techniques for Routine Task Automation
Read More
Best Cyber Security Services Companies
Best Strategies for Protecting Your Data and Infrastructure and The Evolution of Cybersecurity: How Digital Immune System (DIS) is Changing the Game
Read More
ai-led operations management services in healthcare
Transforming Healthcare Sector with Generative AI
Read More
GAVS – Global IT Consulting

Copyright © 2023, GAVS Technologies.

  • Privacy Policy
  • Cookie Policy
  • Terms of use
  • Contact Us
  • Platforms & Products
    • Platforms & Products
    • Products
      • Zero Incident Framework ™
      • Products
      • zDesk – Remote, Secure Desktop-as-a-Service (VDI+)
      • GTOps
      • TruOps
      • zIrrus
  • Services & Technologies
    • Services & Technologies
    • Digital Services
      • Digital Services
      • Auto Discovery and Dependency Mapping
      • Cloud Enablement
        • Cloud Advisory and Transformation
      • Automation
      • Blockchain
    • Data Privacy Services
    • Cyber Security Services
      • Cyber Security Services
      • Risk and Compliance
      • Security Automation
      • Managed Security Services (MSS)
      • Managed Detection and Response (MDR)
      • Identity and Access Management
      • Assessment and Advisory
    • Consulting & Implementation Services
      • Consulting & Implementation Services
      • Cloud Assessment & Advisory
      • Data Center Assessment
      • Data Center-as-a-Service (DCaaS)
      • Infrastructure re-engineering
      • Data Center Consolidation & Migration
    • Application Services
    • Enterprise Support Services
      • Enterprise Support Services
      • Managed Infrastructure Support
      • Remote Infrastructure Monitoring
      • End User Monitoring
    • Microsoft Services
  • Industries
    • Industries Overview
    • Healthcare
    • Banking & Financial Services
    • Manufacturing
    • Media & Publishing
  • Inside GAVS
    • Inside GAVS
    • About Us
    • Industries
    • Client Speak
    • Alliances & Partnerships
    • Leadership Team
    • Social Responsibility
    • Events
    • Find us
    • Reaching us
    • Press Releases
    • Media Mentions
    • Awards and recognitions
    • In Memoriam
    • Covid Care
  • Insights
    • Insights
    • Articles
    • Blogs
    • White Papers
    • Case Studies
    • Brochures
    • Videos
    • enGAge Magazine
  • Work with us
    • Work with us
    • Career with GAVS
    • Company Culture
    • Diversity @ GAVS
    • Building a respectful workplace

Schedule a Demo