Skip to content

GAVS – Global IT Consulting

Menu
  • Platforms & Products
    • Platforms & Products

      GAVS’ products will help change how you organize your IT Operations, bring meaningful and actionable insights to speed up network fixes, provide real data as quantifiable justification to adopt strategies that foster business improvements.

      • ZIF
      • Products
        • zDesk – Remote, Secure Desktop-as-a-Service (VDI+)
        • zIrrus
        • GTOps
        • TruOps
        • Close
    • Products & Platforms
      • Reimagining your Digital Infrastructure with Zero Incident FrameworkTM

        Read more
    Close
  • Services & Technologies
    • Services & Technologies

      GAVS is a global IT services provider with focus on AI-led Managed Services and Digital Transformation. GAVS’ AIOps platform, Zero Incident Framework ™ (ZIF), enables proactive detection and remediation of incidents and increases uptime, helping organizations drive towards a Zero Incident Enterprise™ . GAVS has transformed IT Enterprise delivery through ZIF’s Discover, Monitor, Analyze, Predict, and Remediate modules, to optimize business services continuity.

      • Digital Services
        • Auto Discovery and Dependency Mapping
        • Cloud Enablement
          • Cloud Advisory and Transformation
          • Close
        • Automation
        • Blockchain
        • Close
      • Cyber Security Services
        • Assessment & Advisory
        • Identity & Access Management (IAM)
        • Managed Detection & Response (MDR)
        • Managed Security Services (MSS)
        • Security Automation
        • Risk & Compliance
        • Close
      • Data Privacy Services
      • Consulting & Implementation Services
        • Cloud Advisory and Transformation
        • Data Center Assessment
        • Data Center-as-a-Service (DCaaS)
        • Infrastructure re-engineering
        • Data Center Consolidation & Migration
        • Close
      • Application Services
      • Enterprise Support Services
        • Managed Infrastructure Support
        • Remote Infrastructure Monitoring
        • End User Monitoring
        • Close
      • Microsoft Services
    • Services &Technologies
      • Reinforcement Learning- The Art of Teaching Machines

        Read more
    Close
  • Industries
    • Industries

      GAVS Technologies focuses on serving various industry verticals in their digital transformation through infrastructure solutions, adopting innovation and technologies in different domains. We offer services and solutions aligned with technology trends to enable enterprises to take advantage of futuristic technologies like DevOps, Smart Machines, Cloud, IoT, Predictive Analytics, Managed Infrastructure Services, and Security services.

      • Industries Overview
      • Healthcare
      • Banking & Financial Services
      • Manufacturing
      • Media & Publishing
    Close
  • Inside GAVS
    • Inside GAVS

      GAVS is a global IT services provider with focus on AI-led Managed Services and Digital Transformation. GAVS’ AIOps platform, Zero Incident Framework™ (ZIF), enables proactive detection and remediation of incidents and increases uptime, helping organizations drive towards a Zero Incident Enterprise™ . GAVS has transformed IT Enterprise delivery through ZIF’s Discover, Monitor, Analyze, Predict, and Remediate modules, to optimize business services continuity.

      • About Us
      • Client Speak
      • Alliances & Partnerships
      • Leadership Team
      • Social Responsibility
      • Events
      • Locations
      • Contact Us
      • Press Releases
      • Media Mentions
      • Awards and Recognitions
      • In Memoriam
      • Covid Care
    Close
  • Insights
    • Insights

      We bring you discerning insights on technology trends, innovation and organization culture, thru our collection of articles, blogs and more. Insights reflects our passion in driving advancements as we move forward creating new paradigms in business and work culture. You would find our thoughts on a variety of topics ranging from evolving technologies and ways it affects businesses and lives, transformational leadership, high impact teams, diversity, inclusion and much more.

      • Blogs
      • Articles
      • White Papers
      • Brochures
      • Videos
      • Case Studies
      • enGAge Magazine
    • insights
      • Seven Tips for Leading IT Modernization and Digital Transformation

        Read more

    Close
  • Work With Us
    • Work with us

      What it means to be a GAVSian?

      If you rate high on our SWAT test (Smart, Hardworking, Articulate, Technologically curious), GAVS’ hiring profile, we promise you excitement, inspiration and the freedom to succeed in our flat organization. Being a GAVSian, you would represent our cutting edge in technological advancement while we help you hone yourself into the person you aspire to be. That’s the level of personal interest we invest in you.

      • Career with GAVS
      • Company Culture
      • Diversity @ GAVS
      • Building a respectful workplace
    Close
Back to blogs

Root Cause Analysis (RCA) in IT Environment – How to connect the Dots ‘Layered Approach’

Feb 08, 2022
  • aiops artificial intelligence for it operations
  • digital transformation services and solutions
  • RCA
  • Root Cause Analysis
  • Root Cause Analysis (RCA) in IT Environment
  • ZIF AIOps platform
SHARE

In this blog post

  • Root Cause Analysis (RCA) in IT Environment – How to connect the Dots ‘Layered Approach’
  • Challenges and Connecting the Dots

Root Cause Analysis (RCA) in IT Environment – How to connect the Dots ‘Layered Approach’

Most of us have heard of the ‘5 Whys’ technique and lot more tools for the RCA. Irrespective of the tools and techniques, to make the RCA effective, it is imperative to have the 3 items listed below

Root Cause Analysis (RCA) in IT Environment
[Figure 1]
Let us understand more about the layers and how to use them for RCA. There are 4 key layers for a problem or incident in an IT environment and they are as follows

Layer A – The Application/Process/Software/Firmware that faces the issue

Layer B – The Application/Process/Software/Firmware that exists along with Layer A

Layer C – The Device/Server in which Layers A and B are running

Layer D – Other Devices/servers/Network Connectivity/External factors that co-exists and participate in this ecosystem

When RCA is to be done, slice your environment into these layers and relate the insight from these layers to arrive at the RCA. In the Figure 2 below, we can see how the Database and Web server along with Network Devices are depicted in layers. Let us take a service availability and reliability issue of a Web Application and go through various scenarios and the RCA in each type.

Database and Web server
[Figure 2]
Scenario 1 – The Network device (Layer D) has got a recent firmware upgrade and that didn’t go well, which resulted in connectivity issue between Container/Microservices layer and DB layer which are connected via this Network device, as a result the Web application is facing DB connectivity issue and user is unable to view the data in UI.

Scenario 2 – Due to recent hardening activity from security point of view, there was a recommendation to avoid default port in server (Layer C) and go for a different port to avoid security attach. While this has been done by the Server team across various servers, this change was not communicated to the Application team and the web application still is trying to reach the default port. This caused the database connectivity issue and impacted the application, where the user is unable to view the data in UI.

Scenario 3 – The Storage Disk (Layer B) of the database is not functioning properly and that resulted in Query timeout. This caused the impact on the application, where the user is unable to view the data in UI. In this case “Physical Disk Avg. Disk sec/Transfer” is high and the FileTransfer utility in the Database server was reading and writing heavily at that time, which has kept the disk busy, and that impacted the database read/write resulted in query timeout

Scenario 4 – The database application (e.g. Sqlservr.exe) (Layer A) service didn’t auto start properly after rebooting the machine. This caused the database connectivity issue and impacted on the application, where the user is unable to view the data in UI.

Challenges and Connecting the Dots

In all these scenarios, we have seen the Root cause and it varies in each scenario. The respective IT, Application, Server, Network team may follow a similar process as given in Figure 1 and 2, but doing these steps manually is almost not possible in practical situations. The other challenges are,

  1. Good Monitoring tools are required for getting good insights
  2. Even if good monitoring tools are in place, it has to be properly configured to capture parameters as deep as ‘Physical Disk Avg. Disk sec/Transfer’ as seen in Scenario 3 above and much more
  3. Getting to know how these different insights are related in real time

In all these scenarios discussed above, it is clear that the problem in any Layer (A, B, C, D) may impact the other layer(s) and affect the environment in an unexpected way. Hence, it is important to know how the various components like Core switch, Router, Floor Switch, L2, L3, Firewall, Servers, Virtual Environments, Workloads in Cloud are related to each other. Similarly, the list of Processes or Applications, Microservices, Containers running inside a server or device and how they are related to each other are to be known.

RCA Solution

ZIFTM provides the RCA through the Unified solution. ZIFTM Discovery, ZIFTM Monitoring and ZIFTM A&P does this seamlessly by discovering all the layers, like physical and logical, and application layer along with relationship. Since relationship is available automatically, the process of narrowing down the Root Cause is natural and accurate in ZIFTM with all the insights.

Author

Suresh Kumar Ramasamy

Suresh has 20 years of experience in Native Applications, Web, Cloud and Hybrid platforms from Engineering to Product Management. He has designed & hosted the monitoring solutions. He has been instrumental in conglomerating components to structure the Environment Performance Management suite of ZIF Monitor.

Suresh enjoys playing badminton with his children. He is passionate about gardening, especially medicinal plants.



Imposter Syndrome at Workplace
You Deserve This! – Dealing with Imposter Syndrome at Workplace
Read More
Sustaining Competitive Advantage
Sustaining Competitive Advantage
Read More
Citizen Developers
Citizen Developers
Read More
GAVS – Global IT Consulting

Copyright © 2022, GAVS Technologies.

  • Privacy Policy
  • Cookie Policy
  • Terms of use
  • Contact Us
  • Platforms & Products
    • Platforms & Products
    • Products
      • Zero Incident Framework ™
      • Products
      • zDesk – Remote, Secure Desktop-as-a-Service (VDI+)
      • GTOps
      • TruOps
      • zIrrus
  • Services & Technologies
    • Services & Technologies
    • Digital Services
      • Digital Services
      • Auto Discovery and Dependency Mapping
      • Cloud Enablement
        • Cloud Advisory and Transformation
      • Automation
      • Blockchain
    • Data Privacy Services
    • Cyber Security Services
      • Cyber Security Services
      • Risk and Compliance
      • Security Automation
      • Managed Security Services (MSS)
      • Managed Detection and Response (MDR)
      • Identity and Access Management
      • Assessment and Advisory
    • Consulting & Implementation Services
      • Consulting & Implementation Services
      • Cloud Assessment & Advisory
      • Data Center Assessment
      • Data Center-as-a-Service (DCaaS)
      • Infrastructure re-engineering
      • Data Center Consolidation & Migration
    • Application Services
    • Enterprise Support Services
      • Enterprise Support Services
      • Managed Infrastructure Support
      • Remote Infrastructure Monitoring
      • End User Monitoring
    • Microsoft Services
  • Industries
    • Industries Overview
    • Healthcare
    • Banking & Financial Services
    • Manufacturing
    • Media & Publishing
  • Inside GAVS
    • Inside GAVS
    • About Us
    • Industries
    • Client Speak
    • Alliances & Partnerships
    • Leadership Team
    • Social Responsibility
    • Events
    • Find us
    • Reaching us
    • Press Releases
    • Media Mentions
    • Awards and recognitions
    • In Memoriam
    • Covid Care
  • Insights
    • Insights
    • Articles
    • Blogs
    • White Papers
    • Case Studies
    • Brochures
    • Videos
    • enGAge Magazine
  • Work with us
    • Work with us
    • Career with GAVS
    • Company Culture
    • Diversity @ GAVS
    • Building a respectful workplace

Schedule a Demo