Welcome!

Microservices Expo Authors: John Worthington, Pat Romanski, Stackify Blog, Automic Blog, Simon Hill

Related Topics: Microservices Expo, Java IoT, Industrial IoT, Microsoft Cloud, Containers Expo Blog, @CloudExpo

Microservices Expo: Article

Preventive IT Analytics

An ounce of prevention is worth $2 billion

Managing software application performance is complex and increasingly challenging. Surprisingly, even today, massive IT resources remain chained to ensuring applications and infrastructures are up and running smoothly, rather than proactively leveraging strategic technologies to solve business problems or achieve competitive advantage.

That world is changing.
A new generation of preventive analytic technologies is emerging in the APM (Application Performance Management) industry, unlocking untapped value in this $2+ billion market that is growing $300 million per year. These technologies are only now achievable via revolutionary machine learning technologies, advanced quantitative analytics, and the natural evolution of BI ... as BI-like analytics are bleeding into the APM world.

IT can now actually predict and anticipate problems. In doing so, IT professionals are preventing the fire drills that result in MTTR (mean time to repair) focus and metrics. In turn, IT has more time to prevent performance incidents from occurring at all, pursue these preventive "fixes" in an orderly and efficient manner, and, ultimately, devote more time to optimizing the use of technology for business gain.

It may sound like a science fiction movie, but today's application performance analytics technology is truly able to predict the future, identify incidents before the occur, and allow IT to solve them in a pre-emptive manner.

The chart below from a recent TRAC Research study on TC-APM, Transaction Centric Application Performance Management demonstrates the traditional state of APM in a MTTR (mean time to repair) based world.

Chart 1: Average Mean Time to Repair (in minutes) for Issues with Multi-Tier Applications per Incident. TRAC Research

Source: TRAC Research, June 2012.

As the APM market has evolved from Infrastructure-Centric Application Performance Management (ICAPM) to Transaction-Centric APM (TC-APM), there have been massive gains in accelerating MTTR. In fact, MTTR is more than 70% faster when IT operations can drive analytics and insights from the application transaction layer down versus trying to correlate insights from monitoring infrastructure components and in an upward fashion through the infrastructure.

Follow the trend line of the chart above and you can see how the next major step is preventing incidents in the first place - Mean Time to Prevent (MTTP) will become the new, more important metric, over taking MTTR.

For Example...
An Internet ecommerce customer using preventative or predictive APM analytic technologies such as those provided by Appnomic Systems, Netuitive or Prelert can readily assess a potential issue in real-time to prevent outages or end user affecting response times. The graphic below is a real-world example from an Appnomic Systems' customer.

Chart 2: Average Mean Time to Prevent (in hours) for an Internet ecommerce application pre-outage early warning alert.

This chart illustrates how an Internet ecommerce company identified the root cause of an ultimate outage that occurred six hours after an alert from the company's APM solution with preventive analytics. While, in this case, an early alert did not result in preventing the ultimate outage because this organization was in early stage deployment of preventive analytics and still learning how to use the technology, the IT operations professionals using the technology had an unprecedented, visceral experience of how preventive analytics can work for them. While it may have seemed unbelievable at the time, this company now knows preventive analytics are not science fiction.

BI Meets APM
The chart below helps put this industry trend in perspective. The graphic, from The Data Warehousing Institute (TDWI), frames up how the BI (Business Information) industry has been evolving. As BI-types of analytics are penetrating IT operations and APM tools, analysts including Gartner, EMA, TRAC Research, and others are actively working to help technology buyers consider how to take advantage of this technology confluence and take advantage of the benefits of applying advanced analytics in the APM realm.

Chart 3: TDWI Model of the BI Industry with an overlay of how Preventive Analytics are an emerging new frontier.

Source: Predictive Analytics: Extending the Value of Your Data Warehousing Investment by Wayne Eckerson, TDWI Best Practices Report, Q1 2007. Reprinted with permission. For more information about TDWI Research, visit tdwi.org.

A rich, natural evolution of analytics technology is underway, laying the foundation for the newest application of analytics to the world of IT. On the fundamental TWDI framework, an overlay of the emerging, next phase of BI being applied to application performance management has been added in green where preventive analytics are highlighted and of greatest impact.

Why now?
There are three key reasons why preventive analytics are emerging now.

Not long ago, car enthusiasts would spend time under the hood juicing their "hot rods" performance. Now, with the incorporation of complex chipsets throughout the vehicle operations, those days of tinkering are gone. The same is true with most of today's enterprise grade application stacks - IT "under the hood" is just too complex for anyone to understand and manage without adequate tools.

Second, the IT professional's job is nearly impossible with the proliferation of hot IT trends including enterprise social media, mobility, "Big Data," and end-user demand for SaaS or cloud application support. IT organizations are struggling with these trends and losing the race to get in front of applying technology for business benefit versus maintaining technology operations. Preventive analytics helps shift the time required to maintain operations by preventing incidents, war room exercises, RCAs (root cause analyses), and the like.

Finally, advanced mathematics, machine-learning technologies, higher compute power and low cost are all enabling the processing of these massive volumes of data and computational algorithms in speeds that were not possible before.

What you should do about it.
Get going...

For the IT operations and application operations organizations around the globe who are mature enough or who have enough to gain from preventing application performance affecting incidents, they are seeking out those APM or related technology vendors developing this preventive analytics practice and technology. They want to test it out in real-world settings and are regularly surprised - positively - with the results.

If you are operating in a rapidly changing and early stage organization, you may want to keep an eye on these technologies and know that, when you are ready, they are available to help you mature your organization and operations.

Note, the key is not trying to measure or monitor everything. Successful organizations are focusing on a particular pain or value point and are taking the technology for a test run.

In some cases, the deployment may take some time and there may be a "discovery" or "due diligence" process as you identify the best application and the right metrics to feed into the analytics engine, but there is an alluring and worthwhile return on investment in time and technology.

It does not have to be a world of science fiction to know that an ounce of prevention equals a pound of cure - and, in this case, potentially millions or billions of dollars as the APM industry figures out how to shift from MTTR focused activities to MTTP - more time to PREVENT incidents in the first place.

More Stories By Ray Solnik

Ray Solnik is President of Appnomic Systems. As president of Appnomic Systems, he has P & L responsibility with a focus on business growth in North America. He brings to Appnomic twenty years of experience in cloud computing, managed network services, and data communications.

Prior to Appnomic, Ray was president and COO of OpSource, an early SaaS/IaaS provider, which was acquired and is now the core Cloud offering of Dimension Data - a $4 billion systems integrator. Ray has helped multiple next generation companies develop and drive strategies resulting in successful fundraising from top venture capital investors, including Gengo, PowerCloud Systems, and CrowdFlower.

Earlier in his career, Ray was chief development officer of New Edge Networks (acquired by EarthLink), and president of AT&T’s consumer Internet services business, AT&T WorldNet. He has a bachelor's degree in economics from the University of Michigan and an MBA from Stanford Graduate School of Business. He lives in Silicon Valley.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@MicroservicesExpo Stories
identify the sources of event storms and performance anomalies will require automated, real-time root-cause analysis. I think Enterprise Management Associates said it well: “The data and metrics collected at instrumentation points across the application ecosystem are essential to performance monitoring and root cause analysis. However, analytics capable of transforming data and metrics into an application-focused report or dashboards are what separates actual application monitoring from relat...
While some developers care passionately about how data centers and clouds are architected, for most, it is only the end result that matters. To the majority of companies, technology exists to solve a business problem, and only delivers value when it is solving that problem. 2017 brings the mainstream adoption of containers for production workloads. In his session at 21st Cloud Expo, Ben McCormack, VP of Operations at Evernote, discussed how data centers of the future will be managed, how the p...
DevOps teams have more on their plate than ever. As infrastructure needs grow, so does the time required to ensure that everything's running smoothly. This makes automation crucial - especially in the server and network monitoring world. Server monitoring tools can save teams time by automating server management and providing real-time performance updates. As budgets reset for the New Year, there is no better time to implement a new server monitoring tool (or re-evaluate your current solution)....
The benefits of automation are well documented; it increases productivity, cuts cost and minimizes errors. It eliminates repetitive manual tasks, freeing us up to be more innovative. By that logic, surely, we should automate everything possible, right? So, is attempting to automate everything a sensible - even feasible - goal? In a word: no. Consider this your short guide as to what to automate and what not to automate.
Cavirin Systems has just announced C2, a SaaS offering designed to bring continuous security assessment and remediation to hybrid environments, containers, and data centers. Cavirin C2 is deployed within Amazon Web Services (AWS) and features a flexible licensing model for easy scalability and clear pay-as-you-go pricing. Although native to AWS, it also supports assessment and remediation of virtual or container instances within Microsoft Azure, Google Cloud Platform (GCP), or on-premise. By dr...
Let's do a visualization exercise. Imagine it's December 31, 2018, and you're ringing in the New Year with your friends and family. You think back on everything that you accomplished in the last year: your company's revenue is through the roof thanks to the success of your product, and you were promoted to Lead Developer. 2019 is poised to be an even bigger year for your company because you have the tools and insight to scale as quickly as demand requires. You're a happy human, and it's not just...
"Opsani helps the enterprise adopt containers, help them move their infrastructure into this modern world of DevOps, accelerate the delivery of new features into production, and really get them going on the container path," explained Ross Schibler, CEO of Opsani, and Peter Nickolov, CTO of Opsani, in this SYS-CON.tv interview at DevOps Summit at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Enterprises are adopting Kubernetes to accelerate the development and the delivery of cloud-native applications. However, sharing a Kubernetes cluster between members of the same team can be challenging. And, sharing clusters across multiple teams is even harder. Kubernetes offers several constructs to help implement segmentation and isolation. However, these primitives can be complex to understand and apply. As a result, it’s becoming common for enterprises to end up with several clusters. Thi...
It’s “time to move on from DevOps and continuous delivery.” This was the provocative title of a recent article in ZDNet, in which Kelsey Hightower, staff developer advocate at Google Cloud Platform, suggested that “software shops should have put these concepts into action years ago.” Reading articles like this or listening to talks at most DevOps conferences might make you think that we’re entering a post-DevOps world. But vast numbers of organizations still struggle to start and drive transfo...
The nature of test environments is inherently temporary—you set up an environment, run through an automated test suite, and then tear down the environment. If you can reduce the cycle time for this process down to hours or minutes, then you may be able to cut your test environment budgets considerably. The impact of cloud adoption on test environments is a valuable advancement in both cost savings and agility. The on-demand model takes advantage of public cloud APIs requiring only payment for t...
High-velocity engineering teams are applying not only continuous delivery processes, but also lessons in experimentation from established leaders like Amazon, Netflix, and Facebook. These companies have made experimentation a foundation for their release processes, allowing them to try out major feature releases and redesigns within smaller groups before making them broadly available. In his session at 21st Cloud Expo, Brian Lucas, Senior Staff Engineer at Optimizely, discussed how by using ne...
While we understand Agile as a means to accelerate innovation, manage uncertainty and cope with ambiguity, many are inclined to think that it conflicts with the objectives of traditional engineering projects, such as building a highway, skyscraper or power plant. These are plan-driven and predictive projects that seek to avoid any uncertainty. This type of thinking, however, is short-sighted. Agile approaches are valuable in controlling uncertainty because they constrain the complexity that ste...
"We're developing a software that is based on the cloud environment and we are providing those services to corporations and the general public," explained Seungmin Kim, CEO/CTO of SM Systems Inc., in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
The cloud revolution in enterprises has very clearly crossed the phase of proof-of-concepts into a truly mainstream adoption. One of most popular enterprise-wide initiatives currently going on are “cloud migration” programs of some kind or another. Finding business value for these programs is not hard to fathom – they include hyperelasticity in infrastructure consumption, subscription based models, and agility derived from rapid speed of deployment of applications. These factors will continue to...
"This all sounds great. But it's just not realistic." This is what a group of five senior IT executives told me during a workshop I held not long ago. We were working through an exercise on the organizational characteristics necessary to successfully execute a digital transformation, and the group was doing their ‘readout.' The executives loved everything we discussed and agreed that if such an environment existed, it would make transformation much easier. They just didn't believe it was reali...
"CA has been doing a lot of things in the area of DevOps. Now we have a complete set of tool sets in order to enable customers to go all the way from planning to development to testing down to release into the operations," explained Aruna Ravichandran, Vice President of Global Marketing and Strategy at CA Technologies, in this SYS-CON.tv interview at DevOps Summit at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
We just came off of a review of a product that handles both containers and virtual machines in the same interface. Under the covers, implementation of containers defaults to LXC, though recently Docker support was added. When reading online, or searching for information, increasingly we see “Container Management” products listed as competitors to Docker, when in reality things like Rocket, LXC/LXD, and Virtualization are Dockers competitors. After doing some looking around, we have decided tha...
Agile has finally jumped the technology shark, expanding outside the software world. Enterprises are now increasingly adopting Agile practices across their organizations in order to successfully navigate the disruptive waters that threaten to drown them. In our quest for establishing change as a core competency in our organizations, this business-centric notion of Agile is an essential component of Agile Digital Transformation. In the years since the publication of the Agile Manifesto, the conn...
"Codigm is based on the cloud and we are here to explore marketing opportunities in America. Our mission is to make an ecosystem of the SW environment that anyone can understand, learn, teach, and develop the SW on the cloud," explained Sung Tae Ryu, CEO of Codigm, in this SYS-CON.tv interview at 21st Cloud Expo, held Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA.
Many enterprise and government IT organizations are realizing the benefits of cloud computing by extending IT delivery and management processes across private and public cloud services. But they are often challenged with balancing the need for centralized cloud governance without stifling user-driven innovation. This strategy requires an approach that fundamentally reshapes how IT is delivered today, shifting the focus from infrastructure to services aggregation, and mixing and matching the bes...