Welcome!

Microservices Expo Authors: Liz McMillan, Lori MacVittie, Dana Gardner, Elizabeth White, Pat Romanski

Related Topics: @CloudExpo, Microservices Expo, Containers Expo Blog, Agile Computing, @BigDataExpo, SDN Journal

@CloudExpo: Article

Twitter Is Not a SaaS Monitoring Solution

The crowd can help IT, but only if the right information is shared

A few weeks ago I was trying to update some files I have stored on a cloud storage service (that will remain nameless). I had moved my files there a while back as a way to make it easier to access them from my various devices and to avoid losing them during the next inevitable hard drive failure. For the most part I've been happy with the service, but on this day, I was unable to access the site.

Not good, as I was rushing to make some changes and send the files to a colleague.

Frustrated by my situation, I asked a co-worker to see if he was also having problems. He was, so we did the next logical thing you would expect. We went to the service provider's status page to see what they had to say. According to it, the service was healthy and there were no current service or maintenance notices.

#nowwhat?
Twitter! Of course. Whenever services like YouTube or Hulu have outages, users light-up Twitter with comments and laments. Sure enough, a quick Twitter search showed that, yes, there was a widespread problem that had started only a few minutes prior, and already there was a trending hashtag.

This example shows what's great about Twitter. It is an immensely powerful platform for creating instant virtual communities sharing information and opinion around a topic of common interest. The Twitter community as a group was able to do a better job than the service provider itself of informing users that there was a problem with the service.  I and the other storage service users -- at least the ones also on Twitter -- had formed an impromptu global network of monitors, watching the service from hundreds of thousands of access points. Together could confirm for each other that there was a service-wide outage.

#problemsolved?
Well, not really. Yes, I could see a number of people on Twitter reporting that they couldn't access the service, but this was all anecdotal information (along with a fair amount of opinion). I had no idea who these other users were or where they were located. For all I knew we might all be customers of the same internet service provider and maybe the problem was there and not with the storage service itself. In addition, while I could go to Twitter to confirm that I wasn't the only one experiencing an outage -- even as the service provider's status dashboard said everything was okay -- I was still searching for evidence after the fact. There was no practical way for me to be notified proactively, nor was I able to reliably see service performance degrading prior to the outage.

Herein lies the problem for manufacturers, or any organization, looking to leverage SaaS applications -- particularly mission critical email, collaboration, and document storage -- as part of their IT infrastructure. While it may be okay for me to use Twitter to monitor Hulu, you obviously can't operate a business this way. Organizations need the same level of visibility and troubleshooting capability for SaaS apps that they've come to rely on for traditional on-premise applications. This includes:

  • Proactive issue detection and alerting
  • Quantitative data on application performance
  • Ability to accurately measure service level attainment v. target goals
  • Ability to identify problem sources so the time to isolate and fix is minimized

That last one is particularly tricky for SaaS since most of the datacenter and network infrastructure is outside organizations' IT perimeters. You can't directly see or touch the server or network equipment and neither can your traditional monitoring and management tools. It's not surprising, then, that we often hear from IT admins that they have had to resort to using Twitter because otherwise they are flying completely blind. It's not enough, but at least it's something.

#saasvisibility
Despite its shortcomings, there is a lot to be said for the "power of the crowd" that is so fundamental to Twitter. What if we could take that same model and use it to proactively monitor our SaaS applications?  First, it would require some type of active monitoring behind your firewall at the locations where users access their SaaS applications. These "sensors" could act like Twitter users, constantly running transactions against the service and collecting data on transaction and network node performance. They would also allow you to proactively detect and notify an IT Admin of any outages or performance anomalies BEFORE they impact your users.

Then, what if we could collect and share real-time performance data from those sensors (yours as well as other users' sensors) into a global database maintained as part of your cloud service. You'd then be able to access this data to gain visibility into the health of the complete service delivery chain between you and the SaaS provider. For example, you could:

  • View current status, alerts, network statistics, and performance trends for one or more of your own sensors to determine if you have service issues affecting a particular location or subnet, so you can point and fix faults in your own infrastructure and get users back online quickly
  • Analyze your sensor data with the rest of the crowd to determine whether service issues are systemic to the application provider or the result of downstream internet service provider problems; you may not be able to fix these directly, but with this information you would know which service provider to call and could provide them with details to speed their time to resolution
  • Confirm exactly what service levels you are getting from your application service providers, with detailed outage data needed both for internal reporting and for provider service level guaranty refund requests

The goal of every IT shop is to keep their application users online and happy. But with SaaS, that's more difficult to do because administrators do not have the same visibility that they do with on-premise applications. We, as a community, need to come up with ways to change that. Taking a cue from Twitter, and leveraging the crowd - seems like a great place to start.

More Stories By Patrick Carey

Patrick Carey is vice president of product management and marketing for Exoprise, a provider of cloud-based monitoring and enablement solutions for Software-as-a-Service (SaaS) applications. He spends his free time thinking about how companies can get to the cloud faster and stay there longer.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@MicroservicesExpo Stories
19th Cloud Expo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterpri...
Sharding has become a popular means of achieving scalability in application architectures in which read/write data separation is not only possible, but desirable to achieve new heights of concurrency. The premise is that by splitting up read and write duties, it is possible to get better overall performance at the cost of a slight delay in consistency. That is, it takes a bit of time to replicate changes initiated by a "write" to the read-only master database. It's eventually consistent, and it'...
The burgeoning trends around DevOps are translating into new types of IT infrastructure that both developers and operators can take advantage of. The next BriefingsDirect Voice of the Customer thought leadership discussion focuses on the burgeoning trends around DevOps and how that’s translating into new types of IT infrastructure that both developers and operators can take advantage of.
With so much going on in this space you could be forgiven for thinking you were always working with yesterday’s technologies. So much change, so quickly. What do you do if you have to build a solution from the ground up that is expected to live in the field for at least 5-10 years? This is the challenge we faced when we looked to refresh our existing 10-year-old custom hardware stack to measure the fullness of trash cans and compactors.
The emerging Internet of Everything creates tremendous new opportunities for customer engagement and business model innovation. However, enterprises must overcome a number of critical challenges to bring these new solutions to market. In his session at @ThingsExpo, Michael Martin, CTO/CIO at nfrastructure, outlined these key challenges and recommended approaches for overcoming them to achieve speed and agility in the design, development and implementation of Internet of Everything solutions wi...
DevOps at Cloud Expo, taking place Nov 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 19th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The widespread success of cloud computing is driving the DevOps revolution in enterprise IT. Now as never before, development teams must communicate and collaborate in a dynamic, 24/7/365 environment. There is no time to wait for long dev...
Thomas Bitman of Gartner wrote a blog post last year about why OpenStack projects fail. In that article, he outlined three particular metrics which together cause 60% of OpenStack projects to fall short of expectations: Wrong people (31% of failures): a successful cloud needs commitment both from the operations team as well as from "anchor" tenants. Wrong processes (19% of failures): a successful cloud automates across silos in the software development lifecycle, not just within silos.
A company’s collection of online systems is like a delicate ecosystem – all components must integrate with and complement each other, and one single malfunction in any of them can bring the entire system to a screeching halt. That’s why, when monitoring and analyzing the health of your online systems, you need a broad arsenal of different tools for your different needs. In addition to a wide-angle lens that provides a snapshot of the overall health of your system, you must also have precise, ...
Using new techniques of information modeling, indexing, and processing, new cloud-based systems can support cloud-based workloads previously not possible for high-throughput insurance, banking, and case-based applications. In his session at 18th Cloud Expo, John Newton, CTO, Founder and Chairman of Alfresco, described how to scale cloud-based content management repositories to store, manage, and retrieve billions of documents and related information with fast and linear scalability. He addres...
The following fictional case study is a composite of actual horror stories I’ve heard over the years. Unfortunately, this scenario often occurs when in-house integration teams take on the complexities of DevOps and ALM integration with an enterprise service bus (ESB) or custom integration. It is written from the perspective of an enterprise architect tasked with leading an organization’s effort to adopt Agile to become more competitive. The company has turned to Scaled Agile Framework (SAFe) as ...
Monitoring of Docker environments is challenging. Why? Because each container typically runs a single process, has its own environment, utilizes virtual networks, or has various methods of managing storage. Traditional monitoring solutions take metrics from each server and applications they run. These servers and applications running on them are typically very static, with very long uptimes. Docker deployments are different: a set of containers may run many applications, all sharing the resource...
Internet of @ThingsExpo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 19th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devices - comp...
It's been a busy time for tech's ongoing infatuation with containers. Amazon just announced EC2 Container Registry to simply container management. The new Azure container service taps into Microsoft's partnership with Docker and Mesosphere. You know when there's a standard for containers on the table there's money on the table, too. Everyone is talking containers because they reduce a ton of development-related challenges and make it much easier to move across production and testing environm...
Cloud Expo 2016 New York at the Javits Center New York was characterized by increased attendance and a new focus on operations. These were both encouraging signs for all involved in Cloud Computing and all that it touches. As Conference Chair, I work with the Cloud Expo team to structure three keynotes, numerous general sessions, and more than 150 breakout sessions along 10 tracks. Our job is to balance the state of enterprise IT today with the trends that will be commonplace tomorrow. Mobile...
The 19th International Cloud Expo has announced that its Call for Papers is open. Cloud Expo, to be held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, brings together Cloud Computing, Big Data, Internet of Things, DevOps, Digital Transformation, Microservices and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportuni...
SYS-CON Events announced today that Venafi, the Immune System for the Internet™ and the leading provider of Next Generation Trust Protection, will exhibit at @DevOpsSummit at 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Venafi is the Immune System for the Internet™ that protects the foundation of all cybersecurity – cryptographic keys and digital certificates – so they can’t be misused by bad guys in attacks...
As the world moves toward more DevOps and Microservices, application deployment to the cloud ought to become a lot simpler. The Microservices architecture, which is the basis of many new age distributed systems such as OpenStack, NetFlix and so on, is at the heart of Cloud Foundry - a complete developer-oriented Platform as a Service (PaaS) that is IaaS agnostic and supports vCloud, OpenStack and AWS. Serverless computing is revolutionizing computing. In his session at 19th Cloud Expo, Raghav...
DevOps at Cloud Expo – being held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA – announces that its Call for Papers is open. Born out of proven success in agile development, cloud computing, and process automation, DevOps is a macro trend you cannot afford to miss. From showcase success stories from early adopters and web-scale businesses, DevOps is expanding to organizations of all sizes, including the world's largest enterprises – and delivering real results. Am...

Modern organizations face great challenges as they embrace innovation and integrate new tools and services. They begin to mature and move away from the complacency of maintaining traditional technologies and systems that only solve individual, siloed problems and work “well enough.” In order to build...

The post Gearing up for Digital Transformation appeared first on Aug. 25, 2016 12:15 PM EDT  Reads: 1,390

SYS-CON Events announced today that eCube Systems, a leading provider of middleware modernization, integration, and management solutions, will exhibit at @DevOpsSummit at 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. eCube Systems offers a family of middleware evolution products and services that maximize return on technology investment by leveraging existing technical equity to meet evolving business needs. ...