DNS Experience Tests: A Key Cog in the Online Ecosystem
By Mehdi Daoudi

A company’s collection of online systems is like a delicate ecosystem – all components must integrate with and complement each other, and one single malfunction in any of them can bring the entire system to a screeching halt.

That’s why, when monitoring and analyzing the health of your online systems, you need a broad arsenal of different tools for your different needs. In addition to a wide-angle lens that provides a snapshot of the overall health of your system, you must also have precise, scalpel-like tools that can isolate and analyze all of those different components (DNS, CDNs, internal and external servers, third-party tags, etc.).

Catchpoint was designed to be exactly that kind of precision tool. When evaluating the health of your systems, a simple availability test is useful, but it only reports a basic binary state: you’re either up or you’re down. Figuring out why you are down, or why your customers are waiting longer than they should for a page to load, is a wholly different matter, and requires precise diagnostic and analytical capabilities in order to provide you with actionable data.

Of the many different parts that make up an online system, DNS is perhaps the most important. It’s the very first interaction that a customer has with an online brand, and therefore having a rapid DNS lookup and resolution process is vital to maintaining an exceptional customer experience. Yet to properly assess your DNS health, you need that aforementioned scalpel, not a broadsword. This is why Catchpoint has maintained close relationships with DNS solution providers such as NS1, keeping open channels of communication in order to create the most precise tools possible.

This type of relationship has manifested itself positively for NS1, which uses Catchpoint tests to ensure that they are providing their customers with the best possible DNS resolution times, but it’s equally important for clients of those providers to monitor their own DNS performance to detect any issues that the vendor might be missing.

One way to drill down and gain additional insight into a DNS performance issue is through the different types of DNS monitors that Catchpoint offers. To play out a scenario, let’s say that you catch a DNS resolution problem in a basic browser test:

[Figure: Chrome test]

While this data shows that your users are suffering from a bad experience due to DNS latency, you have no way of knowing where in the DNS resolution process the latency occurred. To gain this information, you need a DNS monitoring solution that shows performance and error data for all the different steps and servers in the DNS chain. Additionally, you can keep an eye on specific types of records (Answers, Authoritative Name Servers, or Additional Records) from the DNS query, which allows you to detect issues such as wrong TTLs, DNS cache poisoning, misconfigurations, etc.
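As a rough illustration of the record-level checks described above, the sketch below compares the records returned in a DNS response against what you expect to be published. The `DnsRecord` type, the `audit_records` function, and the sample values are hypothetical and are not part of Catchpoint's product or API; a real monitor would populate the records from an actual query.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DnsRecord:
    """One record from a DNS response (hypothetical shape, for illustration)."""
    name: str
    rtype: str
    ttl: int
    value: str


def audit_records(records, expected_values, max_ttl):
    """Return human-readable findings for unexpected values or TTLs.

    expected_values maps (name, rtype) to the set of values you publish.
    max_ttl is the largest TTL (in seconds) you consider correct.
    """
    findings = []
    for rec in records:
        allowed = expected_values.get((rec.name, rec.rtype))
        if allowed is not None and rec.value not in allowed:
            # A value you never published may point to a misconfiguration
            # or a poisoned cache somewhere in the resolution chain.
            findings.append(f"{rec.name} {rec.rtype}: unexpected value {rec.value}")
        if rec.ttl > max_ttl:
            findings.append(f"{rec.name} {rec.rtype}: TTL {rec.ttl}s exceeds {max_ttl}s")
    return findings


if __name__ == "__main__":
    # Sample data uses documentation-reserved addresses, not real results.
    answers = [
        DnsRecord("www.example.com.", "A", 300, "192.0.2.10"),
        DnsRecord("www.example.com.", "A", 86400, "198.51.100.7"),
    ]
    expected = {("www.example.com.", "A"): {"192.0.2.10"}}
    for line in audit_records(answers, expected, max_ttl=3600):
        print(line)
```

Keeping the check as a pure function over already-collected records means the same logic can be applied to the Answer, Authority, or Additional sections without caring how the query was made.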

This is imperative when it comes to detecting a third-party DNS vendor’s errors, because most organizations rely on external DNS registrars and vendors but have little visibility into their performance and availability.

Getting back to our hypothetical problem, once we run our DNS Experience tests, we get a result that looks like this:

[Figure: DNS Experience Test]

In this test, which hits a multitude of different name servers in succession, we see intermittent spikes in performance (blue line chart) and drops in availability (green line chart), which tells us that the problem is isolated to specific name servers as opposed to the whole lot of them. Therefore, we need to run DNS Direct tests to isolate each of those name servers:

[Figure: DNS Direct test]

Now the exact source of the problem becomes clear. There’s one specific name server which has failed multiple times in the test timeframe, which means that we now have actionable data to work with. The DNS provider, if it hasn’t already located the source of the problem through the same process, can be made aware of the issue so that they can take that server offline while they fix the problem.
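The drill-down from an aggregate result to a single failing server amounts to grouping test samples by name server. Here is a minimal sketch, assuming each sample is a hypothetical `(name_server, succeeded, latency_ms)` tuple exported from your monitoring data. The tuple layout, the function name, and the 99% threshold are illustrative assumptions, not Catchpoint's data model.

```python
from collections import defaultdict


def failing_name_servers(samples, min_availability=0.99):
    """Group samples by name server and return those below the threshold.

    samples: iterable of (name_server, succeeded, latency_ms) tuples.
    Returns {name_server: availability}, where availability is the
    fraction of successful queries observed for that server.
    """
    totals = defaultdict(int)
    successes = defaultdict(int)
    for server, succeeded, _latency_ms in samples:
        totals[server] += 1
        if succeeded:
            successes[server] += 1
    return {
        server: successes[server] / totals[server]
        for server in totals
        if successes[server] / totals[server] < min_availability
    }


if __name__ == "__main__":
    # Made-up samples: only ns3 fails in this window.
    samples = [
        ("ns1.example.net", True, 21.0),
        ("ns2.example.net", True, 24.5),
        ("ns3.example.net", False, 0.0),
        ("ns1.example.net", True, 19.8),
        ("ns3.example.net", True, 310.2),
        ("ns3.example.net", False, 0.0),
    ]
    for server, availability in failing_name_servers(samples).items():
        print(f"{server}: {availability:.0%} available")
```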

In addition to the advanced insight that specific monitors provide, one of Catchpoint’s strongest attributes is our global node coverage. As a global DNS provider, NS1 knows that the ability to test their servers from as many different locations as possible is imperative to understanding the full scope of end users’ DNS experience. NS1 uses Catchpoint nodes around the world to collect data and then, using Catchpoint’s Push/Pull APIs, feeds that data into any number of different tools in order to make it actionable.

Just like the ecosystem of different components that make up modern online systems, there is an ecosystem of tools of similar size and scope to keep an eye on all of those components. Catchpoint is a cog in that ecosystem; this is why it was designed to play nicely with other alerting, communication, and monitoring tools that IT Ops professionals regularly use. There are Catchpoint integrations in place with Slack, VictorOps, PagerDuty, Zapier, etc., and the APIs work with many other tools, including (as NS1 themselves wrote about) OpenTSDB, an open-source time series database.
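To make the idea of feeding test data into a time series database concrete, here is a hedged sketch that reshapes one DNS test result into the JSON datapoint format accepted by OpenTSDB's HTTP `/api/put` endpoint (`metric`, `timestamp`, `value`, `tags`). The input dictionary keys, the metric name `dns.resolution.ms`, and the function name are assumptions made for illustration; the actual payload delivered by Catchpoint's APIs is not described here and will differ.

```python
import json
import re

# Conservative filter: keep letters, digits, and "-_./" in tag values,
# which OpenTSDB accepts; everything else is replaced with "_".
_DISALLOWED = re.compile(r"[^a-zA-Z0-9\-_./]")


def to_opentsdb_datapoint(result):
    """Convert one test result (hypothetical dict shape) to an OpenTSDB datapoint."""
    return {
        "metric": "dns.resolution.ms",  # metric name is an assumption
        "timestamp": int(result["timestamp"]),  # Unix epoch seconds
        "value": float(result["dns_ms"]),
        "tags": {
            "node": _DISALLOWED.sub("_", result["node"]),
            "nameserver": _DISALLOWED.sub("_", result["name_server"]),
        },
    }


if __name__ == "__main__":
    # Made-up sample result; field names are illustrative only.
    result = {
        "timestamp": 1500000000,
        "dns_ms": 42,
        "node": "New York",
        "name_server": "ns1.example.net",
    }
    # This JSON array is the kind of body you would POST to /api/put.
    print(json.dumps([to_opentsdb_datapoint(result)], indent=2))
```

Tagging each datapoint with the testing node and the name server keeps the per-server and per-location breakdowns available when the data is queried later.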

The importance of getting precise, actionable data cannot be overstated when it comes to digital performance analytics. Getting the most out of all the tools available to you, and making sure that they complement and work well with each other, can make all the difference in the health of your online systems.

The post DNS Experience Tests: A Key Cog in the Online Ecosystem appeared first on Catchpoint's Blog.

More Stories By Mehdi Daoudi

Catchpoint radically transforms the way businesses manage, monitor, and test the performance of online applications. Truly understand and improve user experience with clear visibility into complex, distributed online systems.

Founded in 2008 by four DoubleClick / Google executives with a passion for speed, reliability and overall better online experiences, Catchpoint has now become the most innovative provider of web performance testing and monitoring solutions. We are a team with expertise in designing, building, operating, scaling and monitoring highly transactional Internet services used by thousands of companies and impacting the experience of millions of users. Catchpoint is funded by top-tier venture capital firm, Battery Ventures, which has invested in category leaders such as Akamai, Omniture (Adobe Systems), Optimizely, Tealium, BazaarVoice, Marketo and many more.
