Welcome!

Microservices Expo Authors: Pat Romanski, Zakia Bouachraoui, Elizabeth White, Liz McMillan, Yeshim Deniz

Related Topics: @DevOpsSummit, Java IoT, Microservices Expo, Linux Containers, Containers Expo Blog, @CloudExpo

@DevOpsSummit: Article

DNS Experience Tests | @CloudExpo @CatchPoint #APM #DevOps

When monitoring and analyzing the health of your online systems, you need a broad arsenal of different tools

DNS Experience Tests: A Key Cog in the Online Ecosystem
By Mehdi Daoudi

A company’s collection of online systems is like a delicate ecosystem – all components must integrate with and complement each other, and one single malfunction in any of them can bring the entire system to a screeching halt.

That’s why, when monitoring and analyzing the health of your online systems, you need a broad arsenal of different tools for your different needs. In addition to a wide-angle lens that provides a snapshot of the overall health of your system, you must also have precise, scalpel-like tools that can isolate and analyze all of those different components (DNS, CDNs, internal and external servers, third-party tags, etc.).

Catchpoint was designed to be that exact kind of precision tool. When evaluating the health of your systems, a simple availability test is useful, but it only a basic binary state: you’re either up or you’re down. Figuring out why you are down, or why your customers are waiting longer than they should for a page to load, is a wholly different matter, and requires precise diagnostic and analytical capabilities in order to provide you with actionable data.

Of these many different parts that make up an online system, DNS is perhaps the most important. It’s the very first interaction that a customer has with an online brand, and therefore having a rapid DNS lookup and resolution process is vital to maintaining an exceptional customer experience. Yet to properly assess your DNS health, you need that aforementioned scalpel, not a broad sword. This is why Catchpoint has maintained close relationships with DNS solution providers such as NS1, keeping open channels of communication in order to create the most precise tools possible.

This type of relationship has manifested itself positively for NS1, which uses Catchpoint tests to ensure that they are providing their customers with the best possible DNS resolution times, but it’s equally important for clients of those providers to monitor their own DNS performance to detect any issues that the vendor might be missing.

One way to drill down and gain additional insight into a DNS performance issue is through the different types of DNS monitors that Catchpoint offers. To play out a scenario, let’s say that you catch a DNS resolution problem in a basic browser test:

Chrome test

While this data shows that your users are suffering from bad experience due to DNS latency, you have no way of knowing where the latency occurred in the DNS resolution process. To gain this information, you need a DNS monitoring solution that shows performance and error data for all the different steps and servers in the DNS chain. Additionally, you can keep an eye on specific types of records (Answers, Authoritative Name Servers, or Additional Records) from the DNS query, which allows you to detect issues such as wrong TTLs, DNS Cache Poisoning, misconfigurations, etc.

This is imperative when it comes to detecting a third party DNS vendor’s errors, because most organizations rely on external DNS registrars and vendors, but have little visibility in their performance and availability.

8-3 blog image-2_705_left[1]

Getting back to our hypothetical problem, once we run our DNS Experience tests, we get a result that looks like this:

DNS Experience Test

In this test, which hits a multitude of different name servers in succession, we see intermittent spikes in performance (blue line chart) and drops in availability (green line chart), which tells us that the problem is isolated to specific name servers as opposed to the whole lot of them. Therefore, we need to run DNS Direct tests to isolate each of those name servers:

dnsDirect

Now the exact source of the problem becomes clear. There’s one specific name server which has failed multiple times in the test timeframe, which means that we now have actionable data to work with. The DNS provider, if it hasn’t already located the source of the problem through the same process, can be made aware of the issue so that they can take that server offline while they fix the problem.

In addition to the advanced insight that specific monitors provide, one of Catchpoint’s strongest attributes is our global node coverage. As a global DNS provider, NS1 knows that the ability to test their servers from as many different locations as possible is imperative to understanding the full scope of end users’ DNS experience. NS1 uses Catchpoint nodes around the world to collect data, and then using Catchpoint’s Push/Pull APIs, they can input that data into any number of different tools in order to make it actionable.

Just like the ecosystem of different components that make up modern online systems, there is an ecosystem of tools of similar size and scope to keep an eye on all of those components. Catchpoint is a cog in that ecosystem; this is why it was designed to play nicely with other alerting, communication, and monitoring tools that IT Ops professionals regularly use. There are Catchpoint integrations in place with Slack, VictorOps, PagerDuty, Zapier, etc., and the APIs work with many other different tools, including (as NS1 themselves wrote about) OpenTDSB, an open source time series database.

The importance of getting precise, actionable data cannot be overstated when looking at the overall importance of digital performance analytics. By getting the most out of all the tools available to you and making sure that they complement and work well with each other can make all the difference in the health of your online systems.

The post DNS Experience Tests: A Key Cog in the Online Ecosystem appeared first on Catchpoint's Blog.

More Stories By Mehdi Daoudi

Catchpoint radically transforms the way businesses manage, monitor, and test the performance of online applications. Truly understand and improve user experience with clear visibility into complex, distributed online systems.

Founded in 2008 by four DoubleClick / Google executives with a passion for speed, reliability and overall better online experiences, Catchpoint has now become the most innovative provider of web performance testing and monitoring solutions. We are a team with expertise in designing, building, operating, scaling and monitoring highly transactional Internet services used by thousands of companies and impacting the experience of millions of users. Catchpoint is funded by top-tier venture capital firm, Battery Ventures, which has invested in category leaders such as Akamai, Omniture (Adobe Systems), Optimizely, Tealium, BazaarVoice, Marketo and many more.

Microservices Articles
Lori MacVittie is a subject matter expert on emerging technology responsible for outbound evangelism across F5's entire product suite. MacVittie has extensive development and technical architecture experience in both high-tech and enterprise organizations, in addition to network and systems administration expertise. Prior to joining F5, MacVittie was an award-winning technology editor at Network Computing Magazine where she evaluated and tested application-focused technologies including app secu...
When building large, cloud-based applications that operate at a high scale, it’s important to maintain a high availability and resilience to failures. In order to do that, you must be tolerant of failures, even in light of failures in other areas of your application. “Fly two mistakes high” is an old adage in the radio control airplane hobby. It means, fly high enough so that if you make a mistake, you can continue flying with room to still make mistakes. In his session at 18th Cloud Expo, Lee A...
In his general session at 19th Cloud Expo, Manish Dixit, VP of Product and Engineering at Dice, discussed how Dice leverages data insights and tools to help both tech professionals and recruiters better understand how skills relate to each other and which skills are in high demand using interactive visualizations and salary indicator tools to maximize earning potential. Manish Dixit is VP of Product and Engineering at Dice. As the leader of the Product, Engineering and Data Sciences team at D...
Containers and Kubernetes allow for code portability across on-premise VMs, bare metal, or multiple cloud provider environments. Yet, despite this portability promise, developers may include configuration and application definitions that constrain or even eliminate application portability. In this session we'll describe best practices for "configuration as code" in a Kubernetes environment. We will demonstrate how a properly constructed containerized app can be deployed to both Amazon and Azure ...
Modern software design has fundamentally changed how we manage applications, causing many to turn to containers as the new virtual machine for resource management. As container adoption grows beyond stateless applications to stateful workloads, the need for persistent storage is foundational - something customers routinely cite as a top pain point. In his session at @DevOpsSummit at 21st Cloud Expo, Bill Borsari, Head of Systems Engineering at Datera, explored how organizations can reap the bene...
Using new techniques of information modeling, indexing, and processing, new cloud-based systems can support cloud-based workloads previously not possible for high-throughput insurance, banking, and case-based applications. In his session at 18th Cloud Expo, John Newton, CTO, Founder and Chairman of Alfresco, described how to scale cloud-based content management repositories to store, manage, and retrieve billions of documents and related information with fast and linear scalability. He addresse...
The now mainstream platform changes stemming from the first Internet boom brought many changes but didn’t really change the basic relationship between servers and the applications running on them. In fact, that was sort of the point. In his session at 18th Cloud Expo, Gordon Haff, senior cloud strategy marketing and evangelism manager at Red Hat, will discuss how today’s workloads require a new model and a new platform for development and execution. The platform must handle a wide range of rec...
SYS-CON Events announced today that DatacenterDynamics has been named “Media Sponsor” of SYS-CON's 18th International Cloud Expo, which will take place on June 7–9, 2016, at the Javits Center in New York City, NY. DatacenterDynamics is a brand of DCD Group, a global B2B media and publishing company that develops products to help senior professionals in the world's most ICT dependent organizations make risk-based infrastructure and capacity decisions.
Discussions of cloud computing have evolved in recent years from a focus on specific types of cloud, to a world of hybrid cloud, and to a world dominated by the APIs that make today's multi-cloud environments and hybrid clouds possible. In this Power Panel at 17th Cloud Expo, moderated by Conference Chair Roger Strukhoff, panelists addressed the importance of customers being able to use the specific technologies they need, through environments and ecosystems that expose their APIs to make true ...
In his keynote at 19th Cloud Expo, Sheng Liang, co-founder and CEO of Rancher Labs, discussed the technological advances and new business opportunities created by the rapid adoption of containers. With the success of Amazon Web Services (AWS) and various open source technologies used to build private clouds, cloud computing has become an essential component of IT strategy. However, users continue to face challenges in implementing clouds, as older technologies evolve and newer ones like Docker c...