| By David Abramowski | Article Rating: |
|
| March 10, 2009 02:42 AM EDT | Reads: |
8,081 |
Cloud computing is a rather powerful tool that allows even the smallest of businesses to provide an enterprise class environment for web applications. In a nutshell, the cloud is nothing more than the ability to rent computer services on demand from a 3rd party provider. At MioWorks.com we use Amazon Web Services, but there are several other services out there for you to explore.
Mastering the cloud takes a bit of work, a dash of experience and an openness to learn from others. But once you do master it, the benefits are tremendous. You’ll never have to order another server or rent a rack in a data center. You’ll be able to fluidly control your environment by increasing and decreasing the services you need on the fly, saving time and money.
This power, flexibility and potential demands that you pay attention to the details. You must anticipate that the cloud can have hiccups and that as quickly as a server comes to life, that server can disappear. In previous blog posts I’ve already talked about the importance of backups and recovery drills, but let’s take a step back. Today let’s talk about monitoring and how important it is to your survival.
Ok I’ll bite, why is monitoring so important
Let me sum this up in a single sentence: Monitoring can be the difference between “whew that was close” and “holy s$%t we are down”. I lied - I need another sentence… Monitoring can also be the difference between a five minute outage and a five hour outage.
What to monitor
Every web based application environment in the cloud is a jigsaw puzzle of pieces. At the core you have your virtual hardware followed by your operating system. Each of your servers is then configured differently depending on its specific duty. You may have application servers, web servers, search servers, database servers and the list goes on. Each of these servers needs to be monitored from several points of view - both internally and externally.
Internal Monitoring
The big question isn’t “Is the server running?” it should be “Is the server and all of its pieces running correctly? Each virtual server in your setup is a maze of processes, files, directories and file systems. At any given time a hiccup can occur within this delicate environment that will eventually disrupt the end user’s ability to use your service. In our environment we use monit and munin (two open source tools) on the inside to provide us with critical monitoring, recovery & trending capabilities.
Monit provides systems monitoring and error recovery for our Unix systems. In our environment we have configured monit to watch dozens of potential failure points. Monit can start a process if it is not running and can kill/restart a process if it takes too many resources. Monit is also configurable as an intrusion detection system by watching for changes in files, directories and file systems. By spending a little time learning and using Monit your system administrator has a great tool to keep a constant eye on all the pieces of the puzzle.
In addition to the direct monitoring and error recovery system, we also like to see the bigger picture. We use Munin to aggregate information across our server pool. Munin provides a graphical view that allows your team to quickly see what’s different from yesterday. You can quickly determine your resource utlization and plan in ADVANCE any increase of capacity.
From the outside
Keeping track of all the pieces inside the cloud is very important, but you also need to know how your environment in the cloud is performing to the outside world. There are more external monitoring services out there than I can count. But I’ll tell you who we use. Our favorite at the moment is monitis.com. We like them because starting at just $10/month you get on demand fault & performance monitoring for your environment. This external watchdog system helps to keep everyone informed if/when the cloud is having issues. It also provides us with important statistics on response time and application performance that we use to determine how to adjust our infrastructure.
Continuous improvement
Your monitoring program must become a living, breathing element of your systems administration. As new problems arise or potential problems are identified, the monitoring system must be adjusted to be proactive. The good news is that the more you adjust your monitoring and error recovery system, the less you’ll be surprised in the future. It takes discipline to post mortem each problem and determine how to proactively detect for it in the future. And this discipline will distinguish your application in the frenzy of the cloud.
Real world results of a good monitoring program
In the real world your monitoring system can be the difference between keeping your systems alive and thriving OR having unhappy customers and missed SLAs. It can help you pinpoint exactly what went wrong and reduce the time it takes for the first responders to identify and solve the issue. There are lots of solutions in the marketplace including commerical and open source alternatives. It may seem overwhelming at first, but once you start the process and improve little by little, you’ll be amazed at the positive impact your monitoring program will have on your environment stability and your ability to get some sleep.
Published March 10, 2009 Reads 8,081
Copyright © 2009 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By David Abramowski
David Abramowski is one of the founders of zoomstra.com as well as a product strategy & marketing consultant. David's background as a technologist and a product marketing manager enables him to look at today's solutions from the perspective of the user. David's career spans early stage startups including Axent Technologies, Vignette and Morph Labs as well as enterprise mainstays such as Symantec. You can also follow David on twitter @dabramowski
- Big Data in Telecom: The Need for Analytics
- Patterns for Building High Performance Applications
- What Motivates Open Standards in the Cloud?
- What to Expect in 2012: Cloud Computing and Open Source Software
- Will PaaS Finally Bring Open Source Love to the Enterprise?
- Ten Hot Trends in Cloud Data for 2012
- Cross-Platform Mobile Website Development – a Tool Comparison
- Oracle Disaster Recovery Site Hosted by Amazon Cloud
- Three Buzzwords That Every CIO Hears but One They Should Listen To
- Write Once Run Anywhere or Cross Platform Mobile Development Tools
- Big Data Highlights from McKinsey: Part 2 - Production, Supply, and Logistics
- Microsoft’s New Cloudware Could Cast a Shadow over VMware
- The Future of Cloud Computing: Industry Predictions for 2012
- Gartner Hype Cycle for Emerging Technologies 2011
- Book Excerpt: Introducing HTML5
- Adobe Sends Flex to the Apache Foundation
- Big Data in Telecom: The Need for Analytics
- Book Excerpt: Java Application Profiling Tips and Tricks
- i-Technology in 2012: Five Industry Predictions
- Patterns for Building High Performance Applications
- Microsoft Tries Hadoop on Azure
- The Next Web Architecture
- Cloud Computing: A Comparison of Computing Models
- Amazon to Fix Some Kindle Fire Problems
- The i-Technology Right Stuff
- The Top 150 Players in Cloud Computing
- Who Are The All-Time Heroes of i-Technology?
- Where Are RIA Technologies Headed in 2008?
- Get the Message
- ESB Myth Busters: 10 Enterprise Service Bus Myths Debunked
- i-Technology Viewpoint: Is Web 2.0 the Global SOA?
- i-Technology Viewpoint: Thinking Outside the VC Box
- i-Technology Viewpoint: When to Leave Your First IT Job
- SOA Web Services Edge Conference Coverage on SYS-CON.TV
- SYS-CON.TV's "SOA Web Services" and "Enterprise Open Source" Programs To Air in December
- Five Reasons Why Web 2.0 Matters




















