Welcome!

Microservices Expo Authors: Elizabeth White, Pat Romanski, Carmen Gonzalez, Liz McMillan, Sematext Blog

Related Topics: @CloudExpo, Java IoT, Microservices Expo, Microsoft Cloud, Agile Computing, Release Management

@CloudExpo: Article

Why Your Analytics Should Be Hosted

It’s become increasingly clear that Big Data is transforming the business landscape

It's become increasingly clear that Big Data, and the tools for manipulating, visualizing and analyzing it, are transforming the business landscape. McKinsey released a report in 2011 that projects 40 percent growth in global data generated per year. This is all well and good, but more and more companies are finding that their toolbox for dealing with all of this data is antiquated and confusing.

Indeed, 58 percent of enterprise decision makers surveyed in March 2012 by DataXu felt they lacked the skills and technology required for marketing analytics. Marketers should be chomping at the bit to fruitfully employ the data they have. Successful marketing requires proper segmentation of the customer base to create more targeted campaigns. Real-time insight into the performance of existing campaigns and a clear grasp of where to redirect efforts can also turn a campaign that would have failed into a success. These are the promises made by the drivers of the current "data movement." The unfortunate reality, however, is that the accumulation of data just adds to the costs of an organization as it struggles to merely store the incoming torrent of data, let alone harness it and allow non-technical individuals to explore and understand it.

Luckily, this isn't the first time that industries have experienced this type of problem. The data movement is just like any other one that starts out as a niche interest to a select few people, eventually growing into a commoditized marketplace that competes on usability and ease of access.

Of all metaphors to pick for this process, the restaurant is an apt one. Cooking is something everyone can do. Mix up some batter, put it on a hot skillet, and you'll get pancakes. Add some eggs and a glass of orange juice and you've either got your brain on drugs or a complete breakfast. You can also go to your local IHOP and order the same thing. If you make it yourself, you know everything that's in it and can control the various aspects of the meal. But you also have to deal with acquiring the ingredients, having the facilities to cook, and doing the cleanup. If you go to a restaurant, all you have to do is show up, tell them what you want, and pay.

Similarly, the analytics space has two types of offerings. You can choose to do it yourself or you can use a hosted service to take care of things for you. As with cooking versus going to a restaurant, there are costs and benefits associated with both, but my biased opinion is that a hosted solution is the best choice for tackling the current influx of data.

Economies of Scale
Restaurants provide the benefits of economies of scale to their patrons, allowing customers to consume and enjoy foods that they normally wouldn't be able to at home. High-quality tuna is rather expensive and generally comes in quantities that no individual person could ever consume before it goes bad. Yet, you can go to a sushi restaurant and get various parts of the fish. This is economies of scale in action. The restaurant can afford to put down a significant sum of money to acquire the whole tuna and resell it in pieces to its patrons.

Hosted analytics presents a similar case. A hosted analytics provider is able to pay more money upfront for hardware than any one of its customers would. The reality of data processing is that there are physical limitations to the amount of data a computer can process given a certain amount of time. This problem can only be overcome with more and better hardware.

Because it serves multiple users, a hosted system is actually incentivized to provision enough machines to answer questions quickly. The compute resources are only required for the duration of a query against the system. The faster a query gets answered, the quicker those resources are freed up to answer someone else's query. Responding to queries fast enough to free up resources for the next query is actually the only way to achieve high levels of concurrency. Because the hosted provider is building their business on the idea that multiple customers will share the same infrastructure, they have to support more than just one query at a time and thus are naturally forced to provide their users with a faster querying experience. Economies of scale work to the users' advantage.

Integration of Diverse Data Streams
Another benefit of hosted analytics systems is that they can provide overnight integration with other data sets, both public and private. Taking this back to the restaurant analogy, restaurants add new items to their menu on a regular basis. If they find a supplier that will give them Alaskan king crab for the same price as a lesser form of crab, patrons will all of a sudden start eating better crab without having even known it was coming. The hosted analytics case is similar in that users can take advantage of new data sets that the provider has integrated.

Consider the following scenario. A marketer might normally have access to customer profile and engagement information through their analytics system. Companies like Amazon Web Services offer up data sets from the human genome, the U.S. Census Bureau, and Wikipedia. If a hosted analytics company integrates a public data set like one of these, they can then expose it to all of their clients. This means that if there are 1,000 customers of the hosted offering and only one of them asks for the integration of the public data set, 999 customers get that same integration overnight. All of the participants reap the benefits of having more data sets available. Through the process of overlaying various data streams, marketers can learn more about their customers and their behavior in order to better target their campaigns. This is just one more benefit hosted offerings provide to ensure that companies can maximally leverage the value of their data.

Useful Analytics
Analytics are only good if they are understandable and actionable, just as restaurants are only good if their food is edible and delicious. There are thousands of ingredients that could be mixed in with fried eggs, but some will taste delicious and some will just result in an inedible concoction. As patrons of many restaurants, we often come to a consensus on what various restaurants do well, personal taste notwithstanding. This knowledge can be employed to eat only the best meals. The same mechanism of collective understanding will play itself out in the hosted analytics space.

Any company that provides hosted analytics to a variety of businesses wants to give its customers only the most useful analytical metrics and functionalities. Marketers may not have the specific training to pinpoint exactly which analysis methods to leverage for maximal effect. That's where the multi-tenant properties of hosted analytics work to your benefit. The hosted analytics provider will be sensitive to which of their tools are providing the most value across their entire customer base. In other words, the individual customers all come together to form a collaborative filter to ensure that the less useful analytics features will be cast aside in favor of those that yield valuable insights. As with the integration of public data sets, this filtering mechanism ensures that benefits cascade throughout the entire system of analytics users. Even for features that do not seem to be immediately relevant to your company's success, as a customer of a hosted provider you can rest assured that once your company turns that corner in its business growth, the hosted provider already knows the kinds of analysis you'll find yourself needing and has the tools available. Newcomers to the platform are thus quickly able to reap the benefits of an analytical toolset that has been vetted by the crowd.

In the past few years, Big Data has exploded in importance. Marketers must learn how to take away useful, actionable insights from the mass of data at their hands in order to create a competitive advantage for their companies. Hosted analytics systems will truly prove themselves to be a staple choice for deciphering the increasing amounts of data that companies have to deal with, just as restaurants are a ubiquitous presence in our current lives.

In closing, we can stretch the restaurant metaphor just a little bit more. In both a restaurant and a home kitchen, there's an able cook who knows how to turn raw ingredients into a delicious meal. Similarly, the future still includes analysts who understand the intricacies of your business. You will, however, achieve much more efficient use of your analyst's time by leveraging the benefits of a hosted analytics provider: improved performance, "free" integration of external data sets, and collaborative vetting of the analytical feature set.

More Stories By Eric Tschetter

Eric Tschetter is the lead architect at Metamarkets, a leader in big data analytics for web-scale companies. Follow Metamarkets on Twitter @Metamarkets and learn more at www.metamarkets.com.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@MicroservicesExpo Stories
More and more companies are looking to microservices as an architectural pattern for breaking apart applications into more manageable pieces so that agile teams can deliver new features quicker and more effectively. What this pattern has done more than anything to date is spark organizational transformations, setting the foundation for future application development. In practice, however, there are a number of considerations to make that go beyond simply “build, ship, and run,” which changes how...
Without lifecycle traceability and visibility across the tool chain, stakeholders from Planning-to-Ops have limited insight and answers to who, what, when, why and how across the DevOps lifecycle. This impacts the ability to deliver high quality software at the needed velocity to drive positive business outcomes. In his general session at @DevOpsSummit at 19th Cloud Expo, Phil Hombledal, Solution Architect at CollabNet, discussed how customers are able to achieve a level of transparency that e...
Between 2005 and 2020, data volumes will grow by a factor of 300 – enough data to stack CDs from the earth to the moon 162 times. This has come to be known as the ‘big data’ phenomenon. Unfortunately, traditional approaches to handling, storing and analyzing data aren’t adequate at this scale: they’re too costly, slow and physically cumbersome to keep up. Fortunately, in response a new breed of technology has emerged that is cheaper, faster and more scalable. Yet, in meeting these new needs they...
@DevOpsSummit taking place June 6-8, 2017 at Javits Center, New York City, is co-located with the 20th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. @DevOpsSummit at Cloud Expo New York Call for Papers is now open.
In his session at 19th Cloud Expo, Claude Remillard, Principal Program Manager in Developer Division at Microsoft, contrasted how his team used config as code and immutable patterns for continuous delivery of microservices and apps to the cloud. He showed how the immutable patterns helps developers do away with most of the complexity of config as code-enabling scenarios such as rollback, zero downtime upgrades with far greater simplicity. He also demoed building immutable pipelines in the cloud ...
Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more business becomes digital the more stakeholders are interested in this data including how it relates to business. Some of these people have never used a monitoring tool before. They have a question on their mind like “How is my application doing” but no id...
In IT, we sometimes coin terms for things before we know exactly what they are and how they’ll be used. The resulting terms may capture a common set of aspirations and goals – as “cloud” did broadly for on-demand, self-service, and flexible computing. But such a term can also lump together diverse and even competing practices, technologies, and priorities to the point where important distinctions are glossed over and lost.
Information technology is an industry that has always experienced change, and the dramatic change sweeping across the industry today could not be truthfully described as the first time we've seen such widespread change impacting customer investments. However, the rate of the change, and the potential outcomes from today's digital transformation has the distinct potential to separate the industry into two camps: Organizations that see the change coming, embrace it, and successful leverage it; and...
Monitoring of Docker environments is challenging. Why? Because each container typically runs a single process, has its own environment, utilizes virtual networks, or has various methods of managing storage. Traditional monitoring solutions take metrics from each server and applications they run. These servers and applications running on them are typically very static, with very long uptimes. Docker deployments are different: a set of containers may run many applications, all sharing the resource...
Join Impiger for their featured webinar: ‘Cloud Computing: A Roadmap to Modern Software Delivery’ on November 10, 2016, at 12:00 pm CST. Very few companies have not experienced some impact to their IT delivery due to the evolution of cloud computing. This webinar is not about deciding whether you should entertain moving some or all of your IT to the cloud, but rather, a detailed look under the hood to help IT professionals understand how cloud adoption has evolved and what trends will impact th...
Without lifecycle traceability and visibility across the tool chain, stakeholders from Planning-to-Ops have limited insight and answers to who, what, when, why and how across the DevOps lifecycle. This impacts the ability to deliver high quality software at the needed velocity to drive positive business outcomes. In his session at @DevOpsSummit 19th Cloud Expo, Eric Robertson, General Manager at CollabNet, showed how customers are able to achieve a level of transparency that enables everyone fro...
Internet of @ThingsExpo, taking place June 6-8, 2017 at the Javits Center in New York City, New York, is co-located with the 20th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. @ThingsExpo New York Call for Papers is now open.
The 20th International Cloud Expo has announced that its Call for Papers is open. Cloud Expo, to be held June 6-8, 2017, at the Javits Center in New York City, brings together Cloud Computing, Big Data, Internet of Things, DevOps, Containers, Microservices and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportunity. Submit your speaking proposal ...
You have great SaaS business app ideas. You want to turn your idea quickly into a functional and engaging proof of concept. You need to be able to modify it to meet customers' needs, and you need to deliver a complete and secure SaaS application. How could you achieve all the above and yet avoid unforeseen IT requirements that add unnecessary cost and complexity? You also want your app to be responsive in any device at any time. In his session at 19th Cloud Expo, Mark Allen, General Manager of...
"Dice has been around for the last 20 years. We have been helping tech professionals find new jobs and career opportunities," explained Manish Dixit, VP of Product and Engineering at Dice, in this SYS-CON.tv interview at 19th Cloud Expo, held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA.
Rapid innovation, changing business landscapes, and new IT demands force businesses to make changes quickly. In the eyes of many, containers are at the brink of becoming a pervasive technology in enterprise IT to accelerate application delivery. In this presentation, attendees learned about the: The transformation of IT to a DevOps, microservices, and container-based architecture What are containers and how DevOps practices can operate in a container-based environment A demonstration of how ...
Application transformation and DevOps practices are two sides of the same coin. Enterprises that want to capture value faster, need to deliver value faster – time value of money principle. To do that enterprises need to build cloud-native apps as microservices by empowering teams to build, ship, and run in production. In his session at @DevOpsSummit at 19th Cloud Expo, Neil Gehani, senior product manager at HPE, discussed what every business should plan for how to structure their teams to delive...
As we enter the final week before the 19th International Cloud Expo | @ThingsExpo in Santa Clara, CA, it's time for me to reflect on six big topics that will be important during the show. Hybrid Cloud This general-purpose term seems to provide a comfort zone for many enterprise IT managers. It sounds reassuring to be able to work with one of the major public-cloud providers like AWS or Microsoft Azure while still maintaining an on-site presence.
Much of the value of DevOps comes from a (renewed) focus on measurement, sharing, and continuous feedback loops. In increasingly complex DevOps workflows and environments, and especially in larger, regulated, or more crystallized organizations, these core concepts become even more critical. In his session at @DevOpsSummit at 18th Cloud Expo, Andi Mann, Chief Technology Advocate at Splunk, showed how, by focusing on 'metrics that matter,' you can provide objective, transparent, and meaningful f...
Logs are continuous digital records of events generated by all components of your software stack – and they’re everywhere – your networks, servers, applications, containers and cloud infrastructure just to name a few. The data logs provide are like an X-ray for your IT infrastructure. Without logs, this lack of visibility creates operational challenges for managing modern applications that drive today’s digital businesses.