Why Your Analytics Should Be Hosted

It's become increasingly clear that Big Data, and the tools for manipulating, visualizing and analyzing it, are transforming the business landscape. McKinsey released a report in 2011 that projects 40 percent growth in global data generated per year. This is all well and good, but more and more companies are finding that their toolbox for dealing with all of this data is antiquated and confusing.
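To get a feel for what 40 percent annual growth implies, here is a small back-of-the-envelope sketch. It assumes the growth compounds at a constant rate each year, which is a simplification of my own rather than something the McKinsey figure guarantees; the function names are purely illustrative.

```python
import math


def doubling_time_years(annual_growth_rate: float) -> float:
    """Years for a quantity to double under constant compound growth."""
    if annual_growth_rate <= 0:
        raise ValueError("annual_growth_rate must be positive")
    return math.log(2) / math.log(1 + annual_growth_rate)


def growth_factor(annual_growth_rate: float, years: int) -> float:
    """How many times larger the quantity is after the given number of years."""
    return (1 + annual_growth_rate) ** years


if __name__ == "__main__":
    rate = 0.40  # the 40 percent figure cited above, assumed constant
    print(f"Doubling time: {doubling_time_years(rate):.2f} years")
    print(f"Growth over 5 years: {growth_factor(rate, 5):.2f}x")
```

Under that assumption, the volume of data doubles roughly every two years and grows more than fivefold in five, which is why a toolbox that felt adequate a few years ago can quickly feel antiquated.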

Indeed, 58 percent of enterprise decision makers surveyed in March 2012 by DataXu felt they lacked the skills and technology required for marketing analytics. Marketers should be chomping at the bit to fruitfully employ the data they have. Successful marketing requires proper segmentation of the customer base to create more targeted campaigns. Real-time insight into the performance of existing campaigns and a clear grasp of where to redirect efforts can also turn a campaign that would have failed into a success. These are the promises made by the drivers of the current "data movement." The unfortunate reality, however, is that the accumulation of data just adds to the costs of an organization as it struggles to merely store the incoming torrent of data, let alone harness it and allow non-technical individuals to explore and understand it.

Luckily, this isn't the first time that industries have experienced this type of problem. Like many movements before it, the data movement started out as a niche interest of a select few people and is growing into a commoditized marketplace that competes on usability and ease of access.

Of all metaphors to pick for this process, the restaurant is an apt one. Cooking is something everyone can do. Mix up some batter, put it on a hot skillet, and you'll get pancakes. Add some eggs and a glass of orange juice and you've either got your brain on drugs or a complete breakfast. You can also go to your local IHOP and order the same thing. If you make it yourself, you know everything that's in it and can control the various aspects of the meal. But you also have to deal with acquiring the ingredients, having the facilities to cook, and doing the cleanup. If you go to a restaurant, all you have to do is show up, tell them what you want, and pay.

Similarly, the analytics space has two types of offerings. You can choose to do it yourself or you can use a hosted service to take care of things for you. As with cooking versus going to a restaurant, there are costs and benefits associated with both, but my biased opinion is that a hosted solution is the best choice for tackling the current influx of data.

Economies of Scale
Restaurants provide the benefits of economies of scale to their patrons, allowing customers to consume and enjoy foods that they normally wouldn't be able to at home. High-quality tuna is rather expensive and generally comes in quantities that no individual person could ever consume before it goes bad. Yet, you can go to a sushi restaurant and get various parts of the fish. This is economies of scale in action. The restaurant can afford to put down a significant sum of money to acquire the whole tuna and resell it in pieces to its patrons.

Hosted analytics presents a similar case. A hosted analytics provider is able to pay more money upfront for hardware than any one of its customers would. The reality of data processing is that there are physical limits to how much data a computer can process in a given amount of time. This problem can only be overcome with more and better hardware.

Because it serves multiple users, a hosted system has a built-in incentive to provision enough machines to answer questions quickly. The compute resources are only required for the duration of a query against the system. The faster a query gets answered, the sooner those resources are freed up to answer someone else's query. Responding to queries fast enough to free up resources for the next query is the only way to achieve high levels of concurrency. Because the hosted provider is building its business on the idea that multiple customers will share the same infrastructure, it has to support more than one query at a time and is thus naturally forced to provide its users with a faster querying experience. Economies of scale work to the users' advantage.
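The relationship between query speed and shared capacity can be made concrete with a rough sketch based on Little's law (average queries in flight = arrival rate × average time per query). The slot counts and query times below are hypothetical numbers chosen for illustration, not measurements from any real system, and the model ignores queuing and uneven load.

```python
def max_queries_per_second(concurrent_slots: int, seconds_per_query: float) -> float:
    """Upper bound on sustained throughput for a shared cluster.

    By Little's law, slots_in_use = arrival_rate * seconds_per_query,
    so with a fixed number of slots the arrival rate the cluster can
    sustain is slots / seconds_per_query.
    """
    if concurrent_slots <= 0:
        raise ValueError("concurrent_slots must be positive")
    if seconds_per_query <= 0:
        raise ValueError("seconds_per_query must be positive")
    return concurrent_slots / seconds_per_query


if __name__ == "__main__":
    slots = 100  # hypothetical number of queries the hardware can run at once
    for seconds in (10.0, 1.0):  # hypothetical average query times
        rate = max_queries_per_second(slots, seconds)
        print(f"{seconds:>4} s per query -> up to {rate:.0f} queries per second")
```

In this simplified model, cutting the average query time from ten seconds to one lets the same hardware serve ten times as many queries per second, which is the sense in which faster answers and higher concurrency go hand in hand.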

Integration of Diverse Data Streams
Another benefit of hosted analytics systems is that they can provide overnight integration with other data sets, both public and private. Taking this back to the restaurant analogy, restaurants add new items to their menu on a regular basis. If they find a supplier that will give them Alaskan king crab for the same price as a lesser form of crab, patrons will all of a sudden start eating better crab without having even known it was coming. The hosted analytics case is similar in that users can take advantage of new data sets that the provider has integrated.

Consider the following scenario. A marketer might normally have access to customer profile and engagement information through their analytics system. Companies like Amazon Web Services offer up data sets from the human genome, the U.S. Census Bureau, and Wikipedia. If a hosted analytics company integrates a public data set like one of these, it can then expose that data set to all of its clients. This means that if there are 1,000 customers of the hosted offering and only one of them asks for the integration of the public data set, the other 999 customers get that same integration overnight. All of the participants reap the benefits of having more data sets available. Through the process of overlaying various data streams, marketers can learn more about their customers and their behavior in order to better target their campaigns. This is just one more benefit hosted offerings provide to ensure that companies can maximally leverage the value of their data.
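The arithmetic behind that scenario is simple enough to sketch. The integration cost below is a made-up figure used only to show how the cost of one integration spreads across a shared customer base; the function and field names are illustrative.

```python
def shared_integration_summary(
    total_customers: int, requesting_customers: int, integration_cost: float
) -> dict:
    """Summarize who benefits when a hosted provider integrates one data set."""
    if total_customers <= 0:
        raise ValueError("total_customers must be positive")
    if not 0 <= requesting_customers <= total_customers:
        raise ValueError("requesting_customers must be between 0 and total_customers")
    return {
        # Customers who receive the data set without having asked for it.
        "customers_benefiting_without_asking": total_customers - requesting_customers,
        # The one-time cost spread evenly over everyone on the platform.
        "cost_per_customer": integration_cost / total_customers,
    }


if __name__ == "__main__":
    # 1,000 customers and one requester, as in the scenario above;
    # the 50,000 cost is a hypothetical figure.
    print(shared_integration_summary(1000, 1, 50_000.0))
```

With these hypothetical numbers, an integration that would cost a single company 50,000 on its own works out to 50 per customer when spread across the platform, and 999 customers receive it without having asked.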

Useful Analytics
Analytics are only good if they are understandable and actionable, just as restaurants are only good if their food is edible and delicious. There are thousands of ingredients that could be mixed in with fried eggs, but some will taste delicious and some will just result in an inedible concoction. As patrons of many restaurants, we often come to a consensus on what various restaurants do well, personal taste notwithstanding. This knowledge can be employed to eat only the best meals. The same mechanism of collective understanding will play itself out in the hosted analytics space.

Any company that provides hosted analytics to a variety of businesses wants to give its customers only the most useful analytical metrics and functionalities. Marketers may not have the specific training to pinpoint exactly which analysis methods to leverage for maximal effect. That's where the multi-tenant properties of hosted analytics work to your benefit. The hosted analytics provider will be sensitive to which of their tools are providing the most value across their entire customer base. In other words, the individual customers all come together to form a collaborative filter to ensure that the less useful analytics features will be cast aside in favor of those that yield valuable insights. As with the integration of public data sets, this filtering mechanism ensures that benefits cascade throughout the entire system of analytics users. Even for features that do not seem to be immediately relevant to your company's success, as a customer of a hosted provider you can rest assured that once your company turns that corner in its business growth, the hosted provider already knows the kinds of analysis you'll find yourself needing and has the tools available. Newcomers to the platform are thus quickly able to reap the benefits of an analytical toolset that has been vetted by the crowd.

In the past few years, Big Data has exploded in importance. Marketers must learn how to take away useful, actionable insights from the mass of data at their fingertips in order to create a competitive advantage for their companies. Hosted analytics systems will prove themselves to be a staple choice for deciphering the increasing amounts of data that companies have to deal with, just as restaurants are a ubiquitous presence in our lives today.

In closing, we can stretch the restaurant metaphor just a little bit more. In both a restaurant and a home kitchen, there's an able cook who knows how to turn raw ingredients into a delicious meal. Similarly, the future still includes analysts who understand the intricacies of your business. You will, however, achieve much more efficient use of your analyst's time by leveraging the benefits of a hosted analytics provider: improved performance, "free" integration of external data sets, and collaborative vetting of the analytical feature set.

More Stories By Eric Tschetter

Eric Tschetter is the lead architect at Metamarkets, a leader in big data analytics for web-scale companies. Follow Metamarkets on Twitter @Metamarkets and learn more at www.metamarkets.com.
