Click here to close now.




















Welcome!

Microservices Expo Authors: Pat Romanski, Trevor Parsons, Cloud Best Practices Network, Elizabeth White, Joe Pruitt

Related Topics: @CloudExpo, Java IoT, Microservices Expo, Microsoft Cloud, Agile Computing, Release Management

@CloudExpo: Article

Why Your Analytics Should Be Hosted

It’s become increasingly clear that Big Data is transforming the business landscape

It's become increasingly clear that Big Data, and the tools for manipulating, visualizing and analyzing it, are transforming the business landscape. McKinsey released a report in 2011 that projects 40 percent growth in global data generated per year. This is all well and good, but more and more companies are finding that their toolbox for dealing with all of this data is antiquated and confusing.

Indeed, 58 percent of enterprise decision makers surveyed in March 2012 by DataXu felt they lacked the skills and technology required for marketing analytics. Marketers should be chomping at the bit to fruitfully employ the data they have. Successful marketing requires proper segmentation of the customer base to create more targeted campaigns. Real-time insight into the performance of existing campaigns and a clear grasp of where to redirect efforts can also turn a campaign that would have failed into a success. These are the promises made by the drivers of the current "data movement." The unfortunate reality, however, is that the accumulation of data just adds to the costs of an organization as it struggles to merely store the incoming torrent of data, let alone harness it and allow non-technical individuals to explore and understand it.

Luckily, this isn't the first time that industries have experienced this type of problem. The data movement is just like any other one that starts out as a niche interest to a select few people, eventually growing into a commoditized marketplace that competes on usability and ease of access.

Of all metaphors to pick for this process, the restaurant is an apt one. Cooking is something everyone can do. Mix up some batter, put it on a hot skillet, and you'll get pancakes. Add some eggs and a glass of orange juice and you've either got your brain on drugs or a complete breakfast. You can also go to your local IHOP and order the same thing. If you make it yourself, you know everything that's in it and can control the various aspects of the meal. But you also have to deal with acquiring the ingredients, having the facilities to cook, and doing the cleanup. If you go to a restaurant, all you have to do is show up, tell them what you want, and pay.

Similarly, the analytics space has two types of offerings. You can choose to do it yourself or you can use a hosted service to take care of things for you. As with cooking versus going to a restaurant, there are costs and benefits associated with both, but my biased opinion is that a hosted solution is the best choice for tackling the current influx of data.

Economies of Scale
Restaurants provide the benefits of economies of scale to their patrons, allowing customers to consume and enjoy foods that they normally wouldn't be able to at home. High-quality tuna is rather expensive and generally comes in quantities that no individual person could ever consume before it goes bad. Yet, you can go to a sushi restaurant and get various parts of the fish. This is economies of scale in action. The restaurant can afford to put down a significant sum of money to acquire the whole tuna and resell it in pieces to its patrons.

Hosted analytics presents a similar case. A hosted analytics provider is able to pay more money upfront for hardware than any one of its customers would. The reality of data processing is that there are physical limitations to the amount of data a computer can process given a certain amount of time. This problem can only be overcome with more and better hardware.

Because it serves multiple users, a hosted system is actually incentivized to provision enough machines to answer questions quickly. The compute resources are only required for the duration of a query against the system. The faster a query gets answered, the quicker those resources are freed up to answer someone else's query. Responding to queries fast enough to free up resources for the next query is actually the only way to achieve high levels of concurrency. Because the hosted provider is building their business on the idea that multiple customers will share the same infrastructure, they have to support more than just one query at a time and thus are naturally forced to provide their users with a faster querying experience. Economies of scale work to the users' advantage.

Integration of Diverse Data Streams
Another benefit of hosted analytics systems is that they can provide overnight integration with other data sets, both public and private. Taking this back to the restaurant analogy, restaurants add new items to their menu on a regular basis. If they find a supplier that will give them Alaskan king crab for the same price as a lesser form of crab, patrons will all of a sudden start eating better crab without having even known it was coming. The hosted analytics case is similar in that users can take advantage of new data sets that the provider has integrated.

Consider the following scenario. A marketer might normally have access to customer profile and engagement information through their analytics system. Companies like Amazon Web Services offer up data sets from the human genome, the U.S. Census Bureau, and Wikipedia. If a hosted analytics company integrates a public data set like one of these, they can then expose it to all of their clients. This means that if there are 1,000 customers of the hosted offering and only one of them asks for the integration of the public data set, 999 customers get that same integration overnight. All of the participants reap the benefits of having more data sets available. Through the process of overlaying various data streams, marketers can learn more about their customers and their behavior in order to better target their campaigns. This is just one more benefit hosted offerings provide to ensure that companies can maximally leverage the value of their data.

Useful Analytics
Analytics are only good if they are understandable and actionable, just as restaurants are only good if their food is edible and delicious. There are thousands of ingredients that could be mixed in with fried eggs, but some will taste delicious and some will just result in an inedible concoction. As patrons of many restaurants, we often come to a consensus on what various restaurants do well, personal taste notwithstanding. This knowledge can be employed to eat only the best meals. The same mechanism of collective understanding will play itself out in the hosted analytics space.

Any company that provides hosted analytics to a variety of businesses wants to give its customers only the most useful analytical metrics and functionalities. Marketers may not have the specific training to pinpoint exactly which analysis methods to leverage for maximal effect. That's where the multi-tenant properties of hosted analytics work to your benefit. The hosted analytics provider will be sensitive to which of their tools are providing the most value across their entire customer base. In other words, the individual customers all come together to form a collaborative filter to ensure that the less useful analytics features will be cast aside in favor of those that yield valuable insights. As with the integration of public data sets, this filtering mechanism ensures that benefits cascade throughout the entire system of analytics users. Even for features that do not seem to be immediately relevant to your company's success, as a customer of a hosted provider you can rest assured that once your company turns that corner in its business growth, the hosted provider already knows the kinds of analysis you'll find yourself needing and has the tools available. Newcomers to the platform are thus quickly able to reap the benefits of an analytical toolset that has been vetted by the crowd.

In the past few years, Big Data has exploded in importance. Marketers must learn how to take away useful, actionable insights from the mass of data at their hands in order to create a competitive advantage for their companies. Hosted analytics systems will truly prove themselves to be a staple choice for deciphering the increasing amounts of data that companies have to deal with, just as restaurants are a ubiquitous presence in our current lives.

In closing, we can stretch the restaurant metaphor just a little bit more. In both a restaurant and a home kitchen, there's an able cook who knows how to turn raw ingredients into a delicious meal. Similarly, the future still includes analysts who understand the intricacies of your business. You will, however, achieve much more efficient use of your analyst's time by leveraging the benefits of a hosted analytics provider: improved performance, "free" integration of external data sets, and collaborative vetting of the analytical feature set.

More Stories By Eric Tschetter

Eric Tschetter is the lead architect at Metamarkets, a leader in big data analytics for web-scale companies. Follow Metamarkets on Twitter @Metamarkets and learn more at www.metamarkets.com.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@MicroservicesExpo Stories
Learn how to solve the problem of keeping files in sync between multiple Docker containers. In his session at 16th Cloud Expo, Aaron Brongersma, Senior Infrastructure Engineer at Modulus, discussed using rsync, GlusterFS, EBS and Bit Torrent Sync. He broke down the tools that are needed to help create a seamless user experience. In the end, can we have an environment where we can easily move Docker containers, servers, and volumes without impacting our applications? He shared his results so yo...
Auto-scaling environments, micro-service architectures and globally-distributed teams are just three common examples of why organizations today need automation and interoperability more than ever. But is interoperability something we simply start doing, or does it require a reexamination of our processes? And can we really improve our processes without first making interoperability a requirement for how we choose our tools?
Cloud Migration Management (CMM) refers to the best practices for planning and managing migration of IT systems from a legacy platform to a Cloud Provider through a combination professional services consulting and software tools. A Cloud migration project can be a relatively simple exercise, where applications are migrated ‘as is’, to gain benefits such as elastic capacity and utility pricing, but without making any changes to the application architecture, software development methods or busine...
The Software Defined Data Center (SDDC), which enables organizations to seamlessly run in a hybrid cloud model (public + private cloud), is here to stay. IDC estimates that the software-defined networking market will be valued at $3.7 billion by 2016. Security is a key component and benefit of the SDDC, and offers an opportunity to build security 'from the ground up' and weave it into the environment from day one. In his session at 16th Cloud Expo, Reuven Harrison, CTO and Co-Founder of Tufin,...
JavaScript is primarily a client-based dynamic scripting language most commonly used within web browsers as client-side scripts to interact with the user, browser, and communicate asynchronously to servers. If you have been part of any web-based development, odds are you have worked with JavaScript in one form or another. In this article, I'll focus on the aspects of JavaScript that are relevant within the Node.js environment.
You often hear the two titles of "DevOps" and "Immutable Infrastructure" used independently. In his session at DevOps Summit, John Willis, Technical Evangelist for Docker, covered the union between the two topics and why this is important. He provided an overview of Immutable Infrastructure then showed how an Immutable Continuous Delivery pipeline can be applied as a best practice for "DevOps." He ended the session with some interesting case study examples.
Approved this February by the Internet Engineering Task Force (IETF), HTTP/2 is the first major update to HTTP since 1999, when HTTP/1.1 was standardized. Designed with performance in mind, one of the biggest goals of HTTP/2 implementation is to decrease latency while maintaining a high-level compatibility with HTTP/1.1. Though not all testing activities will be impacted by the new protocol, it's important for testers to be aware of any changes moving forward.
One of the ways to increase scalability of services – and applications – is to go “stateless.” The reasons for this are many, but in general by eliminating the mapping between a single client and a single app or service instance you eliminate the need for resources to manage state in the app (overhead) and improve the distributability (I can make up words if I want) of requests across a pool of instances. The latter occurs because sessions don’t need to hang out and consume resources that could ...
Alibaba, the world’s largest ecommerce provider, has pumped over a $1 billion into its subsidiary, Aliya, a cloud services provider. This is perhaps one of the biggest moments in the global Cloud Wars that signals the entry of China into the main arena. Here is why this matters. The cloud industry worldwide is being propelled into fast growth by tremendous demand for cloud computing services. Cloud, which is highly scalable and offers low investment and high computational capabilities to end us...
The Internet of Things. Cloud. Big Data. Real-Time Analytics. To those who do not quite understand what these phrases mean (and let’s be honest, that’s likely to be a large portion of the world), words like “IoT” and “Big Data” are just buzzwords. The truth is, the Internet of Things encompasses much more than jargon and predictions of connected devices. According to Parker Trewin, Senior Director of Content and Communications of Aria Systems, “IoT is big news because it ups the ante: Reach out ...
At DevOps Summit NY there’s been a whole lot of talk about not just DevOps, but containers, IoT, and microservices. Sessions focused not just on the cultural shift needed to grow at scale with a DevOps approach, but also made sure to include the network ”plumbing” needed to ensure success as applications decompose into the microservice architectures enabling rapid growth and support for the Internet of (Every)Things.
Our guest on the podcast this week is Adrian Cockcroft, Technology Fellow at Battery Ventures. We discuss what makes Docker and Netflix highly successful, especially through their use of well-designed IT architecture and DevOps.
Explosive growth in connected devices. Enormous amounts of data for collection and analysis. Critical use of data for split-second decision making and actionable information. All three are factors in making the Internet of Things a reality. Yet, any one factor would have an IT organization pondering its infrastructure strategy. How should your organization enhance its IT framework to enable an Internet of Things implementation? In his session at @ThingsExpo, James Kirkland, Red Hat's Chief Arch...
This week, I joined SOASTA as Senior Vice President of Performance Analytics. Given my background in cloud computing and distributed systems operations — you may have read my blogs on CNET or GigaOm — this may surprise you, but I want to explain why this is the perfect time to take on this opportunity with this team. In fact, that’s probably the best way to break this down. To explain why I’d leave the world of infrastructure and code for the world of data and analytics, let’s explore the timing...
Digital Transformation is the ultimate goal of cloud computing and related initiatives. The phrase is certainly not a precise one, and as subject to hand-waving and distortion as any high-falutin' terminology in the world of information technology. Yet it is an excellent choice of words to describe what enterprise IT—and by extension, organizations in general—should be working to achieve. Digital Transformation means: handling all the data types being found and created in the organizat...
Public Cloud IaaS started its life in the developer and startup communities and has grown rapidly to a $20B+ industry, but it still pales in comparison to how much is spent worldwide on IT: $3.6 trillion. In fact, there are 8.6 million data centers worldwide, the reality is many small and medium sized business have server closets and colocation footprints filled with servers and storage gear. While on-premise environment virtualization may have peaked at 75%, the Public Cloud has lagged in adop...
SYS-CON Events announced today that HPM Networks will exhibit at the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. For 20 years, HPM Networks has been integrating technology solutions that solve complex business challenges. HPM Networks has designed solutions for both SMB and enterprise customers throughout the San Francisco Bay Area.
MuleSoft has announced the findings of its 2015 Connectivity Benchmark Report on the adoption and business impact of APIs. The findings suggest traditional businesses are quickly evolving into "composable enterprises" built out of hundreds of connected software services, applications and devices. Most are embracing the Internet of Things (IoT) and microservices technologies like Docker. A majority are integrating wearables, like smart watches, and more than half plan to generate revenue with ...
Rapid innovation, changing business landscapes, and new IT demands force businesses to make changes quickly. The DevOps approach is a way to increase business agility through collaboration, communication, and integration across different teams in the IT organization. In his session at DevOps Summit, Chris Van Tuin, Chief Technologist for the Western US at Red Hat, will discuss: The acceleration of application delivery for the business with DevOps
Software is eating the world. The more it eats, the bigger the mountain of data and wealth of valuable insights to digest and act on. Forward facing customer-centric IT organizations, leaders and professionals are looking to answer questions like how much revenue was lost today from platinum users not converting because they experienced poor mobile app performance. This requires a single, real-time pane of glass for end-to-end analytics covering business, customer, and IT operational data.