Welcome!

Microservices Expo Authors: Pat Romanski, Liz McMillan, Elizabeth White, Yeshim Deniz, Zakia Bouachraoui

Related Topics: Microservices Expo

Microservices Expo: Article

Scaling SOA

Implementing SOA comes with overhead, but smart architecture can lead to infinitely scalable SOA implementations

Based upon conversations with architects in ZapThink’s Licensed ZapThink Architect (LZA) course, our High Performance SOA ZapFlash back in the fall of 2006 was a bit ahead of its time. In 2006, most architects were simply focusing on how to put SOA implementations together, but today, the focus has unquestionably shifted to how to make SOA implementations enterprise-class. As a result, there is predictably an increasing emphasis on scalability.

Conventional wisdom might suggest that SOA implementations scale poorly. While it’s true that abstraction layers always add some overhead, if your SOA implementation is too slow or doesn’t scale properly, the problem is more likely with how you went about architecting it, rather than a problem with the SOA approach itself. Understanding the best practices for building scalable, high performance SOA implementations, therefore, is an essential skill for any SOA architect.

The Three Dimensions of Scalability
Building scalable SOA implementations begins predictably with broad-based scalability best practices. The SOA architect must then understand how to apply these practices in the SOA context. SOA introduces some new scalability challenges to be sure, but from the systems perspective (where scalability lives), SOA is just one more approach for accessing enterprise resources.

The scalability approach this ZapFlash will focus on is called the “Scalability Cube,” which is one way of looking at the core best practices for enterprise scalability. This cube has three dimensions, which you can think of as the X, Y, and Z axes. Basically, there are three essentially orthogonal approaches to scalability, and the architect’s challenge is in properly combining them to achieve the best results.

Each axis consists of multiple instances of an underlying implementation. Each instance contains some subset of the infrastructure necessary to support one or more Services, including data stores, data integration technology, legacy applications and integration technology, Service run time containers, and whatever middleware glues the lot together. The key to scaling the SOA implementation is to duplicate such instances in particular ways depending upon which axis we’re following.

The most straightforward scalability dimension is the X axis. The X axis represents the distribution of the same work or mirroring of data. To scale an implementation along the X axis, deploy multiple identical systems that are essentially mirror images of capabilities and data. A typical example of X axis scaling would be multiple identical databases on the back end supporting identical Service implementation instances. Scaling out, then, depends upon the straightforward process of adding identical instances.

The advantages of scaling along the X axis are as follows:

  • It’s simple to implement and simple to scale out. Simply distribute Service requests via a simple round robin algorithm on a load balancer or other intermediary.
  • It provides the best recovery from failure. If anything goes wrong with one of the instances, simply route traffic to other instances until you fix the problem. As a result, most disaster recovery strategies rely upon X axis scalability.
  • It works well with Entity Services, because Entity Services tend to abstract database tables or data integrations, which easily fit into an X axis instance.

There are disadvantages of X axis scaling as well:

  • It doesn’t maintain process instance state well, because multiple requests from the same consumer may be routed to different X axis instances in an essentially random fashion. Therefore, you must typically rely on messages to maintain state, which may require custom message headers and Service contracts.
  • X axis scaling doesn’t work for Services that abstract legacy systems that can’t be duplicated further. If you only have that one old mainframe acting as the back end for your Services, it will prevent X axis scaling of the infrastructure that supports the Service interfaces to the legacy system.

Orthogonal to the X axis is the Y axis, which represents the distribution and separation of work responsibilities or data across multiple instances. The distribution of capabilities along the Y axis are by resource, and such instances separate data based on data type or type of work. An example of scaling along the Y axis would be where each database and corresponding middleware instance supports a separate set of Services. To scale up, run fewer and fewer Services on any particular implementation instance as you add instances.

The advantages of Y axis scaling are:

  • It’s the best way to scale reusable Services. As the demand for reusable Services increases, their implementations can be distributed to as many implementation instances as necessary.
  • It works well with Task Services, because Task Services are less likely to abstract capabilities across multiple data sources as Entity Services are.
  • Y axis scaling also works well for abstracting legacy systems that can’t be duplicated further, although the scalability of such systems will still impact the overall scalability of the Services that abstract them. (Resolving that issue typically involves some kind of caching scheme.)

The disadvantages of Y axis scaling are:

  • Entity Services that abstract complex joins do poorly with Y axis scaling, because the complexity of the underlying implementation limits how narrow each instance can become.
  • Y axis scaling also deals with state poorly, when compositions include Services from different Y axis instances. However, because it’s possible in some cases to plan for Y axis scaling that maintains compositions in separate Y axis instances, it can maintain state more effectively than X axis scaling some of the time.

Finally, the Z axis represents distribution and segmentation of work by customer, customer need, location or value, in other words, by Service consumer or by type of request. For example, each database and corresponding Service container instance supports a separate, pre-defined set of consumers.

The advantages of Z axis scaling include:

  • It’s the best approach for addressing the state issue of the other approaches. There’s no chance a user’s state will be lost within the context of a particular Z axis instance, as long as that instance is working properly.
  • In some cases, individual compositions could also be allocated to individual Z axis instances, if different consumers consume different processes implemented by those compositions. This advantage would only apply for those processes that are specific to particular consumers.

The disadvantages of Z axis scaling are:

  • It doesn’t work well for reusable Services, for obvious reasons: consumers assigned to one Z axis instance can’t consume Services in another.
  • Z axis scaling also doesn’t deal well with legacy systems that can’t be duplicated further, as with X axis scaling.

Putting the Cube Together
In general, no one axis is sufficient to scale a SOA implementation fully. The architect must plan combinations of approaches that fit into the Cube as though it were a Chinese puzzle. Some examples of pairwise combinations are as follows:

  • X and Y axis — mirrored, identical clusters of servers, where each server in a particular cluster runs some set of Services distinct from the other servers in the cluster. Alternately, the X and Y axis combination might be several clusters, where each cluster addresses a specific set of Services, and each cluster is made up of a set of mirrored, identical servers.
  • X and Z axis — mirrored, identical clusters of servers, where each server in a particular cluster is dedicated to requests from a specific subset of consumer requests. Alternately, the X and Z axis combination might be several clusters, where each cluster addresses a specific subset of consumers, and each cluster is made up of a set of mirrored, identical servers.
  • Y and Z axis — a set of clusters of servers, where each cluster hosts a set of Services and each server in a cluster responds to requests from a particular set of consumers, or vice versa, naturally.
  • X, Y, and Z axis — Leveraging the full cube essentially requires clusters of clusters. There are too many possibilities to list here, so we’ll leave this option as an exercise for the reader!

The ZapThink Take
It’s important to remember that scalability, for all its complexity, is still only one part of the SOA performance problem. High performance begins with good design, as design mistakes like improper Service granularity or overly complex schemas can sink your performance. It’s just as important to deal with potential bottlenecks, which in the SOA case, typically involve XML processing issues of various sorts. High performance SOA means getting all of these elements right, as any one of them can limit the overall performance of your implementation.

We’d like to hear from you about your SOA performance challenges and solutions! Please email us at [email protected] if you have any SOA performance stories — either horror stories or success stories. If you have a product or service that can help with SOA performance, we’d like to hear from you too! We’ll use the information we receive in our presentation at our upcoming High Performance SOA seminar in New York City on March 29th. Watch the new ZapThink Web site for more information on that event as it develops. Hope to see you there!

More Stories By Jason Bloomberg

Jason Bloomberg is a leading IT industry analyst, Forbes contributor, keynote speaker, and globally recognized expert on multiple disruptive trends in enterprise technology and digital transformation. He is ranked #5 on Onalytica’s list of top Digital Transformation influencers for 2018 and #15 on Jax’s list of top DevOps influencers for 2017, the only person to appear on both lists.

As founder and president of Agile Digital Transformation analyst firm Intellyx, he advises, writes, and speaks on a diverse set of topics, including digital transformation, artificial intelligence, cloud computing, devops, big data/analytics, cybersecurity, blockchain/bitcoin/cryptocurrency, no-code/low-code platforms and tools, organizational transformation, internet of things, enterprise architecture, SD-WAN/SDX, mainframes, hybrid IT, and legacy transformation, among other topics.

Mr. Bloomberg’s articles in Forbes are often viewed by more than 100,000 readers. During his career, he has published over 1,200 articles (over 200 for Forbes alone), spoken at over 400 conferences and webinars, and he has been quoted in the press and blogosphere over 2,000 times.

Mr. Bloomberg is the author or coauthor of four books: The Agile Architecture Revolution (Wiley, 2013), Service Orient or Be Doomed! How Service Orientation Will Change Your Business (Wiley, 2006), XML and Web Services Unleashed (SAMS Publishing, 2002), and Web Page Scripting Techniques (Hayden Books, 1996). His next book, Agile Digital Transformation, is due within the next year.

At SOA-focused industry analyst firm ZapThink from 2001 to 2013, Mr. Bloomberg created and delivered the Licensed ZapThink Architect (LZA) Service-Oriented Architecture (SOA) course and associated credential, certifying over 1,700 professionals worldwide. He is one of the original Managing Partners of ZapThink LLC, which was acquired by Dovel Technologies in 2011.

Prior to ZapThink, Mr. Bloomberg built a diverse background in eBusiness technology management and industry analysis, including serving as a senior analyst in IDC’s eBusiness Advisory group, as well as holding eBusiness management positions at USWeb/CKS (later marchFIRST) and WaveBend Solutions (now Hitachi Consulting), and several software and web development positions.

Microservices Articles
Lori MacVittie is a subject matter expert on emerging technology responsible for outbound evangelism across F5's entire product suite. MacVittie has extensive development and technical architecture experience in both high-tech and enterprise organizations, in addition to network and systems administration expertise. Prior to joining F5, MacVittie was an award-winning technology editor at Network Computing Magazine where she evaluated and tested application-focused technologies including app secu...
Using new techniques of information modeling, indexing, and processing, new cloud-based systems can support cloud-based workloads previously not possible for high-throughput insurance, banking, and case-based applications. In his session at 18th Cloud Expo, John Newton, CTO, Founder and Chairman of Alfresco, described how to scale cloud-based content management repositories to store, manage, and retrieve billions of documents and related information with fast and linear scalability. He addresse...
Adding public cloud resources to an existing application can be a daunting process. The tools that you currently use to manage the software and hardware outside the cloud aren’t always the best tools to efficiently grow into the cloud. All of the major configuration management tools have cloud orchestration plugins that can be leveraged, but there are also cloud-native tools that can dramatically improve the efficiency of managing your application lifecycle. In his session at 18th Cloud Expo, ...
The now mainstream platform changes stemming from the first Internet boom brought many changes but didn’t really change the basic relationship between servers and the applications running on them. In fact, that was sort of the point. In his session at 18th Cloud Expo, Gordon Haff, senior cloud strategy marketing and evangelism manager at Red Hat, will discuss how today’s workloads require a new model and a new platform for development and execution. The platform must handle a wide range of rec...
Containers and Kubernetes allow for code portability across on-premise VMs, bare metal, or multiple cloud provider environments. Yet, despite this portability promise, developers may include configuration and application definitions that constrain or even eliminate application portability. In this session we'll describe best practices for "configuration as code" in a Kubernetes environment. We will demonstrate how a properly constructed containerized app can be deployed to both Amazon and Azure ...
SYS-CON Events announced today that DatacenterDynamics has been named “Media Sponsor” of SYS-CON's 18th International Cloud Expo, which will take place on June 7–9, 2016, at the Javits Center in New York City, NY. DatacenterDynamics is a brand of DCD Group, a global B2B media and publishing company that develops products to help senior professionals in the world's most ICT dependent organizations make risk-based infrastructure and capacity decisions.
Discussions of cloud computing have evolved in recent years from a focus on specific types of cloud, to a world of hybrid cloud, and to a world dominated by the APIs that make today's multi-cloud environments and hybrid clouds possible. In this Power Panel at 17th Cloud Expo, moderated by Conference Chair Roger Strukhoff, panelists addressed the importance of customers being able to use the specific technologies they need, through environments and ecosystems that expose their APIs to make true ...
In his keynote at 19th Cloud Expo, Sheng Liang, co-founder and CEO of Rancher Labs, discussed the technological advances and new business opportunities created by the rapid adoption of containers. With the success of Amazon Web Services (AWS) and various open source technologies used to build private clouds, cloud computing has become an essential component of IT strategy. However, users continue to face challenges in implementing clouds, as older technologies evolve and newer ones like Docker c...
CloudEXPO New York 2018, colocated with DXWorldEXPO New York 2018 will be held November 11-13, 2018, in New York City and will bring together Cloud Computing, FinTech and Blockchain, Digital Transformation, Big Data, Internet of Things, DevOps, AI, Machine Learning and WebRTC to one location.
DevOpsSummit New York 2018, colocated with CloudEXPO | DXWorldEXPO New York 2018 will be held November 11-13, 2018, in New York City. Digital Transformation (DX) is a major focus with the introduction of DXWorldEXPO within the program. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term.