Welcome!

Microservices Expo Authors: Liz McMillan, Pat Romanski, Carmen Gonzalez, Elizabeth White, Jason Bloomberg

Related Topics: Microservices Expo

Microservices Expo: Article

XML Binding Frameworks in the Context of Service-Oriented Architecture

Make an informed choice about a binding framework for your SOA needs

XMLBeans (http://xmlbeans.apache.org): XMLBeans uses XML Schema to compile to Java interfaces and classes that allow modification of XML instance data. XMLBeans provides setter and getter methods like JavaBeans. XMLBeans distinguishes itself by fully supporting XML Schema. During demarshalling an XML instance the full XML infoset is kept and is available to the developer. This is critical because a subset of XML is not easily represented in Java, e.g., the order of elements or comments might be needed in an application.

JAXB (www.java.sun.com/xml/jaxb): JAXB is a standard for XML binding and consists of two parts. First, JAXB contains a compiler that reads a schema and produces the equivalent Java object model. This generated object model captures the structure of XML better than general-purpose APIs like DOM or SAX, making it a lot easier to manipulate XML content.

The second part is an API, through which applications communicate with generated code. The JAXB implementation from SUN JWSDP1.5 is used.

Quantitative Evaluation of the Frameworks
We have quantitatively evaluated the XML binding frameworks on the following parameters.

  1. Marshalling Time - Marshalling is the process of generating an XML representation for an object in memory. As with Java object serialization, the representation needs to include all dependent objects: objects referenced by our main object, objects referenced by those objects, and so on.
  2. Demarshalling Time - Demarshalling is the reverse process of marshalling; it builds an object (and potentially a graph of linked objects) in memory from an XML representation.
  3. Memory Load - JVM Heap memory used for the whole process of marshalling and demarshalling using the respective XML binding frameworks.
Performance Tests
The test run used three samples of XML varied by its nature (segmentation factor). The test cases had to execute the selected binding frameworks on the sample XML files serially some x number of times. In addition to these, x number of threads have been executed concurrently. The combination of these test scenarios is an indicator of the memory load on a platform and time statistics of each of these utilities (see Table 1).

Test Suite
The JProbe profiler and Memory Debugger have been used to capture the performance, memory, and threading coverage issues of the respective binding frameworks. Each test case is run as a stand-alone session to have snapshots of the performance captured for each program execution. The tests are performed on a system with the following configuration:

  • Pentium IV 1.8 GHz
  • 512 MB RAM
  • Virtual Memory: 768 MB
Quantitative Results
Marshalling Time

The graphs in Figures 4.1-4.4 represent the marshalling performance for simple and complex XML respectively. Higher performance from JiBX is due to the byte code enhancement done at the compile time. It avoids introspection at runtime.

As seen from Figures 4.3 and 4.4, for multiple concurrent threads the results for XMLBeans worsen, whereas JiBX on the other hand has the same pattern.

Demarshalling Time
Refer to Figures 4.5 and Fig 4.6. Castor provides good support for data binding using code generation, but demarshalling performance is weak compared to others. The graph pattern is similar in the case of concurrent runs.

Castor uses the SAX2 parser and parses the XML file in an event-driven manner, thus reducing its speed and performance as compared to JiBX.

JiBX uses XMLPull parsing technique, unlike the push parsing techniques such as SAX. Instead of the parser calling methods in the handler to report document components, one calls the parser to get each component in turn.

Memory Load
Memory load over the entire run (composed of demarshalling marshalling) is low for JiBX as it uses the XMLPull parsing technique during demarshalling, which doesn't require the parser to maintain the state in the document. Castor uses the SAX2 parser, which requires the parser to maintain the state in the document. JAXB is relatively inefficient in terms of memory usage.

Inferences
From the performance analysis of the four data-binding frameworks, a SWOT analysis is presented in Figure 5. Based on information from Figure 5 and from the performance charts seen earlier, JiBX outperforms all of the other binding frameworks by a wide margin on both time and memory usage. JAXB scores well on marshalling and demarshalling time, but its memory usage is higher than the others. XMLBeans has a major performance hit but is more memory-efficient than Castor and JAXB. Depending on the specific requirement of an application, a suitable XML binding framework can be chosen.

Summary
We have illustrated in this article the pivotal role XML binding frameworks can play in context of SOA. To enable practitioners to make an informed choice about the appropriate XML binding platform, we have carried out a detailed quantitative analysis of some of the popular XML binding frameworks in the J2EE context, and presented the results.

References

More Stories By Dr. Srinivas Padmanabhuni

Dr. Srinivas Padmanabhuni is a principal researcher with the Web Services Centre of Excellence in SETLabs, Infosys Technologies, and specializes in Web Services, service-oriented architecture, and grid technologies alongside pursuing interests in Semantic Web, intelligent agents, and enterprise architecture. He has authored several papers in international conferences. Dr. Padmanabhuni holds a PhD degree in computing science from University of Alberta, Edmonton, Canada.

More Stories By Bijoy Majumdar

Bijoy Majumdar is a member of the Web Services COE (Center of Excellence) for Infosys Technologies, a global IT consulting firm, and has substantial experience in publishing papers, presenting papers at conferences, and defining standards for SOA and Web services. Prior to Infosys, Bijoy Majumdar worked as an IT Analyst, and had been a member of the GE Center of Excellence (e-center) under the E-Business Practice of Tata Consultancy Services.

More Stories By Ujval Mysore

Ujval Mysore is a member of the Web Services COE (Center of Excellence) for Infosys Tehcnologies, a global IT consulting firm, and have substantial experience in publishing papers, presenting papers at conferences, and defining standards for SOA and Web services. The Web Services COE specializes in SOA, Web services, and other related technologies. Dr. Srinivas Padmanabhuni heads the Web Services COE.

More Stories By Vikram Sitaram

Vikram Sitaram is a member of the Web Services COE (Center of Excellence) for Infosys Tehcnologies, a global IT consulting firm, and have substantial experience in publishing papers, presenting papers at conferences, and defining standards for SOA and Web services. The Web Services COE specializes in SOA, Web services, and other related technologies. Dr. Srinivas Padmanabhuni heads the Web Services COE.

Comments (2)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


Microservices Articles
Modern software design has fundamentally changed how we manage applications, causing many to turn to containers as the new virtual machine for resource management. As container adoption grows beyond stateless applications to stateful workloads, the need for persistent storage is foundational - something customers routinely cite as a top pain point. In his session at @DevOpsSummit at 21st Cloud Expo, Bill Borsari, Head of Systems Engineering at Datera, explored how organizations can reap the bene...
"NetApp's vision is how we help organizations manage data - delivering the right data in the right place, in the right time, to the people who need it, and doing it agnostic to what the platform is," explained Josh Atwell, Developer Advocate for NetApp, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
The Jevons Paradox suggests that when technological advances increase efficiency of a resource, it results in an overall increase in consumption. Writing on the increased use of coal as a result of technological improvements, 19th-century economist William Stanley Jevons found that these improvements led to the development of new ways to utilize coal. In his session at 19th Cloud Expo, Mark Thiele, Chief Strategy Officer for Apcera, compared the Jevons Paradox to modern-day enterprise IT, examin...
In his session at 20th Cloud Expo, Mike Johnston, an infrastructure engineer at Supergiant.io, discussed how to use Kubernetes to set up a SaaS infrastructure for your business. Mike Johnston is an infrastructure engineer at Supergiant.io with over 12 years of experience designing, deploying, and maintaining server and workstation infrastructure at all scales. He has experience with brick and mortar data centers as well as cloud providers like Digital Ocean, Amazon Web Services, and Rackspace. H...
Skeuomorphism usually means retaining existing design cues in something new that doesn’t actually need them. However, the concept of skeuomorphism can be thought of as relating more broadly to applying existing patterns to new technologies that, in fact, cry out for new approaches. In his session at DevOps Summit, Gordon Haff, Senior Cloud Strategy Marketing and Evangelism Manager at Red Hat, will discuss why containers should be paired with new architectural practices such as microservices ra...
In his session at 20th Cloud Expo, Scott Davis, CTO of Embotics, discussed how automation can provide the dynamic management required to cost-effectively deliver microservices and container solutions at scale. He also discussed how flexible automation is the key to effectively bridging and seamlessly coordinating both IT and developer needs for component orchestration across disparate clouds – an increasingly important requirement at today’s multi-cloud enterprise.
The Software Defined Data Center (SDDC), which enables organizations to seamlessly run in a hybrid cloud model (public + private cloud), is here to stay. IDC estimates that the software-defined networking market will be valued at $3.7 billion by 2016. Security is a key component and benefit of the SDDC, and offers an opportunity to build security 'from the ground up' and weave it into the environment from day one. In his session at 16th Cloud Expo, Reuven Harrison, CTO and Co-Founder of Tufin, ...
DevOps is often described as a combination of technology and culture. Without both, DevOps isn't complete. However, applying the culture to outdated technology is a recipe for disaster; as response times grow and connections between teams are delayed by technology, the culture will die. A Nutanix Enterprise Cloud has many benefits that provide the needed base for a true DevOps paradigm. In their Day 3 Keynote at 20th Cloud Expo, Chris Brown, a Solutions Marketing Manager at Nutanix, and Mark Lav...
Many organizations are now looking to DevOps maturity models to gauge their DevOps adoption and compare their maturity to their peers. However, as enterprise organizations rush to adopt DevOps, moving past experimentation to embrace it at scale, they are in danger of falling into the trap that they have fallen into time and time again. Unfortunately, we've seen this movie before, and we know how it ends: badly.
TCP (Transmission Control Protocol) is a common and reliable transmission protocol on the Internet. TCP was introduced in the 70s by Stanford University for US Defense to establish connectivity between distributed systems to maintain a backup of defense information. At the time, TCP was introduced to communicate amongst a selected set of devices for a smaller dataset over shorter distances. As the Internet evolved, however, the number of applications and users, and the types of data accessed and...