Welcome!

Microservices Expo Authors: Flint Brenton, Elizabeth White, Pat Romanski, Cameron Van Orman, Jason Bloomberg

Related Topics: Microservices Expo, Java IoT, Industrial IoT

Microservices Expo: Article

SQL Peer-to-Peer Dynamic Structured Data Processing Collaboration

Using automatic metadata maintenance

Unstructured and XML semi-structured data is now used more than structured data. Unstructured data is useful because of its fuzzy processing applied to this more common ubiquitous data.  But fixed structured data still keeps businesses running day in and day out, which requires consistent predictable highly principled processing for correct results. This means structured data cannot be replaced by unstructured or semi-structured data.  For this reason, it would be very useful to have a general purpose peer-to-peer collaboration capability that can utilize highly principled hierarchical data processing and its flexible and advanced structured processing to support dynamically structured data and its dynamic structured processing.  This flexible dynamic structured processing can change the structure of the data as necessary for the required processing while preserving the relational and hierarchical data principles and semantics of the data to derive correct structured data results even after structure transformations.

This processing will perform freely across remote unrelated peer locations anytime and transparently support unpredictable structured data and data type changes automatically for immediate processing.  Such an automatic peer-to-peer dynamic structured data collaboration is depicted below with its dynamic working hierarchical data structure being modified at each peer site that are labeled P1 to P4 in the diagram directly below.  Its operation is described below the diagram.


Diagram Description

In the diagram above, peer locations: P1, P2, P3, and P4 located anywhere need to collaborate and share their structured data in order to produce a needed result. This process will require unpredictably changing the data structure and data types as it becomes necessary to achieve the desired need and result.  Peer 1 starts the collaboration process by inputting three relational tables, A, B, and C, and models them into a hierarchical structure sending it off to Peer 2 for further processing. Concurrently overlapping with peer 2's processing, Peer 1 also inputs a XML linear hierarchical structure, XYZ, and transforms it into a nonlinear multipath hierarchical structure sending it off to Peer 3 for further processing.

Peer 2 and peer 3 are now performing independently and concurrently. Each is: retrieving their structure input from peer 1, inputting additional relational table data from their different peer home locations, and joining this data to their working data structures. On completion, peer 2 and peer 3 both send their modified data structures off to common peer 4 for further processing.

Peer 4 accepts the modified data structures from both peer 2 and peer 3 which operated concurrently. It hierarchically joins them together using a matching data item value between nodes B and X (B.b=X.x). Peer 4 then eliminates unneeded data items from the joined result using SQL's dynamic SELECT operation to select data items for output  from nodes A, B, E, Y and W.  This SQL query looks like: SELECT A.a, B.b, E.e, Y.y, W.w FROM P2View LEFT JOIN P3View ON B.b=X.x.  This slices out all nodes (C, D, Z ,X, V) that were not referenced by the SELECT statement. This automatically aggregates the necessary data nicely as shown in the diagram above. This process is known as projection in relational processing and node promotion in hierarchical processing. The LEFT JOIN operation hierarchically places P2's structure over P3's structure connected by the ON clause specification of: B.b=X.x. This newly combined hierarchical structure in peer 4 is sent back to Peer 1 for immediate review and processing where the hierarchical data can be selectively output in different formats each with different data selections as shown in the above diagram.

During this entire peer-to-peer collaboration process, the changing data structures and data types are automatically maintained and utilized transparently for the user as needed. The user at each receiving peer can also view the current active structure and its data types. But knowledge of the structure is not necessary for the user to specify in the query because the maintained structure is automatically known and used inherently by the query processor.  Different working data structure versions can also be saved and restored at each peer by the user.

Integrating SQL With Peer-to-Peer Structured Data Collaboration

The problem with performing the above type of dynamic processing is that structured data processing has been limited to fixed static structure processing because dynamically generated structured data cannot be handled today.  Sharing structured data today is performed with shared metadata.  The metadata remains the same, so the structure must remain static. But with dynamic structured processing, the data structure can be dynamically modified as needed to support the required structured operation as shown and described above. This requires automatic metadata maintenance which has not previously been supported by the industry for structured data processing.

An advanced ANSI SQL transparent hierarchical processor prototype, SQLfX (www.adatinc.com/demo.html), has been developed that can support the required dynamic and flexible structured data processing necessary for collaboration. In addition, it uses SQL's inherent hierarchical data processing capabilities that naturally support full multipath dynamic hierarchical data structures. This allows the most complex hierarchical operations to be performed in order to always meet the need at the required time and peer locations with the SQLfX processor. It already operates structure-aware because of its dynamic processing which is also necessary for the required automatic metadata maintenance to occur automatically at each peer because of the dynamically changing data structure and data items.

The ANSI SQL SQLfX flexible dynamic hierarchical processing technology can be enhanced to integrate with peer-to-peer structured data processing collaboration that eliminates the user control necessary for the dynamically changing metadata. The automatic metadata maintenance supplies the updated current metadata that accompanies the data when transmitted between peers. This allows amazingly fast on the fly advanced hierarchical structured data processing collaboration. This enables previously unknown structure results delivered to any peer to be immediate processed automatically by a SQLfX SQL processor located at the peer location.

SQLfX SQL controls the peer-to-peer processing sending and receiving of data structures using new SQL InFile and OutFile keywords added by SQLfX for this purpose. Password and data encryption can also be supported for data security. Further dynamic processing and data structure modification can be performed at each peer visited in any order including in parallel as shown in the diagram above. This opens up the new capability of dynamic structured data processing and its automatic and transparent metadata handling.

SQL Hierarchical Processing Capabilities for Structured Data Collaboration
SQLfX is a powerful new ANSI SQL transparent multipath hierarchical processor that dynamically processes heterogeneous logical flat data like relational and physical hierarchical data such as XML initially. This SQLfX full dynamic hierarchical data processing enables logical and physical structures to be hierarchically joined and modeled dynamically. This hierarchical processing significantly increases the power of the data structure and the queries applied to it.  This is extremely flexible and powerful and is automatically performed without user hierarchical navigation. This operation naturally utilizes the hierarchical semantic information between the pathways to process powerful multipath queries. This can freely reference multipath queries that can for example select data from one path based on data in another path. This unlimited dynamic processing requires special automatic processing known as Lowest Common Ancestor (LCA) processing which enables any conceivable valid multipath query to be processed automatically.  This capability is supported in SQLfX naturally and is missing in other data processors such as XQuery.

An additional valuable benefit of using hierarchical structures is that they are great at naturally organizing and reusing data. Their ability to freely create and grow logical hierarchical multipath structures dynamically also has another overlooked powerful benefit.  It continually increases the data value of the data nonlinearly through automatic data reuse and sharing of the data at higher levels with the multiple lower levels in a pyramid fashion. In addition, the dynamic joining of these hierarchical structures can dynamically increase their data value and querying power many times.  Another powerful advantage are logical hierarchical data structures that are assembled on the fly when creating new structures such as when structures are joined and exist only when and while they are being used. These logical structures add flexibility to hierarchical structures and efficiency to their new use.

Hierarchical structures can also be hierarchically data filtered in their entirety following hierarchical semantics using SQL's WHERE clause to filter the data by data value to only the precise desired data result. This is a complex and powerful operation because data filtering applied to any node data item in a hierarchical structure affects all other nodes of the structure. This is because every node in a hierarchical structure is related to every other node in the data structure. This is demonstrated in Diagram 2 below where filtering node E winds up affecting all other nodes sometimes indirectly through a cousin relationship such as node B. In this example, all nodes with a data occurrence related to data item E equal to 25 are filtered out. The data filtering flow is represented by the arrows.  This is a powerful concept giving multipath processing and WHERE clause hierarchical data processing significant power. This global hierarchical structure filtering is particularly useful when combined with transferring these powerful multipath structures between peers in peer-to-peer processing and the entire structure needs to be filtered for some data value condition. This can be a complex condition involving multiple paths.

SQLfX SQL also supports a very advanced dynamic any-to-any data structure transformation and a data structure virtualization capability. This allows all hierarchically data transformations to be performed semantically correct at a high SQL processing level. With multipath hierarchical processing and any-to-any structure transformations, a variety of hypothetical, experimental, research, exploratory, and problem solving queries can be carried out immediately in an unrestricted fashion further enhanced by powerful real time hierarchical processing collaboration.

Conclusion

All of the powerful and flexible capabilities mentioned in this article make multipath hierarchical structures and their hierarchical processing the perfect opportunity for this dynamic structured data processing collaboration. The universally known SQL interface makes it a perfect API and this is backed by this new relational hierarchical processing technology.  Single one-way data transmissions will also always be available to send to anyone any time because a receive-only version of SQLfX peer-to-peer will be freely available to download and use to automatically view and utilize the one-way transmitted data structure. Additional information on SQLfX's advanced hierarchical processing capabilities and operation can be found at www.adatinc.com. Persons and Companies wanting more information or help on SQL  peer-to-peer dynamic structured data processing collaboration can contact [email protected].

More Stories By Michael M David

Michael M. David is founder and CTO of Advanced Data Access Technologies, Inc. He has been a staff scientist and lead XML architect for NCR/Teradata and their representative to the SQLX Group. He has researched, designed and developed commercial query languages for heterogeneous hierarchical and relational databases for over twenty years. He has authored the book "Advanced ANSI SQL Data Modeling and Structure Processing" Published by Artech House Publishers and many papers and articles on database topics. His research and findings have shown that Hierarchical Data Processing is a subset of Relational Processing and how to utilize this advanced inherent capability in ANSI SQL. Additionally, his research has shown that advanced multipath (LCA) processing is also naturally supported and performed automatically in ANSI SQL, and advanced hierarchical processing operations are also possible. These advanced capabilities can be performed and explained in the ANSI SQL Transparent XML Hierarchical Processor at his site at: www.adatinc.com/demo.html.

@MicroservicesExpo Stories
The nature of the technology business is forward-thinking. It focuses on the future and what’s coming next. Innovations and creativity in our world of software development strive to improve the status quo and increase customer satisfaction through speed and increased connectivity. Yet, while it's exciting to see enterprises embrace new ways of thinking and advance their processes with cutting edge technology, it rarely happens rapidly or even simultaneously across all industries.
Is advanced scheduling in Kubernetes achievable? Yes, however, how do you properly accommodate every real-life scenario that a Kubernetes user might encounter? How do you leverage advanced scheduling techniques to shape and describe each scenario in easy-to-use rules and configurations? In his session at @DevOpsSummit at 21st Cloud Expo, Oleg Chunikhin, CTO at Kublr, will answer these questions and demonstrate techniques for implementing advanced scheduling. For example, using spot instances ...
With the rise of DevOps, containers are at the brink of becoming a pervasive technology in Enterprise IT to accelerate application delivery for the business. When it comes to adopting containers in the enterprise, security is the highest adoption barrier. Is your organization ready to address the security risks with containers for your DevOps environment? In his session at @DevOpsSummit at 21st Cloud Expo, Chris Van Tuin, Chief Technologist, NA West at Red Hat, will discuss: The top security r...
DevSecOps – a trend around transformation in process, people and technology – is about breaking down silos and waste along the software development lifecycle and using agile methodologies, automation and insights to help get apps to market faster. This leads to higher quality apps, greater trust in organizations, less organizational friction, and ultimately a five-star customer experience. These apps are the new competitive currency in this digital economy and they’re powered by data. Without ...
With the modern notion of digital transformation, enterprises are chipping away at the fundamental organizational and operational structures that have been with us since the nineteenth century or earlier. One remarkable casualty: the business process. Business processes have become so ingrained in how we envision large organizations operating and the roles people play within them that relegating them to the scrap heap is almost unimaginable, and unquestionably transformative. In the Digital ...
These days, APIs have become an integral part of the digital transformation journey for all enterprises. Every digital innovation story is connected to APIs . But have you ever pondered over to know what are the source of these APIs? Let me explain - APIs sources can be varied, internal or external, solving different purposes, but mostly categorized into the following two categories. Data lakes is a term used to represent disconnected but relevant data that are used by various business units wit...
Today most companies are adopting or evaluating container technology - Docker in particular - to speed up application deployment, drive down cost, ease management and make application delivery more flexible overall. As with most new architectures, this dream takes significant work to become a reality. Even when you do get your application componentized enough and packaged properly, there are still challenges for DevOps teams to making the shift to continuous delivery and achieving that reducti...
Most of the time there is a lot of work involved to move to the cloud, and most of that isn't really related to AWS or Azure or Google Cloud. Before we talk about public cloud vendors and DevOps tools, there are usually several technical and non-technical challenges that are connected to it and that every company needs to solve to move to the cloud. In his session at 21st Cloud Expo, Stefano Bellasio, CEO and founder of Cloud Academy Inc., will discuss what the tools, disciplines, and cultural...
Enterprises are moving to the cloud faster than most of us in security expected. CIOs are going from 0 to 100 in cloud adoption and leaving security teams in the dust. Once cloud is part of an enterprise stack, it’s unclear who has responsibility for the protection of applications, services, and data. When cloud breaches occur, whether active compromise or a publicly accessible database, the blame must fall on both service providers and users. In his session at 21st Cloud Expo, Ben Johnson, C...
21st International Cloud Expo, taking place October 31 - November 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Me...
‘Trend’ is a pretty common business term, but its definition tends to vary by industry. In performance monitoring, trend, or trend shift, is a key metric that is used to indicate change. Change is inevitable. Today’s websites must frequently update and change to keep up with competition and attract new users, but such changes can have a negative impact on the user experience if not managed properly. The dynamic nature of the Internet makes it necessary to constantly monitor different metrics. O...
Agile has finally jumped the technology shark, expanding outside the software world. Enterprises are now increasingly adopting Agile practices across their organizations in order to successfully navigate the disruptive waters that threaten to drown them. In our quest for establishing change as a core competency in our organizations, this business-centric notion of Agile is an essential component of Agile Digital Transformation. In the years since the publication of the Agile Manifesto, the conn...
Many organizations are now looking to DevOps maturity models to gauge their DevOps adoption and compare their maturity to their peers. However, as enterprise organizations rush to adopt DevOps, moving past experimentation to embrace it at scale, they are in danger of falling into the trap that they have fallen into time and time again. Unfortunately, we've seen this movie before, and we know how it ends: badly.
There is a huge demand for responsive, real-time mobile and web experiences, but current architectural patterns do not easily accommodate applications that respond to events in real time. Common solutions using message queues or HTTP long-polling quickly lead to resiliency, scalability and development velocity challenges. In his session at 21st Cloud Expo, Ryland Degnan, a Senior Software Engineer on the Netflix Edge Platform team, will discuss how by leveraging a reactive stream-based protocol,...
Many organizations adopt DevOps to reduce cycle times and deliver software faster; some take on DevOps to drive higher quality and better end-user experience; others look to DevOps for a clearer line-of-sight to customers to drive better business impacts. In truth, these three foundations go together. In this power panel at @DevOpsSummit 21st Cloud Expo, moderated by DevOps Conference Co-Chair Andi Mann, industry experts will discuss how leading organizations build application success from all...
The last two years has seen discussions about cloud computing evolve from the public / private / hybrid split to the reality that most enterprises will be creating a complex, multi-cloud strategy. Companies are wary of committing all of their resources to a single cloud, and instead are choosing to spread the risk – and the benefits – of cloud computing across multiple providers and internal infrastructures, as they follow their business needs. Will this approach be successful? How large is the ...
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
"NetApp's vision is how we help organizations manage data - delivering the right data in the right place, in the right time, to the people who need it, and doing it agnostic to what the platform is," explained Josh Atwell, Developer Advocate for NetApp, in this SYS-CON.tv interview at 20th Cloud Expo, held June 6-8, 2017, at the Javits Center in New York City, NY.
The “Digital Era” is forcing us to engage with new methods to build, operate and maintain applications. This transformation also implies an evolution to more and more intelligent applications to better engage with the customers, while creating significant market differentiators. In both cases, the cloud has become a key enabler to embrace this digital revolution. So, moving to the cloud is no longer the question; the new questions are HOW and WHEN. To make this equation even more complex, most ...
One of the biggest challenges with adopting a DevOps mentality is: new applications are easily adapted to cloud-native, microservice-based, or containerized architectures - they can be built for them - but old applications need complex refactoring. On the other hand, these new technologies can require relearning or adapting new, oftentimes more complex, methodologies and tools to be ready for production. In his general session at @DevOpsSummit at 20th Cloud Expo, Chris Brown, Solutions Marketi...