Microservices Expo: Article

SOA to the Rescue, When Drug Discovery Needs Data Fast!

Information is key to drug discovery

As the demand for new medicines grows, so does the need for better information to manage and execute R&D processes. There is huge pressure to make informed decisions, especially during a project's early stages, when risk is high and downstream costs have not yet been incurred.

Pfizer spends billions of dollars on research projects annually. At Pfizer Global R&D, where the company's drug discovery takes place, research scientists and managers require vast amounts of up-to-the-minute information on lab results, submission status, and project schedules to move new research forward quickly. Management must constantly analyze the entire portfolio of new medicines in discovery to look for opportunities, trends, and areas where attention is needed. Researchers and managers strive to bring together the best ideas, practices, and policies, as well as the best use of information.

At Pfizer's Research Informatics Division within Global Research and Development, we seek to provide the best information possible to our R&D customers. Meeting this mission requires constant innovation. Over the past several years, we have faced a number of challenges that caused us to evolve our information delivery methods and technologies significantly. The changes include a new approach to real-time data integration, based on SOA data services, that lets us build new solutions more rapidly and in alignment with our SOA strategies.

Data Integration Is a Critical Requirement
At Pfizer R&D, the information required for executing and managing projects is drawn from many sources, including laboratory research, historical records, clinical trials, and business intelligence. The data is complex, diverse, and spread across the company in various technology and application silos.

Through innovative use of analytics, reporting, and portal technology, we have made great strides toward improving how this information is presented internally. However, data integration remains the biggest challenge in effectively providing information to our researchers and managers.

Why is this critical? To properly assess a portfolio of discovery projects, Pfizer managers must pull data from sources such as packaged applications, data warehouses holding historical data, document repositories, and custom systems. Each source has its own access mechanisms, syntax, and security. Few are structured properly for consumption, let alone reuse. Together, these factors slow down new application development projects.

Time Is of the Essence!
To move new research forward as quickly as possible, our research scientists and managers must have critical up-to-date information from across our wide array of source systems. If information is only refreshed monthly, then necessary course corrections are typically delayed by several weeks. A few weeks may not sound like a lot, but on a 24-month project, these weeks can easily add up to six months or more.
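The arithmetic behind that estimate can be sketched roughly as follows. This is an illustrative model, not a figure from Pfizer: it assumes that each course correction waits, on average, half a refresh interval for fresh data, that the corrections fall one after another on the project's critical path, and that a 24-month project needs about a dozen of them. The function name and every parameter value are hypothetical.

```python
def cumulative_delay_weeks(refresh_interval_weeks: float, corrections: int) -> float:
    """Total expected delay if each correction waits half a refresh interval.

    Assumption: the need for a correction arises at a uniformly random point
    within a refresh cycle, so the average wait is interval / 2.
    """
    average_wait = refresh_interval_weeks / 2
    return average_wait * corrections


# Hypothetical inputs: 12 sequential course corrections over the project.
monthly = cumulative_delay_weeks(refresh_interval_weeks=52 / 12, corrections=12)
daily = cumulative_delay_weeks(refresh_interval_weeks=1 / 7, corrections=12)

print(f"monthly refresh: {monthly:.1f} weeks of accumulated delay")
print(f"daily refresh:   {daily:.1f} weeks of accumulated delay")
```

Under these assumptions, a monthly refresh accumulates about 26 weeks of delay, roughly six months, while a daily refresh accumulates less than one week.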

For new IT projects, time is of the essence. Business agility requires IT agility. Pfizer's researchers and managers, like their business user counterparts, constantly make new demands on IT for new information systems to help the business perform more efficiently, effectively, and competitively. This means we must build new systems quickly. Rapid application development (RAD) techniques are highly desired. In fact, we continuously evaluate our Enterprise Development Life Cycle (EDLC) processes with a primary objective of reducing time-to-solution with faster responses to business needs.

SOA-Compliance Is an Important Requirement
Our SOA adoption has accelerated in recent years. Specifically, we use SOA approaches to increase reuse of existing components, save development time, and cut costs. So, we strive to use SOA methods and technologies whenever possible.

With respect to SOA and data integration, we've found that SOA helps break down silo-type data gathering and integration processes by standardizing how data is promoted and reused. The ability to virtualize and abstract via data services helps groups to easily understand and consume data confidently, reliably, and quickly without having to hunt for these sources or rely on manual processes for gathering and integrating them.
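As a minimal sketch of what virtualizing and abstracting data behind a data service can look like, the Python below puts one service interface in front of two stand-in sources. All class, method, and field names here are hypothetical and chosen for illustration; they do not describe Pfizer's systems or any particular product.

```python
from typing import Iterable, Protocol


class Source(Protocol):
    """Anything that can answer a per-project lookup (hypothetical interface)."""

    def fetch(self, project_id: str) -> dict: ...


class LabResultsSource:
    """Stand-in for a laboratory system."""

    def __init__(self, results: dict[str, list[float]]) -> None:
        self._results = results

    def fetch(self, project_id: str) -> dict:
        return {"assay_count": len(self._results.get(project_id, []))}


class ScheduleSource:
    """Stand-in for a project scheduling system."""

    def __init__(self, milestones: dict[str, str]) -> None:
        self._milestones = milestones

    def fetch(self, project_id: str) -> dict:
        return {"next_milestone": self._milestones.get(project_id)}


class ProjectDataService:
    """Combines the sources at request time; it keeps no copy of their data."""

    def __init__(self, sources: Iterable[Source]) -> None:
        self._sources = list(sources)

    def get_project_view(self, project_id: str) -> dict:
        view: dict = {"project_id": project_id}
        for source in self._sources:
            view.update(source.fetch(project_id))
        return view


service = ProjectDataService([
    LabResultsSource({"PRJ-001": [0.82, 0.79, 0.91]}),
    ScheduleSource({"PRJ-001": "candidate selection"}),
])
view = service.get_project_view("PRJ-001")
print(view)
```

Consumers call `get_project_view` and never need to know where each field lives or how each source is accessed, which is the point of the abstraction.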

Old Extract & Mart-based Approaches Can't Meet New Requirements
Traditionally, Pfizer has used three approaches to data integration. The first is custom coding directly between sources and consuming applications. This works well for our simple integration problems, where there are one or two defined sources and little transformation is needed. But as sources are added and complex data structures keep changing, this delivery approach shows severe limitations.

Second, we've used replicated file extracts as a way to integrate data. File extracts handle data silos more efficiently than custom coding. For example, application teams that need data receive periodic file extracts from the application teams that manage the source data applications. This arms-length batch approach minimizes the impact on source systems and is useful for daily transaction summaries, shared reference data, etc. However, data integration beyond simple access - abstraction, transformation, federation, and more - requires extra work by the consuming team. This method also proliferates replicated data without any controls on quality, security, or scalability.
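The staleness that comes with replicated extracts can be shown with a small, deliberately simplified Python sketch. The record layout, identifiers, and function names are hypothetical; an in-memory dictionary stands in for a source application, and a deep copy stands in for a periodic file extract.

```python
import copy

# Stand-in for the system of record owned by the source application team.
source_system = {"PRJ-001": {"status": "in assay"}}

# Batch extract: the consuming team receives a point-in-time copy.
extract = copy.deepcopy(source_system)

# The source team updates its record after the extract was cut.
source_system["PRJ-001"]["status"] = "assay complete"


def read_from_extract(project_id: str) -> str:
    """What the consuming application sees until the next extract arrives."""
    return extract[project_id]["status"]


def read_live(project_id: str) -> str:
    """What a real-time read against the source would return."""
    return source_system[project_id]["status"]


print(read_from_extract("PRJ-001"))  # in assay
print(read_live("PRJ-001"))          # assay complete
```

Every additional copy of the data is another place where this divergence can occur, and another place where quality and security controls have to be applied separately.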

Extract, Transform, and Load (ETL) with data marts or warehouses is our third approach to data integration. This kind of physical data replication has several advantages in terms of rationalizing and combining heterogeneous data from multiple sources. For large-scale multi-dimensional analysis, we find data warehouses are effective solutions given their ability to support the large volumes and significant schema transformations typically required. To date, this has been the data integration approach of choice for our medium and large-scale data integration projects.

Unfortunately, these three approaches are not always effective for our customers. Because our customers must make decisions based on near real-time data, they often can't afford the extra development time required for building and testing custom code, file extracts, and data marts. Forcing our business users to wait extra months for new solutions to be developed has a huge impact on how quickly we get new drugs to market.

Further, typical data mart/replication architectures don't easily fit into our new SOA strategy. New data integration projects must be SOA-enabled from the start, so they can deliver value moving forward.

Given the accelerating business demand for new systems from the R&D groups we support, my team decided to find a new approach to data integration that lets us build additional real-time solutions more rapidly, and in alignment with our SOA strategies, while avoiding the replication downside.

To do this, we launched a project with the goal of identifying and adopting a new approach to data integration that meets the following criteria:

  1. Complex data integration across multiple heterogeneous sources without unnecessary data replication,
  2. Real-time information delivery,
  3. Rapid application development, and
  4. SOA-compliance.

More Stories By Daniel Eng

Daniel Eng has over 17 years of diverse IT experience in managing projects, leading technical teams, and developing enterprise applications within Fortune 100 companies. Currently at Pfizer Global Research and Development, Dan is leading efforts in transitioning business processes and applications into a SOA environment by using emerging technologies and agile management practices. Prior to Pfizer, he was an independent consultant helping his Fortune 500 clients in developing intranet sites, portable applications and e-commerce solutions. Dan has also worked in many e-commerce start-ups and healthcare organizations. He holds a BSEE degree from Polytechnic University and an MBA degree from Gonzaga University.


