discoveryHub - "Standard Edition"


      discoveryHub is a software product based on a mature, patented technology more than eight years in the making. The product is designed as a discovery middleware platform that allows corporations to gain access to a wide variety of life-sciences data sources either online 'live' over the internet or to create localized caches of public data in-house.

Informatics Infrastructure:

The discoveryHub is a complete biological data acquisition and integration platform. This middleware platform sits on top of established database engines such as Oracle, DB2 and Sybase. It uses the familiar SQL interface to support real-time queries from any data domain. APIs enable query and retrieval, with the results coming back to you in the form you require. The discoveryHub complements, extends and enhances the current infrastructure of any biotech company to help accelerate the drug discovery process. The discoveryHub is a complete mathematical solution built specifically to handle the enormous complexities and volume of life science data. And because it incorporates all of the processing necessary to acquire and integrate data, the IT task is simplified and accelerated.

Complex Data Handling:

The primary factor limiting data acquisition and integration across multiple, structurally complex, heterogeneous data sources was the inherent design of relational databases. Life science data is very complicated and using flat two-dimensional structures to store and retrieve data is very inefficient. The breakthrough came from an academically recognized extension of the relational mathematics on which the heart of our system, the discoveryHub is based. This nested relational calculus, created by geneticXchange founder Dr. Limsoon Wong at the University of Pennsylvania, recognizes the limitation of existing RDBMS systems in dealing with complex (nested) data structures. The discoveryHub uses a multidimensional approach to handle the complex data structure and can elegantly store and retrieve data by simple SQL commands.

Multiple Data Sources:

There are 60+ wrappers available today. Ultra thin wrappers are rapidly generated with our Wrapper Development Kit (WDK) to access and read data from life science's growing profusion of data stores. Typically a class of wrapper is created first. This lets you retrieve and write out data from prime data classes like algorithms, web sites, flat files, text documents and XML. Then within each class, an unlimited number of additional, targeted, individual wrappers can be generated. The familiar bottleneck slowing down your projects is eliminated once and for all.

Nested Relational Calculus:

The core discoveryHub technology is based on a mathematical principle called 'Nested Relational Calculus' which is similar to the theory behind most of the relational database management systems (RDBMS) on the market today with one major difference: the discoveryHub nested relational calculus can deal inherently with heterogeneous, hierarchical structured data.

discoveryHub was built from the ground up utilizing Nested Relational Calculus as its underlying principle, so its query processing engine can uniquely allow users to select, join and in some cases update data that resides in complex and often disparate nested structures.

High Productivity:

Researchers remain in total, immediate control of both their data and process. The discoveryHub delivers an immediate, ready to go, data integration solution without the need for new coding or tedious reprogramming. Instead of creating custom solutions, the IT departments can save significant time and money by using the discoveryHub.

      Sitemap