AstraZeneca is a global research-based bio-pharmaceutical company with skills and resources focused on discovering, developing and marketing medicines for some of the world’s most serious illnesses, including cancer, heart disease, neurological disorders such as schizophrenia, respiratory disease and infection.
Success in the pharmaceutical research and discovery process is highly dependent on the availability and accessibility of high quality research data. The quality of the data can be assessed by its accuracy, correctness, completeness, currency and relevance. While the accuracy and the correctness of data are purely defined by the methods used to generate the data, the latter three – completeness, currency and relevance, could be determined partially or completely by an effective semantic data integration approach, which:
Researchers gather information from a broad range of biomedical data sources in an iterative way in order to generate or expand a certain theory, to test hypotheses, and to make educated, informed assertions about which relationships are causal, and about exactly how they are causal. They need a mechanism, which will allow them to mine all data scattered among different relevant resources and to identify visible (direct) and invisible (distant) relations between biomedical entities studied along the pharmaceutical research and discovery process.
Develop a platform for Interactive Relationship Discovery, which allows the identification of long causal relationship chains between the biomedical objects in the Linked Life Data cloud. The platform will be used for early hypothesis testing, which requires identification of direct and non-direct relations between biomedical entities and giving a hint for possible mechanism, which usually remains hidden.
To facilitate the process of relationship discovery, the platform should provide an easy and intuitive tool, which will allow the researchers to interactively mine and explore the causal relations.
The semantic warehousing is a suitable approach to assist researchers in getting an overview on the existing relationships within the scientific and clinical data by utilizing causality data mining. Linked Life Data is used as a platform for Interactive Relationship Discovery between biomedical entities as it:
Since the entities in the Linked Life Data are usually strongly interlinked, the approach for simply crawling/querying the repository for relationships and listing them is not sufficient in most cases. That’s why, in addition Linked Life Data provides defines user-centered process and interactive tools for assistance in the discovery of even very large numbers of causal relations.