In the future, information will be extracted from a combination of instruments from multiple platforms, insitu data and model outputs. This is beginning to happen in a few of the mature science areas, such as physical oceanography and atmospheric chemistry. The challenge is to do this efficiently, and in a manner that is repeatable across multiple investigations and disciplines.
Access to data: Not all the data of published studies or those submitted to regulatory bodies is accessible and considerations other than those of science often govern access. Which studies should be included and which should not is a vexed question. Should it be restricted to double blind, controlled studies or should case-control studies also be used? The latter are often easier to perform and less expensive than the former and placebo controlled studies can lead to ethical problems. Should non-peer-reviewed studies be considered? These are often the basis of regulatory submissions. Publication bias towards positive results is a problem as is discovery of unpublished data. The quality of unpublished studies may well be inferior to those which have been published, but could they still merit inclusion? Each possible decision introduces some bias in the final study.
Do methods of the included studies have to be identical? This would further severely limit the number of available cases as in general, it would limit analyses to studies performed with the same protocol.
Differences in responses to drugs exist among ethnic groups and even within such groups. For example, the French or the Austrians are less sensitive to neuromuscular blocking drugs than Americans. Hong Kong Chinese are considerably more sensitive to opioids than Australians. Is it reasonable to ignore such differences? This runs the risk of masking answers which differ from group to group, where A may be the best for group 1 and B for group 2.
These are only some of the problems encountered when studies are combined to enlarge the data pool. Heterogeneity of the studies can also introduce statistical problems, such as Simpson's paradox.