Reproducible Pharmacometrics: Using Reproducible Research methodologies to improve pharmacometric analyses

This is the tutorial that was given by Niclas Jonsson and I at PAGE in Glasgow in 2013, also available here: http://www.page-meeting.org/default.asp?abstract=2774

Reproducibility is the cornerstone of scientific research, but is nonetheless a challenging area in pharmacometric data analysis. The large number of intermediate steps required, often involving multiple versions of datasets, combined with a mixture of software tools and the substantial quantity of results that must be tracked and summarized renders traceability an onerous and time-consuming business.

The concept of “reproducible research” is that the final product of scientific research is not just the text of a report or research article, but should also include the full computational environment used to produce the results, including all the associated code and data – and that this bundle of data and scripts should be shared with others who wish to reproduce these results. Although this is not often possible in pharmacometrics, given that data are usually confidential and that it may not be practical to reproduce hundreds of model fits, we can apply the process of reproducible research to our activities as far as possible to ensure that traceability is maintained.

Although there are many approaches that may be taken to adopting this principle, we shall focus on the combination of R, knitr and LaTeX. These tools together enable the end-to-end scripting of data file creation, capture of results from external software tools and subsequent analyses, and can automate the creation of publication-quality reports, articles and slide decks.

This tutorial demonstrates that applying techniques such as these is not particularly difficult, especially now that they are coming into general use and support from software tools is maturing. The substantial benefits of doing so are covered, which include increased accuracy, efficiency, reliability and credibility, elimination of transcription errors, built-in traceability, and the ability to reproduce an analysis, including article or report, in its entirety years later. A live demonstration will be available during the poster sessions.

The slides shown at PAGE are here: 1606-Page2013-ReproducibleResearch_v3.1.pdf (1.1 MB). Support materials are here: ReproducibleResearchDemo.zip (1010.7 KB)

Comments / criticism / questions welcome!

2 Likes