Identification of a possible proteomic biomarker in Parkinson’s disease: discovery and replication in blood, brain and cerebrospinal fluid
Winchester L., Barber I., Lawton M., Ash J., Liu B., Evetts S., Hopkins-Jones L., Lewis S., Bresner C., Malpartida AB., Williams N., Gentlemen S., Wade-Martins R., Ryan B., Holgado-Nevado A., Hu M., Ben-Shlomo Y., Grosset D., Lovestone S.
Abstract Biomarkers to aid diagnosis and delineate the progression of Parkinson’s disease are vital for targeting treatment in the early phases of the disease. Here, we aim to discover a multi-protein panel representative of Parkinson’s and make mechanistic inferences from protein expression profiles within the broader objective of finding novel biomarkers. We used aptamer-based technology (SomaLogic®) to measure proteins in 1599 serum samples, 85 cerebrospinal fluid samples and 37 brain tissue samples collected from two observational longitudinal cohorts (the Oxford Parkinson’s Disease Centre and Tracking Parkinson’s) and the Parkinson’s Disease Brain Bank, respectively. Random forest machine learning was performed to discover new proteins related to disease status and generate multi-protein expression signatures with potential novel biomarkers. Differential regulation analysis and pathway analysis were performed to identify functional and mechanistic disease associations. The most consistent diagnostic classifier signature was tested across modalities [cerebrospinal fluid (area under curve) = 0.74, P = 0.0009; brain area under curve = 0.75, P = 0.006; serum area under curve = 0.66, P = 0.0002]. Focusing on serum samples and using only those with severe disease compared with controls increased the area under curve to 0.72 (P = 1.0 × 10−4). In the validation data set, we showed that the same classifiers were significantly related to disease status (P < 0.001). Differential expression analysis and weighted gene correlation network analysis highlighted key proteins and pathways with known relationships to Parkinson’s. Proteins from the complement and coagulation cascades suggest a disease relationship to immune response. The combined analytical approaches in a relatively large number of samples, across tissue types, with replication and validation, provide mechanistic insights into the disease as well as nominate a protein signature classifier that deserves further biomarker evaluation.