1887

Abstract

The severity and progression of lung disease are highly variable across individuals with cystic fibrosis (CF) and are imperfectly predicted by mutations in the human gene CFTR, lung microbiome variation or other clinical factors. The opportunistic pathogen () dominates airway infections in most CF adults. Here we hypothesized that within–host genetic variation of populations would be associated with lung disease severity. To quantify genetic variation within CF sputum samples, we used deep amplicon sequencing (AmpliSeq) of 209 genes previously associated with pathogenesis or adaptation to the CF lung. We trained machine learning models using single nucleotide variants (SNVs), microbiome diversity data and clinical factors to classify lung disease severity at the time of sputum sampling, and to predict lung function decline after 5 years in a cohort of 54 adult CF patients with chronic infection. Models using SNVs alone classified lung disease severity with good sensitivity and specificity (area under the receiver operating characteristic curve: AUROC=0.87). Models were less predictive of lung function decline after 5 years (AUROC=0.74) but still significantly better than random. The addition of clinical data, but not sputum microbiome diversity data, yielded only modest improvements in classifying baseline lung function (AUROC=0.92) and predicting lung function decline (AUROC=0.79), suggesting that AmpliSeq data account for most of the predictive value. Our work provides a proof of principle that genetic variation in sputum tracks lung disease severity, moderately predicts lung function decline and could serve as a disease biomarker among CF patients with chronic infections.

Funding
This study was supported by the:
  • Fonds de Recherche du Québec - Santé
    • Principle Award Recipient: NguyenDao
  • Canadian Institutes for Health Research
    • Principle Award Recipient: DaoNguyen
  • Génome Québec
    • Principle Award Recipient: B.Jesse Shapiro
  • Genome Canada
    • Principle Award Recipient: B.Jesse Shapiro
  • This is an open-access article distributed under the terms of the Creative Commons Attribution License.
Loading

Article metrics loading...

/content/journal/mgen/10.1099/mgen.0.000981
2023-04-13
2024-03-28
Loading full text...

Full text loading...

/deliver/fulltext/mgen/9/4/mgen000981.html?itemId=/content/journal/mgen/10.1099/mgen.0.000981&mimeType=html&fmt=ahah

References

  1. Welsh MJ, Ramsey BW, Accurso F, Cutting GR. Cystic fibrosis. In Scriver CR, Beaudet AL, Sly WS, Valle D. eds Metabolic & Molecular Bases of Inherited Disease, 8th. edn vol 3 New York: McGraw-Hill; 2001 pp 5121–5188
    [Google Scholar]
  2. Turcios NL. Cystic fibrosis lung disease: an overview. Respir Care 2020; 65:233–251 [View Article]
    [Google Scholar]
  3. Shanthikumar S, Neeland MN, Saffery R, Ranganathan S. Gene modifiers of cystic fibrosis lung disease: a systematic review. Pediatr Pulmonol 2019; 54:1356–1366 [View Article]
    [Google Scholar]
  4. Coburn B, Wang PW, Diaz Caballero J, Clark ST, Brahma V et al. Lung microbiota across age and disease stage in cystic fibrosis. Sci Rep 2015; 5:10241 [View Article]
    [Google Scholar]
  5. Cox MJ, Allgaier M, Taylor B, Baek MS, Huang YJ et al. Airway microbiota and pathogen abundance in age-stratified cystic fibrosis patients. PLoS One 2010; 5:e11044 [View Article]
    [Google Scholar]
  6. Cuthbertson L, Walker AW, Oliver AE, Rogers GB, Rivett DW et al. Lung function and microbiota diversity in cystic fibrosis. Microbiome 2020; 8:45 [View Article] [PubMed]
    [Google Scholar]
  7. Zhao J, Schloss PD, Kalikin LM, Carmody LA, Foster BK et al. Decade-long bacterial community dynamics in cystic fibrosis airways. Proc Natl Acad Sci 2012; 109:5809–5814 [View Article]
    [Google Scholar]
  8. Goddard AF, Staudinger BJ, Dowd SE, Joshi-Datar A, Wolcott RD et al. Direct sampling of cystic fibrosis lungs indicates that DNA-based analyses of upper-airway specimens can misrepresent lung microbiota. Proc Natl Acad Sci 2012; 109:13769–13774 [View Article]
    [Google Scholar]
  9. Emerson J, Rosenfeld M, McNamara S, Ramsey B, Gibson RL. Pseudomonas aeruginosa and other predictors of mortality and morbidity in young children with cystic fibrosis. Pediatr Pulmonol 2002; 34:91–100 [View Article]
    [Google Scholar]
  10. Fothergill JL, Walshaw MJ, Winstanley C. Transmissible strains of Pseudomonas aeruginosa in cystic fibrosis lung infections. Eur Respir J 2012; 40:227–238 [View Article]
    [Google Scholar]
  11. Kosorok MR, Zeng L, West SE, Rock MJ, Splaingard ML et al. Acceleration of lung disease in children with cystic fibrosis after Pseudomonas aeruginosa acquisition. Pediatr Pulmonol 2001; 32:277–287 [View Article]
    [Google Scholar]
  12. Harris JK, Wagner BD, Zemanick ET, Robertson CE, Stevens MJ et al. Changes in airway microbiome and inflammation with ivacaftor treatment in patients with cystic fibrosis and the G551D mutation. Ann Am Thorac Soc 2020; 17:212–220 [View Article]
    [Google Scholar]
  13. Hisert KB, Heltshe SL, Pope C, Jorth P, Wu X et al. Restoring cystic fibrosis transmembrane conductance regulator function reduces airway bacteria and inflammation in people with cystic fibrosis and chronic lung infections. Am J Respir Crit Care Med 2017; 195:1617–1628 [View Article]
    [Google Scholar]
  14. Bragonzi A, Paroni M, Nonis A, Cramer N, Montanari S et al. Pseudomonas aeruginosa microevolution during cystic fibrosis lung infection establishes clones with adapted virulence. Am J Respir Crit Care Med 2009; 180:138–145 [View Article]
    [Google Scholar]
  15. Folkesson A, Jelsbak L, Yang L, Johansen HK, Ciofu O et al. Adaptation of Pseudomonas aeruginosa to the cystic fibrosis airway: an evolutionary perspective. Nat Rev Microbiol 2012; 10:841–851 [View Article]
    [Google Scholar]
  16. Marvig RL, Sommer LM, Molin S, Johansen HK. Convergent evolution and adaptation of Pseudomonas aeruginosa within patients with cystic fibrosis. Nat Genet 2015; 47:57–64 [View Article]
    [Google Scholar]
  17. Mowat E, Paterson S, Fothergill JL, Wright EA, Ledson MJ et al. Pseudomonas aeruginosa population diversity and turnover in cystic fibrosis chronic infections. Am J Respir Crit Care Med 2011; 183:1674–1679 [View Article]
    [Google Scholar]
  18. Smith EE, Buckley DG, Wu Z, Saenphimmachak C, Hoffman LR et al. Genetic adaptation by Pseudomonas aeruginosa to the airways of cystic fibrosis patients. Proc Natl Acad Sci 2006; 103:8487–8492 [View Article]
    [Google Scholar]
  19. Tümmler B. Clonal variations in Pseudomonas aeruginosa. In Pseudomonas Boston, MA: Springer; 2006 pp 35–68
    [Google Scholar]
  20. Dettman JR, Kassen R. Evolutionary genomics of niche-specific adaptation to the cystic fibrosis lung in Pseudomonas aeruginosa. Mol Biol Evol 2021; 38:663–675 [View Article]
    [Google Scholar]
  21. Klockgether J, Cramer N, Fischer S, Wiehlmann L, Tümmler B. Long-term microevolution of Pseudomonas aeruginosa differs between mildly and severely affected cystic fibrosis lungs. Am J Respir Cell Mol Biol 2018; 59:246–256 [View Article]
    [Google Scholar]
  22. Marvig RL, Johansen HK, Molin S, Jelsbak L. Genome analysis of a transmissible lineage of Pseudomonas aeruginosa reveals pathoadaptive mutations and distinct evolutionary paths of hypermutators. PLoS Genet 2013; 9:e1003741 [View Article]
    [Google Scholar]
  23. Williams D, Evans B, Haldenby S, Walshaw MJ, Brockhurst MA et al. Divergent, coexisting Pseudomonas aeruginosa lineages in chronic cystic fibrosis lung infections. Am J Respir Crit Care Med 2015; 191:775–785 [View Article]
    [Google Scholar]
  24. Lim YW, Evangelista JS, Schmieder R, Bailey B, Haynes M et al. Clinical insights from metagenomic analysis of sputum samples from patients with cystic fibrosis. J Clin Microbiol 2014; 52:425–437 [View Article]
    [Google Scholar]
  25. Dias R, Torkamani A. Artificial intelligence in clinical and genomic diagnostics. Genome Med 2019; 11:1–12 [View Article]
    [Google Scholar]
  26. Lees JA, Mai TT, Galardini M, Wheeler NE, Horsfield ST et al. Improved prediction of bacterial genotype-phenotype associations using interpretable pangenome-spanning regressions. mBio 2020; 11:e01344-20 [View Article]
    [Google Scholar]
  27. Acosta N, Heirali A, Somayaji R, Surette MG, Workentine ML et al. Sputum microbiota is predictive of long-term clinical outcomes in young adults with cystic fibrosis. Thorax 2018; 73:1016–1025 [View Article] [PubMed]
    [Google Scholar]
  28. Acosta N, Whelan FJ, Somayaji R, Poonja A, Surette MG et al. The evolving cystic fibrosis microbiome: a comparative cohort study spanning 16 years. Ann Am Thorac Soc 2017; 14:1288–1297 [View Article]
    [Google Scholar]
  29. Pellegrino R, Viegi G, Brusasco V, Crapo RO, Burgos F et al. Interpretative strategies for lung function tests. Eur Respir J 2005; 26:948–968 [View Article] [PubMed]
    [Google Scholar]
  30. Aronesty E. Fastq-mcf: sequence quality filter, clipping and processor; 2013
  31. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J et al. The sequence alignment/Map format and SAMtools. Bioinformatics 2009; 25:2078–2079 [View Article]
    [Google Scholar]
  32. Koboldt DC, Zhang Q, Larson DE, Shen D, McLellan MD et al. VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing. Genome Res 2012; 22:568–576 [View Article] [PubMed]
    [Google Scholar]
  33. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012; 6:80–92 [View Article]
    [Google Scholar]
  34. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods 2020; 17:261–272 [View Article] [PubMed]
    [Google Scholar]
  35. Waskom ML. seaborn: statistical data visualization. J Open Source Software 2021; 6:3021 [View Article]
    [Google Scholar]
  36. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B et al. Scikit-learn: machine learning in python. J Mach Learn Res 2011; 12:2825–2830
    [Google Scholar]
  37. Ke G, Meng Q, Finley T, Wang T, Chen W et al. Lightgbm: a highly efficient gradient boosting decision tree. Adv Neural Inf Process Syst 2017; 30:3146–3154
    [Google Scholar]
  38. Koehrsen W. Feature Selector: Feature Selection in Python; 2019 https://github.com/WillKoehrsen/feature-selector
  39. Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining 2016 pp 785–794
    [Google Scholar]
  40. Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP et al. Random forest: a classification and regression tool for compound classification and QSAR modeling. J Chem Inf Comput Sci 2003; 43:1947–1958 [View Article]
    [Google Scholar]
  41. Somayaji R, Lam JC, Surette MG, Waddell B, Rabin HR et al. Long-term clinical outcomes of “Prairie Epidemic Strain” Pseudomonas aeruginosa infection in adults with cystic fibrosis. Thorax 2017; 72:333–339 [View Article]
    [Google Scholar]
  42. Macesic N, Bear Don’t Walk OJ, Pe’er I, Tatonetti NP, Peleg AY et al. Predicting phenotypic polymyxin resistance in Klebsiella pneumoniae through machine learning analysis of genomic data. mSystems 2020; 5: [View Article]
    [Google Scholar]
  43. Méric G, Mageiros L, Pensar J, Laabei M, Yahara K et al. Disease-associated genotypes of the commensal skin bacterium Staphylococcus epidermidis. Nat Commun 2018; 9:5034 [View Article]
    [Google Scholar]
  44. Mobegi FM, Cremers AJH, de Jonge MI, Bentley SD, van Hijum SAFT et al. Deciphering the distance to antibiotic resistance for the pneumococcus using genome sequencing data. Sci Rep 2017; 7:1–13 [View Article]
    [Google Scholar]
  45. Recker M, Laabei M, Toleman MS, Reuter S, Saunderson RB et al. Clonal differences in Staphylococcus aureus bacteraemia-associated mortality. Nat Microbiol 2017; 2:1381–1388 [View Article]
    [Google Scholar]
  46. Kumru B, Emiralioğlu N, Gökmen Ozel H. Does body mass index affect lung function in patients with cystic fibrosis?. Clinical Nutrition 2018; 37:S91 [View Article]
    [Google Scholar]
  47. Snell GI, Bennetts K, Bartolo J, Levvey B, Griffiths A et al. Body mass index as a predictor of survival in adults with cystic fibrosis referred for lung transplantation. J Heart Lung Transplant 1998; 17:1097–1103
    [Google Scholar]
  48. Zhao CY, Hao Y, Wang Y, Varga JJ, Stecenko AA et al. Microbiome data enhances predictive models of lung function in people with cystic fibrosis. J Infect Dis 2020jiaa655
    [Google Scholar]
  49. Zhang L, Mah TF. Involvement of a novel efflux system in biofilm-specific resistance to antibiotics. J Bacteriol 2008; 190:4447–4452 [View Article] [PubMed]
    [Google Scholar]
  50. Kuhn M, Johnson K. Over-fitting and model tuning. In Applied Predictive Modeling New York, NY: Springer New York; 2013 pp 61–92
    [Google Scholar]
  51. Alaa AM, van der Schaar M. Prognostication and risk factors for cystic fibrosis via automated machine learning. Sci Rep 2018; 8:11242 [View Article]
    [Google Scholar]
  52. Raschka S. MLxtend: Providing machine learning and data science utilities and extensions to Python’s scientific computing stack. J Open Source Software 2018; 3:638 [View Article]
    [Google Scholar]
  53. Pincus NB, Ozer EA, Allen JP, Nguyen M, Davis JJ et al. A genome-based model to predict the virulence of Pseudomonas aeruginosa isolates. mBio 2020; 11:e01527-20 [View Article]
    [Google Scholar]
  54. van der Gast CJ, Walker AW, Stressmann FA, Rogers GB, Scott P et al. Partitioning core and satellite taxa from within cystic fibrosis lung bacterial communities. ISME J 2011; 5:780–791 [View Article] [PubMed]
    [Google Scholar]
  55. Zhao CY, Hao Y, Wang Y, Varga JJ, Stecenko AA et al. Microbiome data enhances predictive models of lung function in people with cystic fibrosis. J Infect Dis 2021; 223:S246–S256 [View Article] [PubMed]
    [Google Scholar]
  56. Mallawaarachchi S, Tonkin-Hill G, Croucher NJ, Turner P, Speed D et al. Genome-wide association, prediction and heritability in bacteria with application to Streptococcus pneumoniae. NAR Genom Bioinforma 2022; 4:lqac011 [View Article]
    [Google Scholar]
  57. Saber MM, Shapiro BJ. Benchmarking bacterial genome-wide association study methods using simulated genomes and phenotypes. Microb Genom 2020; 6: [View Article] [PubMed]
    [Google Scholar]
  58. Maatallah M, Cheriaa J, Backhrouf A, Iversen A, Grundmann H et al. Population structure of Pseudomonas aeruginosa from five Mediterranean countries: evidence for frequent recombination and epidemic occurrence of CC235. PLoS One 2011; 6:e25617 [View Article]
    [Google Scholar]
  59. Bouvier NM. Cystic fibrosis and the war for iron at the host-pathogen battlefront. Proc Natl Acad Sci 2016; 113:1480–1482 [View Article]
    [Google Scholar]
  60. Firoz A, Haris M, Hussain K, Raza M, Verma D et al. Can targeting iron help in combating chronic Pseudomonas infection? A systematic review. Cureus 2021; 13: [View Article]
    [Google Scholar]
  61. Poudyal B, Sauer K. The ABC of biofilm drug tolerance: the MerR-like regulator BrlR is an activator of ABC transport systems, with PA1874-77 contributing to the tolerance of Pseudomonas aeruginosa biofilms to tobramycin. Antimicrob Agents Chemother 2018; 62: [View Article]
    [Google Scholar]
  62. MacFadden DR, Melano RG, Coburn B, Tijet N, Hanage WP et al. Comparing patient risk factor-, sequence type-, and resistance locus identification-based approaches for predicting antibiotic resistance in Escherichia coli bloodstream infections. J Clin Microbiol 2019; 57: [View Article]
    [Google Scholar]
http://instance.metastore.ingenta.com/content/journal/mgen/10.1099/mgen.0.000981
Loading
/content/journal/mgen/10.1099/mgen.0.000981
Loading

Data & Media loading...

Supplements

Supplementary material 1

PDF

Supplementary material 2

EXCEL

Supplementary material 3

EXCEL
This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error