Accommodating uncertainty in a tree set for function estimation

Brian C. Healy, Victor G. DeGruttola, Chengcheng Hu

Research output: Contribution to journal › Article

2 Citations (Scopus)

Abstract

Multiple branching trees have been used to model the acquisition of HIV drug resistance mutations, and several different algorithms have been developed to construct the tree set that best describes the data. These algorithms have mainly focused on the structure of the tree set. The focal point of this paper is estimation of functions of the tree set parameters that incorporate uncertainty in the tree set. The functions of interest are the state probabilities, the co-occurrence of mutations and the order of acquisition. Such functions are of interest because they help characterize the genetic pathways that lead to multi-drug resistance. We propose a bootstrap technique to account for the additional variability in estimates due to uncertainty in the tree set. The methods are applied to genetic sequences of patients from a database compiled by the Forum for Collaborative HIV Research in an effort to characterize genetic pathways to resistance to drugs from the nucleoside reverse transcriptase inhibitor (NRTI) class. The main results were that patients with a 211K mutation in the RT region of the viral genome were more likely to have a 215Y mutation and less likely to have a 70R mutation compared to patients without a 211K mutation.
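The bootstrap idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it is a generic nonparametric percentile bootstrap in which patients are resampled with replacement and the statistic is recomputed from scratch on every resample. In the paper's setting, the statistic would presumably include re-estimating the tree set on each resample, so that uncertainty in the tree structure carries through to the final estimate; here a simple empirical difference in co-occurrence proportions stands in for that step. The function names and the toy data are hypothetical and chosen only for illustration.

```python
import random


def cooccurrence_difference(patients, given, target):
    """Return P(target | given present) - P(target | given absent).

    `patients` is a list of sets of mutation labels. Returns None when
    either group is empty, since the difference is then undefined.
    """
    with_given = [p for p in patients if given in p]
    without_given = [p for p in patients if given not in p]
    if not with_given or not without_given:
        return None
    p_with = sum(target in p for p in with_given) / len(with_given)
    p_without = sum(target in p for p in without_given) / len(without_given)
    return p_with - p_without


def bootstrap_interval(patients, statistic, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for `statistic`.

    Each replicate resamples patients with replacement and recomputes the
    statistic from the resample alone. A model-based analysis would refit
    the model (for example, a tree set) inside `statistic`.
    """
    rng = random.Random(seed)
    n = len(patients)
    estimates = []
    for _ in range(n_boot):
        resample = [patients[rng.randrange(n)] for _ in range(n)]
        value = statistic(resample)
        if value is not None:  # skip replicates where a group is empty
            estimates.append(value)
    estimates.sort()
    lower = estimates[int((alpha / 2) * len(estimates))]
    upper = estimates[min(len(estimates) - 1,
                          int((1 - alpha / 2) * len(estimates)))]
    return lower, upper


# Hypothetical toy data: 100 made-up patients, NOT taken from the paper.
toy_patients = (
    [{"211K", "215Y"}] * 30
    + [{"211K"}] * 10
    + [{"215Y"}] * 15
    + [{"70R"}] * 25
    + [set()] * 20
)


def stat(patients):
    return cooccurrence_difference(patients, "211K", "215Y")


point_estimate = stat(toy_patients)  # 30/40 - 15/60 = 0.5
lower, upper = bootstrap_interval(toy_patients, stat)
```

Because the whole estimation procedure sits inside `statistic`, swapping in a tree-set fit would not change the resampling loop; only the cost of each replicate would grow.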

Original language: English (US)
Article number: 5
Journal: Statistical Applications in Genetics and Molecular Biology
Volume: 7
Issue number: 1
State: Published - 2008
Externally published: Yes

Fingerprint

Function Estimation
Trees (mathematics)
Uncertainty
Mutation
Drug Resistance
Pathway
Genes
Likely
HIV
Reverse Transcriptase Inhibitors
Viral Genome
Multiple Drug Resistance
Jurisprudence
Nucleosides
Bootstrap
Inhibitor
Branching
Reverse
Drugs
Genome

Keywords

  • Bootstrap techniques
  • Branching trees
  • HIV
  • Resistance mutations

ASJC Scopus subject areas

  • Genetics

Cite this

Accommodating uncertainty in a tree set for function estimation. / Healy, Brian C.; DeGruttola, Victor G.; Hu, Chengcheng.

In: Statistical Applications in Genetics and Molecular Biology, Vol. 7, No. 1, 5, 2008.

@article{bbf1d2315c084d06809ddc71d78c16a7,
title = "Accommodating uncertainty in a tree set for function estimation",
keywords = "Bootstrap techniques, Branching trees, HIV, Resistance mutations",
author = "Healy, {Brian C.} and DeGruttola, {Victor G.} and Hu, Chengcheng",
year = "2008",
language = "English (US)",
volume = "7",
journal = "Statistical Applications in Genetics and Molecular Biology",
issn = "1544-6115",
publisher = "Berkeley Electronic Press",
number = "1",
}

TY - JOUR

T1 - Accommodating uncertainty in a tree set for function estimation

AU - Healy, Brian C.

AU - DeGruttola, Victor G.

AU - Hu, Chengcheng

PY - 2008

Y1 - 2008

N2 - Multiple branching trees have been used to model the acquisition of HIV drug resistance mutations, and several different algorithms have been developed to construct the tree set that best describes the data. These algorithms have mainly focused on the structure of the tree set. The focal point of this paper is estimation of functions of the tree set parameters that incorporate uncertainty in the tree set. The functions of interest are the state probabilities, the co-occurrence of mutations and the order of acquisition. Such functions are of interest because they help characterize the genetic pathways that lead to multi-drug resistance. We propose a bootstrap technique to account for the additional variability in estimates due to uncertainty in the tree set. The methods are applied to genetic sequences of patients from a database compiled by the Forum for Collaborative HIV Research in an effort to characterize genetic pathways to resistance to drugs from the nucleoside reverse transcriptase inhibitor (NRTI) class. The main results were that patients with a 211K mutation in the RT region of the viral genome were more likely to have a 215Y mutation and less likely to have a 70R mutation compared to patients without a 211K mutation.

KW - Bootstrap techniques

KW - Branching trees

KW - HIV

KW - Resistance mutations

UR - http://www.scopus.com/inward/record.url?scp=41649106946&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=41649106946&partnerID=8YFLogxK

M3 - Article

C2 - 18312210

AN - SCOPUS:41649106946

VL - 7

JO - Statistical Applications in Genetics and Molecular Biology

JF - Statistical Applications in Genetics and Molecular Biology

SN - 1544-6115

IS - 1

M1 - 5

ER -