Fractal analysis of DNA sequence data

Update Item Information
Publication Type dissertation
School or College School of Medicine
Department Biomedical Informatics
Author Berthelsen, Cheryl Lynn
Contributor Glazier, James; Olipant, Arnold; Goldgar, David; Tavare, Simon
Title Fractal analysis of DNA sequence data
Date 1993-03
Description DNA sequence databases are growing at an almost exponential rate. New analysis methods are needed to extract knowledge about the organization of nucleotides from this vast amount of data. Fractal analysis is a new scientific paradigm that has been used successfully in many domains including the biological and physical sciences. Biological growth in a nonlinear dynamic process and some have suggested that to consider fractal geometry as a biological design principle may be most productive. This research is an exploratory study of the application of fractal analysis to DNA sequence data. A simple random fractal, the random walk, is used to represent DNA sequences. The fractal dimension of these walks is than estimated using the "sandbox method" Analysis of 164 human DNA sequences compared to three types of control sequences (random, base-content matched, and dimer-content matched) reveal that long-range correlations are present in DNA that are not explained by base or dimer frequencies. The study also revealed that the fractal dimension of coding sequences was significantly lower that sequences that were primarily non-coding, indicating the presence of longer-range correlations in functional sequences. The multifractal spectrum is used to analyze fractals that are heterogeneous and have a different fractal dimension for subsets with different scalings. The multifractal spectrum of the random walks of twelve mitochondrial genome sequences was estimated. Eight vertebrate mtDNA sequences had uniformly lower spectra values than did four invertebrate mtDNA sequences. Thus, vertebrate mitochondria show significantly longer-rage correlations than do invertebrate mitochondria. The higher multifractal spectra values for invertebrate mitochondria suggest a more random organization of sequences. This research also included considerable theoretical work on the effects of finite size, embedding dimension, and scaling ranges.
Type Text
Publisher University of Utah
Subject Nucleotide Sequence; Mathematical Models; Fractals
Subject MESH Sequence Analysis; DNA; Nucleosides; Base Sequence; Fractals
Dissertation Institution University of Utah
Dissertation Name PhD
Language eng
Relation is Version of Digital reproduction of "Fractal analysis of DNA sequence data". Spencer S. Eccles Health Sciences Library. Print version of "Fractal analysis of DNA sequence data". available at J. Willard Marriott Library Special Collection. QP6.5 1993 .B47.
Rights Management © Cheryl Lynn Berthelsen.
Format Medium application/pdf
Format Extent 2,350,922 bytes
Identifier undthes,4592
Source Original: University of Utah Spencer S. Eccles Health Sciences Library (no longer available).
Funding/Fellowship Gradyate Researcg Committee of the University of Utah
Master File Extent 2,350,984 bytes
ARK ark:/87278/s60r9r7r
Setname ir_etd
ID 191466
Reference URL https://collections.lib.utah.edu/ark:/87278/s60r9r7r