Predicting the binding preference of transcription factors to individual DNA k-mers.

Overview

abstract

MOTIVATION: Recognition of specific DNA sequences is a central mechanism by which transcription factors (TFs) control gene expression. Many TF-binding preferences, however, are unknown or poorly characterized, in part due to the difficulty associated with determining their specificity experimentally, and an incomplete understanding of the mechanisms governing sequence specificity. New techniques that estimate the affinity of TFs to all possible k-mers provide a new opportunity to study DNA-protein interaction mechanisms, and may facilitate inference of binding preferences for members of a given TF family when such information is available for other family members. RESULTS: We employed a new dataset consisting of the relative preferences of mouse homeodomains for all eight-base DNA sequences in order to ask how well we can predict the binding profiles of homeodomains when only their protein sequences are given. We evaluated a panel of standard statistical inference techniques, as well as variations of the protein features considered. Nearest neighbour among functionally important residues emerged among the most effective methods. Our results underscore the complexity of TF-DNA recognition, and suggest a rational approach for future analyses of TF families.

authors

Alleyne, Trevis M

Peña-Castillo, Lourdes

Badis, Gwenael

Talukder, Shaheynoor

Berger, Michael
Gehrke, Andrew R
Philippakis, Anthony A
Bulyk, Martha L
Morris, Quaid D
Hughes, Timothy R

publication date

December 16, 2008

published in

Bioinformatics (Oxford, England) Journal

Research

keywords

Computational Biology
DNA
Sequence Analysis, DNA
Transcription Factors

Identity

PubMed Central ID

PMC2666811

Scopus Document Identifier

64549112146

Digital Object Identifier (DOI)

10.1093/bioinformatics/btn645

PubMed ID

19088121

Additional Document Info

has global citation frequency

21

volume

25

issue

8

VIVO Weill Cornell Medical College

Predicting the binding preference of transcription factors to individual DNA k-mers. Academic Article

Overview

abstract

authors

publication date

published in

Research

keywords

Identity

PubMed Central ID

Scopus Document Identifier

Digital Object Identifier (DOI)

PubMed ID

Additional Document Info

has global citation frequency

volume

issue