SIMAP--structuring the network of protein similarities.

Overview

abstract

Protein sequences are the most important source of evolutionary and functional information for new proteins. In order to facilitate the computationally intensive tasks of sequence analysis, the Similarity Matrix of Proteins (SIMAP) database aims to provide a comprehensive and up-to-date dataset of the pre-calculated sequence similarity matrix and sequence-based features like InterPro domains for all proteins contained in the major public sequence databases. As of September 2007, SIMAP covers approximately 17 million proteins and more than 6 million non-redundant sequences and provides a complete annotation based on InterPro 16. Novel features of SIMAP include a new, portlet-based web portal providing multiple, structured views on retrieved proteins and integration of protein clusters and a unique search method for similar domain architectures. Access to SIMAP is freely provided for academic use through the web portal for individuals at http://mips.gsf.de/simap/ and through Web Services for programmatic access at http://mips.gsf.de/webservices/services/SimapService2.0?wsdl.

authors

Krumsiek, Jan
Wachinger, Benedikt
Stümpflen, Volker
Mewes, Werner

publication date

November 23, 2007

published in

Nucleic Acids Research (journal)

Research

keywords

Databases, Protein
Sequence Alignment
Sequence Analysis, Protein

Identity

PubMed Central ID

PMC2238827

Scopus Document Identifier

38549120716

Digital Object Identifier (DOI)

10.1093/nar/gkm963

PubMed ID

18037617

Additional Document Info

has global citation frequency

25

volume

36

issue

Database issue
