SIMAP - The Similarty Matrix of Proteins
The Similarity Matrix of Proteins (SIMAP)
Welcome to SIMAP.
SIMAP is a database containing the similarity space formed by about all amino-acid sequences from public databases and completely sequenced genomes.
You may find sequences and protein entries of interest by fulltext search which uses an index of proteins IDs, accession numbers and descriptions, and the Biothesaurus.
Starting from your query sequence you may find the nearest sequences in SIMAP. By searching parts of your query in a suffix array of all SIMAP sequences (generated by VMATCH), this search runs much faster than BLAST.