Finding InterPro annotation without BioMart

From Bioinformatics Lab

InterPro annotation can usually be retrieved through BioMart. For some species, however, such as Pseudomonas aeruginosa, BioMart does not provide InterPro annotation data. In that case, follow the three steps below.

1. Download the InterPro annotation file (protein2ipr.dat.gz), which covers all species, from the InterPro download site.

2. Find the UniProt protein IDs of the species you are interested in.

In the case of Pseudomonas aeruginosa, the protein IDs used in the Pseudomonas Genome Database differed from the UniProt protein IDs. Therefore, I downloaded the UniProt protein IDs of Pseudomonas aeruginosa from the UniProt homepage. If you click this URL, you can see my search result for Pseudomonas aeruginosa.

3. From the 'protein2ipr.dat.gz' file, extract only the lines whose UniProt protein IDs belong to the species you want. If you can use a simple scripting language, this is an easy job.
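
As an illustration of step 3, here is a minimal Python sketch. It assumes that the UniProt IDs from step 2 were saved as a plain text file with one accession per line, and that each line of protein2ipr.dat.gz is tab-separated with the UniProt accession in the first column. The function names (load_ids, filter_protein2ipr) and the file names in the usage note are hypothetical and are not part of the original procedure.

```python
import gzip


def load_ids(path):
    """Read one UniProt accession per line into a set (blank lines are skipped)."""
    with open(path, encoding="utf-8") as handle:
        return {line.strip() for line in handle if line.strip()}


def filter_protein2ipr(dat_gz_path, wanted_ids, out_path):
    """Copy the lines whose first tab-separated field is in wanted_ids.

    Assumption: the first column of protein2ipr.dat holds the UniProt accession.
    Returns the number of lines written to out_path.
    """
    kept = 0
    # Stream the gzip file line by line so the full archive never sits in memory.
    with gzip.open(dat_gz_path, "rt", encoding="utf-8") as src, \
            open(out_path, "w", encoding="utf-8") as dst:
        for line in src:
            accession = line.split("\t", 1)[0]
            if accession in wanted_ids:
                dst.write(line)
                kept += 1
    return kept
```

For example, with the hypothetical file names pa_uniprot_ids.txt and pa_interpro.tsv, calling filter_protein2ipr("protein2ipr.dat.gz", load_ids("pa_uniprot_ids.txt"), "pa_interpro.tsv") would write only the matching lines. Keeping the IDs in a set makes each lookup fast, which matters because the full file is very large.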
