To set up an account, you have to provide contact details. The person providing these details, the contact person, is the person who we will contact if we have any questions about the submission, and the person to whom the accession number will be sent. The contact person does not need to be the same as the contact person of a possible publication. The contact details will not be added to the UniProt Knowledgebase entry.
The following fields are mandatory:
If you don't get an e-mail just after entering your personal details it is likely that the e-mail address you provided is incorrect. Any error with the e-mail address might lead to delays in releasing accession numbers.
Click Create a new submission to start the submission process. This opens a web form and you will be asked to provide the following information:
Required:
Optional:
You will be expected to complete all required fields.
Please give the name(s) of the protein.
Examples:
The scientific name of the organism from which the sequence was obtained.
Example: Homo sapiens
The commonly used name(s) of the organism from which the sequence was obtained.
Example: Human
The NCBI Taxonomy ID of the organism from which the sequence was obtained.
The strain, cultivar or variety from which the sequence was obtained.
Examples:
The tissue or cell type the protein was purified from. You may also include developmental stage information.
Examples:
You should ideally submit your sequence data before you have galley proofs. However, if your manuscript has already been accepted for publication, the accession number can be included at the galley proof stage as a note added in proof.
We suggest that the following text be used to cite the accession number(s) in publication(s): "The protein sequence data reported in this paper will appear in the UniProt Knowledgebase under the accession number(s) xxxxxx."
You must choose between the following citation types:
If you do not plan to publish the sequence elsewhere, please provide a list of author names (a minimum of one name is required) and a title for the submission. The title should be similar to one you would use if you published the sequence in a journal.
Please give the title, author list, journal name, volume, pages and the year of publication. If the Pubmed ID is known please also provide this.
Unpublished journal articles have the same fields as a published journal article, with the exception of publication year and Pubmed ID. Fill in as much information as you currently have available. Mandatory field restrictions are relaxed for this citation type.
Please give the title of the thesis, the author, year (if known), institution name and the country.
Please select whether you are submitting a complete sequence or one or more fragments of a particular protein. If you are submitting fragments you can use a checkbox to indicate whether the first fragment is the N-terminal one and whether the order of fragments is known or not. Enter the sequence in the box provided using single letter amino acid code.
Depending on your preferences, your data can either be released immediately after being checked by a curator, or kept confidential initially. Please note that the data can not be kept confidential once published in a journal or thesis. There are three options for release date.
The entry will be released after you have been provided with the accession number(s). Please note that the entry will not appear in the database immediately after we send you the accession number.
The entry will be released when it is published in a journal. Please contact us when your paper is published, giving full citation details, to ensure that we release your entry at the appropriate time.
The entry will be released on or after the date you provided. If you publish your data before the date you gave us, please let us know and we can make sure that the submitted data is released at the appropriate time. You may change the provided date by contacting data submissions by e-mail.
The information you provide will be annotated by a scientific curator before it is entered in the database, so don't worry if you are unsure which category to put the data in. You may for example use the Other Information category for the information that doesn't fit anywhere else.
Please give the result and accuracy in Daltons, and choose the method you used from the drop-down menu. Please let us know in the comment field if the result is not of the whole protein, but a fragment. If you give several results for one peptide, please specify what causes the difference in the mass. Only experimental data may be entered here.
Examples:
What does the protein do?
Examples:
Is this protein specifically found in certain tissues?
Examples:
Have you found similarities between this protein sequence and some other proteins?
Examples:
The EC number does not need to be complete, but please provide as much as you can.
Examples:
Do you know which reaction the enzyme catalyses?
Examples:
Do you know if a cofactor is required for enzyme activity?
Examples:
Do you know if the enzyme functions in a specific metabolic pathway?
Examples:
Do you know what activates or inhibits the enzyme?
Examples:
Do you know the maximal velocity of the reaction catalyzed by this enzyme (Vmax)? Additional information may be provided in the notes field.
Examples:
Do you know the Michaelis-Menten constant (kM) for this enzyme? You must state the substrate used to determine the kM
Examples:
Does the protein consist of one peptide or several?
Examples:
Is the protein an allergen?
Examples:
Where is the protein located, within the cell or outside
Examples:
Is the protein modified after it is synthesised?
Examples:
Do you know what circumstances cause the protein to be synthesized?
Examples:
Some proteins appear at specific moments in the development of an organism.
Examples:
Do you know the optimum temperature for this enzyme, variation of enzyme activity with variation in temperature, or therostability of the enzyme?
Examples:
Do you know the optimum pH for this enzyme, or variation of enzyme activity with variation in pH?
Examples:
Do you know the standard redox potential for this protein?
Examples:
Is your protein photoreactive? Does it absorb a particular wavelength? Please don't list absorbance values due to cytochromes, flavin groups in flavin-binding proteins or pyridoxal phosphate in PLP-containing proteins.
Examples:
Did you identify the protein from a 2D page? Please give the mass in kilodaltons and pI. Only experimental data may be entered here.
Examples:
Do you have any other information to give about the protein? If applicable, add the source of the data.
Examples:
Are there any interesting features in the sequence you can tell us about?
Click on "add a feature" and choose the appropriate description from the drop-down list. If the feature is not on the list, other domain of interest or other site of interest should be selected.
Are you unsure of the identity of an amino acid, or the order of some residues? Only experimentally determined information should be entered here.
Examples:
Is the sequence posttranslationally modified? Please give the chemical nature of the modification in the comment field. The most common modifications are: Acetylation, Amidation, Formylation, Pyrrolidone carboxylic acid, Hydroxylation, Methylation, Phosphorylation and Sulfation. Exception: Glycosylation should be described using the annotation type 'Glycosylation site'.
Examples:
Are some of the residues linked by an intra-chain or inter-chain disulfide bond?
Examples:
If your protein is an enzyme, do you know which amino acids are involved in the activity of the enzyme? If known, the role of the active site (proton donor, nucleophile, charge relay system etc) should be added in the Comment field.
Examples:
Is the protein glycosylated? If you know more details, such as the type of the linkage, or the nature of the sugar (or the reducing terminal sugar), please add these to the Comment field.
N-linked:
This is the most prevalent carbohydrate linkage, which involves the addition of oligosaccharides to asparagine residues of secreted or membrane-bound proteins. The modification requires the following sequence motif: Asn-Xaa-Ser/Thr where 'x' cannot be a proline residue.
O-linked:
Glycans attached to serine, threonine and, to a lesser extent, hydroxyproline and hydroxylysine. Occurs on secreted and membrane-bound proteins, but also on cytoplasmic and nuclear proteins. The most common type of glycan linked in this fashion is N-acetylgalactosamine and, in rare cases, can be mannose, fucose, glucose, galactose or xylose.
C-linked:
C-glycosylation involves the mannosylation of specific tryptophan residues.
Examples:
Does the protein bind a metal ion? Please state in the comments field which metal is bound.
Examples:
Does the sequence bind a nucleotide phosphate? Please indicate the nature of the nucleotide phosphate in the Comment field.
Examples:
Does the sequence contain a DNA binding domain? If known, please give the nature of the DNA-binding region in the comment field.
Examples:
Does the protein have a covalent bond or a specific strong interaction with a chemical group (co-enzyme, prosthetic group, etc.)? Please state what is bound in the comments field.
Examples:
Are there any transmembrane regions?
Examples:
Are there any other domains of interest within the protein? Please give details in the comments field.
Examples:
Are there any other sites of interest (single residues, or pairs of residues) within the protein? Please give details in the comments field.
Examples:
Are there any natural variations in the sequence? This should only be used for natural variation. Please enter the position of the variant, and the alternative residue. Please give details in the comments field.
Examples:
We are using evidence tags to document where each piece of information has originated. Please tell us if the data you provide is
How do I update a previously submitted data?
SPIN may be used for submitting new sequencing data to a publicly available UniProt Knowledgebase entry. If you want to correct/modify data you have submitted previously, or add new data to a confidential entry, please e-mail the changes to uniprot-submissions@ebi.ac.uk. Please use the update form to update other publicly available UniProtKB/Swiss-Prot entries or email help@uniprot.org.
How long will it take to get an accession number?
We will process data submissions within 7 working days of receipt and send submitters notification of either which accession number(s) their data have been assigned or what additional information is needed. There are several things authors can do to minimize the time it takes to get an accession number:
What is the SPIN ID?
You are provided with a unique Submission identifier number (SPIN ID) for your submission every time you start a new session from the SPIN home page. This number allows the system to keep track of your submissions and sequences. This SPIN ID number is not an accession number. You will receive an accession number once your submission has been processed by a curator. Once you have received an accession number you should cite it in all further communication with EBI.
How do I submit several proteins from one organism?
Use a separate entry form for each protein/peptide.
How do I submit same protein from several organisms?
Use a separate entry form for each organism.
How do I submit several fragments from one protein/peptide (one organism)?
Use one entry form for all fragments.
How do I submit several fragments, may be from different proteins?
Use a separate entry form for each fragment.
How do I submit several fragments from different proteins?
Use a separate entry form for each fragment.
What are the browser requirements?
We recommend using the latest versions of Firefox, Chrome, Safari, Opera or Internet Explorer. Javascript has to be enabled.
What is SPIN?
SPIN is the Internet tool for the submission of directly sequenced protein sequences to the UniProt Knowledgebase.
Where is SPIN?
SPIN is a service offered by the UniProt teams at the European Molecular Biology Laboratory - The European Bioinformatics Institute (EMBL-EBI) located in Cambridge (United Kingdom).
Who is responsible for SPIN?
For any problems please contact uniprot-submissions@ebi.ac.uk.