Rosetta 3.2.1 Release Manual |
This script uses the alignment to produce a trimmed template suitable for an initial molecular replacement search. All deleted residues are removed, and aligned nonmatching residues are trimmed to the gamma carbon. Functionally, the template trimming is doing the same thing as the crystallographic software 'Sculptor' but it doesn't remap the residues as sculptor does (and makes it easier to run with different alignments).
The trimmed HHR-file should look something like this:
Query 1XXX Match_columns 134 No_of_seqs 101 out of 418 Neff 6.7 Searched_HMMs 22773 Command /cluster/toolkit/production/bioprogs/hhpred/hhsearch -cpu 4 -v 1 -i /cluster/toolkit/production/tmp/production/417840/9488624.hhm -d /cluster/toolkit/production/databases/hhpred/new_dbs/pdb70_21Aug10/db/pdb.hhm -o /cluster/toolkit/production/tmp/production/417840/9488624.hhr -p 20 -P 20 -Z 100 -B 100 -seq 1 -aliw 80 -global -ssm 2 -norealign -sc 1 -dbstrlen 10000 -cs /cluster/toolkit/production/bioprogs/csblast/data/clusters.prf No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 2qo4_A Liver-basic fatty acid 100.0 0 0 292.6 21.2 126 2-133 1-126 (126) No 1 >2qo4_A Liver-basic fatty acid binding protein; liver bIle acid-binding protein, BABP, FABP, cholic acid, cholate, bIle acid, lipid-binding, transport; HET: CHD; 1.50A {Danio rerio} PDB: 2qo6_A* 2qo5_A* 2ftb_A* 2ft9_A* Probab=100.00 E-value=0 Score=292.58 Aligned_cols=126 Identities=31% Similarity=0.488 Sum_probs=0.0 Q ss_pred CcccEEEEEEeccCHHHHHHHcCCCHHHHhhhhcCCceEEEEEeCCEEEEEEEccceEEEEEEECCCcEEeecccCCCCE Q 1CRB_ChainA 2 VDFNGYWKMLSNENFEEYLRALDVNVALRKIANLLKPDKEIVQDGDHMIIRTLSTFRNYIMDFQVGKEFEEDLTGIDDRK 81 (134) Q Consensus 2 ~~f~G~wkl~~sENfde~Lk~lGv~~~~Rk~a~~~~p~~eI~~~Gd~~tikt~t~~kt~~~~F~lGeefee~~~t~dg~~ 81 (134) |+|+|+|+|++|||||+||+|||||+++|++|+.++|+++|+||||+|+|++.+++++.+++|+|||||||+ ++||++ T Consensus 1 MaF~G~wkl~~sENfd~flkalGv~~~~rk~a~~~~p~~~I~~~Gd~~~ikt~s~~kt~~~~F~lGeefee~--~~dG~k 78 (126) T 2qo4_A 1 MAFSGTWQVYAQENYEEFLRAISLPEEVIKLAKDVKPVTEIQQNGSDFTITSKTPGKTVTNSFTIGKEAEIT--TMDGKK 78 (126) T ss_dssp -CCCEEEEEEEEESHHHHHHHTTCCHHHHHHTTTCCCEEEEEEETTEEEEEEEETTEEEEEEEETTBEEEEE--CTTSCE T ss_pred CCccEEEEEEeccCHHHHHHHcCCCHHHHhhhhcCCceEEEEEeCCEEEEEEEcCCeeEEEEEECCCcEEEE--cCCCCE Q ss_pred EEEEEEEECCEEEEEEECCCCCeEEEEEEECCEEEEEEEECCEEEEEEEEEC Q 1CRB_ChainA 82 CMTTVSWDGDKLQCVQKGEKEGRGWTQWIEGDELHLEMRAEGVTCKQVFKKV 133 (134) Q Consensus 82 ~k~~~t~eg~kLv~~~~~~~~~~~~~re~~g~~l~~t~~~~~V~~~R~ykrv 133 (134) +|++++||||+|++ ..++...+||++||+|++||+++||+|+|+|+|| T Consensus 79 ~k~~~t~eg~kLv~----~~~~~~~~Re~~g~~l~~t~~~~~v~~~R~ykrv 126 (126) T 2qo4_A 79 LKCIVKLDGGKLVC----RTDRFSHIQEIKAGEMVETLTVGGTTMIRKSKKI 126 (126) T ss_dssp EEEECEEETTEEEE----ECSSCEEEEEEETTEEEEEEEETTEEEEEEEEEC T ss_pred EEEEEEEECCEEEE----EECCCcEEEEEECCEEEEEEEECCEEEEEEEEEC
Limitations
(1) The trimmed PDB file. This file shoud be used as input to a molecular replacement program (like PHASER) to find an initial set of potential solutions for refinement.
(2) An alignment file for use in Rosetta. The alignment file looks like the following:
## 1CRB_ 2qo4.PHASER.1.pdb # hhsearch scores_from_program: 0 1.00 2 DFNGYWKMLSNENFEEYLRALDVNVALRKIANLLKPDKEIVQDGDHMIIRTLSTFRNYIMDFQVGKEFEEDLTGIDDRKCMTTVSWDGDKLQCVQKGEKEGRGWTQWIEGDELHLEMRAEGVTCKQVFKKV 0 AFSGTWQVYAQENYEEFLRAISLPEEVIKLAKDVKPVTEIQQNGSDFTITSKTPGKTVTNSFTIGKEAEIT--TMDGKKLKCIVKLDGGKLVCRTD----RFSHIQEIKAGEMVETLTVGGTTMIRKSKKI --
The first line is '##' followed by: (1) a code for the output file (2) the name of the template PDB If the template is later renamed (by the MR program for example) then this would have to be changed.
The second line identifies the source of the alignment; the third just keep as it is.
The fourth line is the target sequence and the fifth is the template ... the number is an 'offset', identifying where the sequence starts. However, the number doesn't use the PDB resid but just counts residues _starting at 0_. The sixth line is '--'.
For more information on the steps following this script, see steps 2-5 of the tutorial in /demo/electron_density/molecular_replacement.