This is an archive copy of the IUCr web site dating from 2008. For current content please visit https://www.iucr.org.

Re: STRUCT_REF_SEQ_DIF

Paula Fitzgerald (paula_fitzgerald@Merck.Com)
Mon, 11 Mar 96 21:14:13 EST


Herb Bernstein writes:

> STRUCT_REF_SEQ_DIF could use the token _struct_ref_seq_dif.db_seq_num
> to specify the sequence position at which a point difference occurs
> in terms of the sequence of the referenced database.  While this can
> be inferred by very careful use of _struct_ref_seq.db_align... tokens,
> that is clumsy and error-prone when there are insertions or deletions.

What Herb writes here raises an issue of schema.  When I wrote the STRUCT_REF
data items, my thinking was that the sequence in the data block was intended
to match the referenced sequence in the database in phase - that is, that
we would allow for point differences (ala vs. ile, for instance) but that if
the match involved insertions and deletions, they would have to be handled
via careful use of the alignment beg and end data items.
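To make the bookkeeping concrete, here is a minimal sketch of how a database position could be inferred from alignment beg/end values when each aligned segment is in phase with the database. The class and field names (AlignSegment, seq_beg, seq_end, db_beg, db_end) are hypothetical stand-ins for one row of alignment data, not dictionary item names, the numbers are made up, and plain integer residue numbering is assumed.

```python
from typing import List, NamedTuple, Optional


class AlignSegment(NamedTuple):
    """Hypothetical stand-in for one aligned, in-phase segment."""
    seq_beg: int  # first residue of the segment in the data block sequence
    seq_end: int  # last residue of the segment in the data block sequence
    db_beg: int   # first matching residue in the database sequence
    db_end: int   # last matching residue in the database sequence


def db_position(segments: List[AlignSegment], seq_pos: int) -> Optional[int]:
    """Return the database position matching seq_pos, or None if the
    residue lies outside every aligned segment (e.g. inside an insertion)."""
    for seg in segments:
        if seg.seq_beg <= seq_pos <= seg.seq_end:
            # Within a segment the two sequences run in phase,
            # so the offset from the segment start carries over.
            return seg.db_beg + (seq_pos - seg.seq_beg)
    return None


# Made-up example: residues 1-50 match database 1-50, residues 51-92 are
# a 42-residue insertion with no database counterpart, and residues
# 93-150 match database 51-108.
segments = [
    AlignSegment(seq_beg=1, seq_end=50, db_beg=1, db_end=50),
    AlignSegment(seq_beg=93, seq_end=150, db_beg=51, db_end=108),
]

print(db_position(segments, 100))  # residue in the second segment
print(db_position(segments, 60))   # residue inside the insertion
```

In this sketch an insertion or deletion is expressed only by splitting the match into separate segments, so a point difference at residue 100 would be located in the database by looking up its segment and carrying the offset over.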

Now that Herb raises this issue, I realize that we could try and be more
sophisticated about this, but my personal feeling is that this could easily
get out of hand if we tried to do something that would cope with (for 
instance) a 42-residue insertion.  So I am inclined to leave the data items
the way they are, but to adjust the definitions for the categories to say
explicitly that insertions and deletions should be handled with appropriate
use of the beg and end of the sequence matched.  But I'll wait for some
comment before I do that.

Paula

********************************************************************************
 Dr. Paula M. D. Fitzgerald  ______________ voice and FAX: (908) 594-5510
   Merck Research Laboratories ______________ email: paula_fitzgerald@merck.com
     P.O. Box 2000, Ry50-105     ______________ or bean@merck.com           
       Rahway, NJ 07065  USA 
         (for express mail use 126 E. Lincoln Ave. instead of P. O. Box 2000)  
********************************************************************************