Abstract
The explosion of DNA sequence data from genome projects presents many
challenges. For instance, we must extend our current knowledge of protein
structure and function so that it can be applied to these new sequences.
The derivation of rules for the relationships between sequence and structure
allow us to recognize a common fold by the use of tertiary templates. New
techniques enable us to begin to meet the challenge of rule-based modelling
of distantly related proteins. This paper describes an integrated and knowledge-based
approach to the prediction of protein structure and function which can
maximize the value of sequence information.