This paper presents a formal linguistic approach to representing generic chemical formulae in chemical patents, using the ALWIN line-formula notation. The representation is intended to permit searches for specific structures, and for substructures, that are included within the generic expression. The relevance of pattern analysis methods to this problem is highlighted, and preliminary suggestions on algorithm development are put forward.