WHAT IS A
MEDICAL ONTOLOGY?
DRAFT COPY ONLY.
http://www.medparse.com/whatonto.htm

G. William Moore, MD, PhD [1,2,3],

From: Pathology and Laboratory Medicine Service (113), Baltimore VA Maryland Health Care System [1], Baltimore, MD.
Department of Pathology, University of Maryland School of Medicine [2], Baltimore, MD.
Department of Pathology, The Johns Hopkins Medical Institutions [3], Baltimore, MD.



1. MEDICAL ONTOLOGIES.



      1. An ONTOLOGY is a (Platonic) description of essential reality, i.e., what actually is, as opposed to what one can see (observation, accident), or what one can know (epistemiology) (12). The term ontology was coined by two German philosophers, Göckel and Lorhard, in 1613, and first appeared in English in 1721. Quine (13) views ontology as the metaphysical commitments or presuppositions embodied in the different natural sciences. For example, the belief that a cancer can metastasize would be an ONTOLOGICAL COMMITMENT. In the philosophy and practice of science, ontology goes under various names: essence, reality, Mind of God, nature, gold standard, or mathiverse (14). In medical informatics, ontology has come to mean a structured list of concepts, typically prepared by an expert or panel of experts.

      2. With the ease of posting structured lists on the Internet, and with EXTENDED MARKUP LANGUAGE (XML) (38,39) as an emerging standard for such lists, it is likely that the next decade will witness an explosion of public medical ontologies, both amateur and professional.

      3. The importance of ontologies has been recognized by the U. S. Defense Advanced Research Projects Agency (DARPA), the original sponsor of the Internet, which has proposed guidelines for a formal ontology AGENT MARKUP LANGUAGE, that employs the ONTOLOGY INFERENCE LAYER (15,16).

      4. A simple ontology is illustrated by the observation at autopsy that CHRONIC-PASSIVE-CONGESTION-LIVER (CPCL = C0700148-C0721399 in UMLS codes) (40,41). often accompanies HEART HYPERTROPHY (HH = C0795691-C0333959). In an approximate sense, HH causes CPCL (42). Thus, one might expect a hypothetical collection, say, of 10,000 cases to distribute as follows:
              HEART HYPERTROPHY (HH=C0795691-C0333959)
 ____________________________________________
|            |  NOT-HH |     HH   |  Total  |
|___________________________________________|
|  NOT-CPCL  |   7,000 |    1,000 |  8,000  |
|___________________________________________|
|    CPCL    |       0 |    2,000 |  2,000  |
|___________________________________________|
|   Total    |   7,000 |    3,000 | 10,000  |
|___________________________________________|
 CHRONIC-PASSIVE
 -CONGESTION-LIVER
 (CPCL=C0700148-C0721399).


      5. That is, most cases are negative for both features; HH anticipates CPCL in some cases; but there should be only rare cases with CPCL but without HH. Therefore, in the language of first-order propositional logic, CPCL IMPLIES HH, or NOT-CPCL IOR HH.

      6. Such correlations (2x2 CONTINGENCY TABLES), could be edited for redundancy and nonsense correlations (43).

      7. As necessary, a collection of such 2x2 contingency tables could be ORDERED BY IMPORTANCE, based upon the frequencies of cases appearing in the lower right corner of the table.

      8. Medical ontologies can be used to break into de-identified, public databases BY INFERENCE, which are otherwise well-protected. It will become the obligation of the data-holder to anticipate any inferential break-ins, and pre-empt them, by removing vulnerable data (5,44).



2. REFERENCES.



      1. U. S. Code of Federal Regulations. 1995. 45 CFR Subtitle A (10-1-95 Edition), part 46.101 (b) (4).
U. S. Department of Health and Human Services. Office of the Secretary.
The complete Common Rule document (45CFR46), at URL:
http://www.uaf.edu/oar/irb/45cfr46.html
or at URL:
http://ohrp.osophs.dhhs.gov/humansubjects/guidance/45cfr46.htm

      2. U. S. Code of Federal Regulations. 1999. 45 CFR Parts 160 - 164. Standards for Privacy of Individually Identifiable Health Information; Proposed Rule.
Department of Health and Human Services. Office of the Secretary.
Fed Regist. 1999 Nov 3;64(212):59917-59966. http://aspe.hhs.gov/admnsimp/

      3. National Cancer Institute's Confidentiality Brochure, at URL:
http://www-cdp.ims.nci.nih.gov/policy.html

      4. Moore GW, Berman JJ.
Anatomic Pathology Data Mining.
In: Cios KJ, ed. Medical Data Mining and Knowledge Discovery.
2001. XVIII, 502 pp. 98 figs., 98 tabs. Hardcover.
ISBN: 3-7908-1340-0.
Copyright Springer-Verlag: Berlin/Heidelberg 1999.

      5. Sweeney L.
Computational Disclosure Control: A Primer on Data Privacy Protection.
PhD Thesis. Massachusetts Institute of Technology. Spring, 2001. Draft.
http://www.swiss.ai.mit.edu/classes/6.805/articles/privacy/sweeney-thesis-draft.pdf

      6. Sweeney L.
Privacy and medical-records research.
N Engl J Med. 1998 Apr 9;338(15):1077.
PMID: 9537887; UI: 98181820.

      7. Sweeney L.
Guaranteeing anonymity when sharing medical data, the Datafly System.
Proc AMIA Annu Fall Symp. 1997;:51-55.
PMID: 9357587; UI: 98020458.

      8. Gödel K.
Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme. I.
Monatsh Math u Physik. 1931; 38: 173-198.
Translation: Gödel K. On formally undecidable propositions of Principia Mathematica and related systems. New York: Basic Books. 1962.

      9. Nagel E, Newman JR.
Gödel's Proof.
New York: New York University Press. 1958.
ISBN 0-8147-0325-9.

      10. Hofstadter DR.
Gödel, Escher, Bach. An Eternal Golden Braid.
New York: Basic Books. 1979.
ISBN 0-465-02656-7, 777 pages.

      11. Casti JL, DePauli W.
Gödel. A Life of Logic.
Cambridge, MA: Perseus Publishing. 2000.
ISBN 0-7382-0274-6, 210 pages.

      12. Smith B.
Mereotopology: A Theory of Parts and Boundaries.
Data and Knowledge Engineering. 1996;20:287-303.

      13. Quine WVO.
Ontological relative, and other essays.
New York: Columbia University Press. 1969;:.

      14. Stewart I.
Flatterland. Like Flatland. Only More So.
Cambridge, MA: Perseus Publishing. 2001.
ISBN 0-7382-0442-0, 301 pages.

      15. U. S. Defense Advanced Research Projects Agency (DARPA). Agent Markup Language.
http://www.daml.org

      16. U. S. Defense Advanced Research Projects Agency (DARPA). Ontology Inference Layer.
http://www.ontoknowledge.org/oil

      17. Boole G.
An Investigation of the Laws of Thought. On which are founded the Mathematical Theories of Logic and Probabilities.
New York: Dover Publications, Inc. 1954.

      18. Lewis CI, Langford CH.
Symbolic Logic. Second Edition.
New York: Dover Publications, Inc. 1932.

      19. Borkowski L.
Formale Logik.
Muenchen: C. H. Beck.

      20. Zeman J.
Modal Logic: The Lewis Modal Systems.
Oxford at the Clarendon Press. 1972;:.

      21. Haack S.
Deviant Logic, Fuzzy Logic: Beyond the Formalism.
Paperback - 292 pages (November 1996).
Chicago: University of Chicago Press, 1996.
ISBN: 0226311341 ;

      22. Nguyen HT, Walker EA.
A First Course in Fuzzy Logic.
Hardcover, 300 pages, 2 edition, July 1999.
New York: CRC Press.
ISBN: 0849316596.

      23. Quine WVO, Ullian JS.
The Web of Belief. Second Edition.
Paperback 2nd edition. February 1, 1978. McGraw-Hill Higher Education;
ISBN: 0075536099.

      24. Beeson P.
Sutton's Law Paper.
Am J Med. 1960.

      25. Sutton WF, with Linn E.
Where the money was.
Out of print.

      26. Moore GW, Hutchins GM, Bulkley BH.
Certainty levels in the nullity method of symbolic logic: application to the pathogenesis of congenital heart malformations.
J Theor Biol. 1979 Jan 7;76(1):53-81.

      27. Moore GW, Hutchins GM.
Effort and demand logic in medical decision making.
Metamedicine 1980;1:277-304.

      28. Cios KJ, Moore GW. 2000.
Medical Data Mining and Knowledge Discovery: An Overview.
In: Cios KJ, ed. Medical Data Mining and Knowledge Discovery.
2001. XVIII, 502 pp. 98 figs., 98 tabs. Hardcover.
ISBN: 3-7908-1340-0.
Copyright Springer-Verlag: Berlin/Heidelberg 1999.

      29. College of American Pathologists.
Surgical Pathology Case Summaries.
http://www.cap.org

      30. Collaborative Prostate Cancer Tissue Resource.
http://www.prostatehealth.org/cpctr

      31. Suppes P.
Introduction to Logic.
New York: Van Nostrand. 1957.

      32. Bernays P.
Axiomatic Set Theory.
New York: Dover Publications. 1968.

      33. Suppes P.
Axiomatic Set Theory.
New York: Dover Publications. 1972.
ISBN 0486616304.

      34. Davis M.
Computability and Unsolvability.
New York: Dover Publications, Inc. 1958.
ISBN 0-486-61471-9, 248 pages.

      35. Andrews GL.
Number Theory.
New York: Dover Publications, Inc. 1971.
ISBN 0-486-68252-8, 259 pages.

      36. Schneier B.
Applied Cryptography, Second Edition. Protocols, Algorithms, and Source Code in C.
New York: John Wiley & Sons, 1996.

      37. Tarjan RE.
Data Structures and Network Algorithms. CBMS-NSF Regional Conference Series in Applied Mathematics.
Paperback, December, 1983.
New York: Society for Industrial & Applied Mathematics.
ISBN: 0898711878.

      38. Worldwide Web Consortium.
http://www.w3.org

      39. Light R.
Presenting XML.
Sams.net Publishing. 1997.

      40. U.S. National Library of Medicine.
Unified Medical Language System.
http://www.nlm.nih.gov/research/umls/

      41. U. S. National Library of Medicine.
UMLS Knowledge Sources. Twelfth Edition. Unified Medical Language System.
U. S. Department of Health and Human Services. National Institutes of Health. National Library of Medicine. 2001.

      42. Vigorita VJ, Moore GW, Hutchins GM.
Absence of correlation between coronary arterial atherosclerosis and severity or duration of diabetes mellitus of adult onset.
Am J Cardiol. 1980;46:535-542.

      43. Moore GW, Hutchins GM.
Consistency versus completeness in medical decision making: Application to 155 patients autopsied after coronary artery bypass graft surgery.
Proc 6th Annu Symp Comput Appl Med Care. 1982;6:805-811.

      44. Moore GW, Brown LA, Miller RE.
Set Theory Definition and Algorithm for Medical De-Identification.
Arch Pathol Lab Med. 2001;:in press.
http://www.netautopsy.org/apep00st.htm

      45. Johns Hopkins Autopsy Resource.
http://www.netautopsy.org/

      46. Moore GW, Berman JJ, Hanzlick RL, Buchino JJ, Hutchins GM. 1996.
A prototype Internet autopsy database. 1625 consecutive fetal and neonatal autopsy facesheets spanning 20 years.
Arch Pathol Lab Med. 1996;120:782-785.

      47. Hornung J.
Kritik der Signifikanztests.
Metamed 1977;1:325-345.

      48. Moore GW, Hutchins GM.
The persistent importance of autopsies.
Mayo Clin Proc. 2000 Jun;75(6):557-8.

      49. Wilbur WJ.
Overview of Books at NCBI.
http://www.ncbi.nlm.nih.gov:80/books/mboc/bookshelp/bookover.html#link

      50. Shared Pathology Informatics Network.
http://grants.nih.gov/grants/guide/rfa-files/RFA-CA-01-006.html

      51. Nelson SJ, Cole WG, Tuttle MS, Olson NE, Sherertz DD.
Recognizing new medical knowledge computationally.
Proc Annu Symp Comput Appl Med Care. 1993;17:409-413.

      52. Moore GW, Polacsek RA, Erozan YS, de la Monte SM, Miller RE, Hutchins GM, Riede UN.
Multilingual translation techniques in the analysis of narrative medical text.
Comput Methods Programs Biomed. 1986 Mar;22(1):35-42.

      53. Chomsky N.
Aspects of the Theory of Syntax.
Cambridge, MA: The MIT Press. 1965.



Last Updated: September 16, 2001, by G. William Moore, MD, PhD.