GROWING INTEREST IN IMAGE CAPTURE IN PATHOLOGY. RELATIVELY LITTLE ATTENTION TOWARD IMAGE RETRIEVAL. LARGE IMAGE ARCHIVES BECOME IMAGE CEMETERIES: BURIED IMAGES. INDEXING BY SINGLE DIAGNOSTIC TERM MAY IMPEDE RETRIEVAL BY ALTERNATE TERMINOLOGY. INFORMATION RETRIEVAL: MOST FUNDAMENTAL PROBLEM FACING ANY ARCHIVIST. NO VALUE IN ARCHIVING UNRETRIEVABLE IMAGES.
PATHOLOGY IMAGES INDEXED USING UMLS UNIFIED MEDICAL LANGUAGE SYSTEM (UMLS). ENCODE IMAGES UNDER ALL PATHOLOGIC CONCEPTS IMAGE LEGEND-TEXTS. IMAGES LOADED INTO JOHNS HOPKINS AUTOPSY RESOURCE IMAGE ARCHIVE (JHAR-IA). www.netautopsy.org CLICK ON: 5000 IMAGES.
6,241 LEGEND-TEXTS FROM ELECTRONIC FASCICLES OF AFIP. NON-COPYRIGHTED 5,465 IMAGES COMPRESSED 1:10 AS JPEG FILES INDEXING SOFTWARE WRITTEN IN: M-LANGUAGE (FORMERLY, MUMPS). DISPLAY SOFTWARE WRITTEN IN:
PRACTICAL EXTRACTION AND REPORTING LANGUAGE (PERL).
UNIFIED MEDICAL LANGUAGE SYSTEM (UMLS) : DEVELOPED BY U.S. NATIONAL LIBRARY OF MEDICINE (USNLM) IN 1986. PURPOSE: AID DEVELOPMENT OF SYSTEMS TO RETRIEVE ELECTRONIC BIOMEDICAL INFORMATION. http://www.nlm.nih.gov/research/umls/ LAST UPDATED: March 19, 1999. SIZE: 96,412,092 BYTES. CONCEPT UNIQUE IDENTIFIERS (CUIs): 625,530, MAX=C0700344. SYNONYMS: 1,362,823. LANGUAGE: PRIMARILY ENGLISH. PARTIAL TRANSLATIONS: GERMAN, FRENCH, SPANISH, ITALIAN, RUSSIAN, DUTCH, PORTUGUESE, HUNGARIAN, FINNISH, SWEDISH, NORWEGIAN, DANISH. OVER 50 SOURCE-VOCABULARIES.
C0001625|ENG|P|L0001625|PF|S0011239|Adrenal Glands| C0001625|ENG|P|L0001625|VC|S0352314|ADRENAL GLANDS| C0001625|ENG|P|L0001625|VC|S0354521|Adrenal glands| C0001625|ENG|P|L0001625|VO|S0354515|Adrenal gland, NOS| C0001625|ENG|P|L0001625|VO|S0799809|Adrenal gland <1>| C0001625|ENG|P|L0001625|VS|S0002402|Adrenal gland| C0001625|ENG|P|L0001625|VS|S0354508|Adrenal Gland| C0001625|ENG|P|L0001625|VS|S0414419|adrenal gland| C0001625|ENG|P|L0001625|VW|S0044868|Glands, Adrenal| C0001625|ENG|P|L0001625|VWS|S0044829|Gland, Adrenal| C0001625|ENG|S|L0579081|PF|S0740979|Suprarenal gland| C0001625|ENG|S|L0847296|PF|S0892764|Glandula suprarenalis|EACH INDIVIDUAL RECORD DELINEATED BY NEWLINE BREAK. WITHIN EACH RECORD, SEVEN FIELDS, VARIABLE IN LENGTH, SEPARATED BY VERTICAL PIPE, |, ASCII 124. FIELD 1: CONCEPT UNIQUE IDENTIFIER, CUI, ( C0001625 ) . FIELD 2: LANGUAGE DESIGNATION: ENG, GER, FRE, etc. FIELD 4: LEXICAL UNIQUE IDENTIFIER, LUI, i.e., L0001625, L0579081, L0847296. FIELD 6: STRING UNIQUE IDENTIFIER, SUI, i.e., S0011239, S0352314, S0354521,.... FIELD 7: TEXT FIELD, MATCHED TO A STRING IN IMAGE-LEGEND-TEXT.
CELLULAR BLUE NEVUS (C0334448). BLUE NEVUS (C0206736). CELL (C0007634). BLUE (C0332584). NEVUS (C0027960).
NATURAL-LANGUAGE MEDICAL TEXT: SEQUENCE OF MEDICAL CONCEPTS SEPARATED BY GRAMMATICAL OBJECTS. GRAMMATICAL OBJECTS, OR BARRIER WORDS: NUMERALS, PUNCTUATION, SINGLE LETTERS, ARTICLES, PREPOSITIONS, COMMON VERBS AND MODIFIERS. MEDICAL CONCEPTS, OR KEYWORDS: ARE ONE-WORD OR MULTIPLE-WORD TERMS CONSISTING OF MEDICALLY SIGNIFICANT WORDS.
LENTIGINOUS COMPOUND NEVUS . this LESION is an EARLY COMPOUND NEVUS , because a NEST has MIGRATED from the EPIDERMIS into the DERMIS ( lower right of c ) . elsewhere , the HISTOLOGY is that of a SIMPLE LENTIGO .barrier words displayed in lower case. KEYWORDS DISPLAYED IN UPPER CASE.
| LEGEND NAME |
UMLS CODE |
UMLS NAME |
|---|---|---|
| LENTIGINOUS | C0023321 | Lentigo, NOS |
| COMPOUND NEVUS | C0259781 | Compound Nevus |
| LESION | C0012634 | Lesion, NOS |
| EARLY | C0205085 | Early |
| COMPOUND NEVUS | C0259781 | Compound Nevus |
| NEST | C0205234 | Focal |
| MIGRATED | C0232902 | Migration, NOS |
| EPIDERMIS | C0014520 | Epidermis, NOS |
| DERMIS | C0011646 | Dermis, NOS |
| LOWER | C0205104 | Inferior |
| RIGHT | C0205090 | Right |
| HISTOLOGY | C0019638 | Histologic |
| SIMPLE LENTIGO | C0302255 | Lentigo Simplex |
SIMILAR ENGLISH WORDS WITH DIFFERENT MEANING, DEPENDING ON CONTEXT. IRIS (C0022077) AS PART OF EYE. IRIS (C0331686) AS FLOWER. IRIS (C0331686) AS FLOWER IS RETIRED. ADNEXA WITHOUT NEARBY DISAMBIGUATING WORD: SKIN ADNEXA (C0221943) UTERINE ADNEXA (C0001575) OCULAR ADNEXA (C0229243)
5,465 SEPARATE IMAGE-LEGEND-TEXTS WERE ASSIGNED UMLS-CODES. EVERY IMAGE ASSIGNED THE UMLS CODES FOR PHOTOGRAPHY (C0441468) AND PATHOLOGY (C0030664). PAPILLARY (C0205312): 166 IMAGE-LEGENDS. (THYROID NEOPLASM, BREAST NEOPLASM, URINARY TRACT NEOPLASM, PAPILLARY FEATURE). IRREGULAR (C0205271): 136 IMAGE-LEGENDS. ( GENERAL CONCEPT, NUCLEAR FEATURE, TUMOR BOUNDARY, CELLULAR DISTRIBUTION....) NOT VERY SPECIFIC AS INDEXING TERMS. MODIFIERS OCCUR AT HIGHER FREQUENCY THAN DIAGNOSES. SPECIFIC TERMS: LOW FREQUENCY. MALIGNANT MENINGIOMA (C0259785): NINE IMAGE-LEGENDS. ROSAI DORFMAN DISEASE (C0019625): EIGHT IMAGE-LEGENDS. DERMOID CYST (C0011649): EIGHT IMAGE-LEGENDS. CHONDROBLASTOMA (C0008441): EIGHT IMAGE-LEGENDS.
IMAGES ARE CONSTITUTIVELY NON-HIERARCHICAL. EXAMPLE: MEDULLARY CARCINOMA OF THYROID (C0238462) JUSTIFIABLY FILED UNDER: THYROID GLAND (C0040132), ORGAN OF ORIGIN. TUMOR (C0027651) TUMOR, MALIGNANT (C0006826) C CELLS (C0229579), FROM WHICH IT ARISES. MULTIPLE ENDOCRINE NEOPLASIA TYPE I SYNDROME (C0025267). MUST BE DISTINGUISHED FROM: MEDULLARY CARCINOMA OF BREAST (C0206693). ALPHABETIC SIMILARITY NO PATHOGENETIC RELATIONSHIP
NO HIERARCHICAL WAY OF ORGANIZING IMAGES. IMAGE RETRIEVAL MECHANISM BY PATHOLOGIC CONCEPTS. IMAGE RETRIEVAL SYSTEM USING ALL PATHOLOGY CONCEPTS WITHIN THE IMAGE IS ACHIEVABLE. IMAGES AUTOMATICALLY UMLS-ENCODED FROM PRE-EXISTING TEXT DESCRIBING THE IMAGES. AUTOMATIC UMLS ENCODING CAN USE THE ENTIRE UMLS NOMENCLATURE, OVER 700,000 DISTINCT CONCEPTS. IMAGES STORED IN NON-HIERARCHICAL, NON-ORDERED FASHION. ENCAPSULATION OF UMLS TERMS WITH IMAGES PERMITS LARGE MERGED IMAGE ARCHIVES. IMAGE RETRIEVAL VIA UMLS-ENCODED INDEX MAY SUCCEED, EVEN WHEN A CHOSEN QUERY TERM NOT INCLUDED IN IMAGE-LEGEND.
| RANK | FREQUENCY | UMLS CODE | UMLS NAME |
|---|---|---|---|
| 1 | 5465 | C0030664 | PATHOLOGY |
| 2 | 5465 | C0441468 | PHOTOGRAPH |
| 3 | 2016 | C0007634 | CELL |
| 4 | 1812 | C0441469 | PICTURE |
| 5 | 1140 | C0027651 | NEOPLASM |
| 6 | 1102 | C0024109 | LUNG |
| 7 | 644 | C0012634 | DISEASE |
| 8 | 617 | C0030705 | PATIENT |
| 9 | 581 | C0205165 | SMALL |
| 10 | 569 | C0205234 | FOCAL |
| 11 | 549 | C0205164 | LARGE |
| 12 | 528 | C0233426 | APPEAR |
| 13 | 522 | C0006141 | BREAST |
| 14 | 487 | C0205397 | OBSERVE |
| 15 | 466 | C0038128 | STAIN |
| 16 | 458 | C0150312 | PRESENT |
| 17 | 421 | C0015392 | EYE |
| 18 | 413 | C0445247 | SAME |
| 19 | 408 | C0010834 | CYTOPLASM |
| 20 | 407 | C0205392 | SOME |
| 21 | 401 | C0205182 | ATYPICAL |
| 22 | 387 | C0022646 | KIDNEY |
| 23 | 375 | C0205402 | PROMINENT |
| 24 | 365 | C0449774 | PATTERN |
| 25 | 347 | C0205091 | LEFT |
| 26 | 341 | C0040132 | THYROID |
| 27 | 336 | C0205160 | NEGATIVE |
| 28 | 324 | C0205090 | RIGHT |
| 29 | 323 | C0042149 | UTERUS |
| 30 | 320 | C0449470 | TYPE |
| 31 | 316 | C0007097 | CANCER |
| 32 | 300 | C0205172 | MANY |
| 33 | 300 | C0370003 | SPECIMEN |
| 34 | 294 | C0014609 | EPITHELIUM |
| 35 | 292 | C0262950 | BONE |
| 36 | 285 | C0332285 | ARISING FROM |
| 37 | 285 | C0444186 | SMEAR |
| 38 | 279 | C0005953 | BONE MARROW |
| 39 | 275 | C0017542 | GIEMSA STAIN |
| 40 | 272 | C0018964 | HEMATOXYLIN |
| 41 | 270 | C0205428 | AFFECTING |
| 42 | 269 | C0007874 | CERVIX |
| 43 | 268 | C0431085 | TUMOR CELLS |
| 44 | 262 | C0042591 | VESSEL |
| 45 | 258 | C0014448 | EOSIN |
| 46 | 253 | C0205250 | ELEVATED |
| 47 | 249 | C0024264 | LYMPHOCYTE |
| 48 | 247 | C0205308 | OLD |
| 49 | 236 | C0439508 | YEAR |
| 50 | 234 | C0392746 | WELL |