Japanese Language Annotation
of an Internet
Pathology Image Archive.

Daisuke Nonaka, M.D. [1]
G. William Moore, MD, PhD [1,2,3]
Yoichi Satomura, M.D. [4]



      From: Department of Pathology, University of Maryland School of Medicine, Baltimore, Maryland [1]; Pathology and Laboratory Medicine Service, Veterans Affairs Maryland Health Care System, Baltimore, Maryland [2]; Department of Pathology, The Johns Hopkins Medical Institutions, Baltimore, Maryland [3]; and Department of Medical Informatics, Chiba University School of Medicine, Chiba, Japan [4].

TABLE OF CONTENTS.


1. ABSTRACT.
2. INTRODUCTION.
3. ORGANIZATION OF WRITTEN JAPANESE.
4. MANAGING AMBIGUOUS JAPANESE KANJI.
5. NEW WORD FORMATION FOR JAPANESE WORDS.
6. MATERIALS.
7. UNIFIED MEDICAL LANGUAGE SYSTEM.
8. BARRIER WORD METHOD.
9. BARRIER WORD METHOD: JAPANESE TRANSLATION.
10. SAMPLE QUERY: ENTER JAPANESE ROMAJI.
11. SAMPLE QUERY: SELECT ENGLISH TRANSLATION.
12. SAMPLE QUERY: SELECT UMLS TERM.
13. SAMPLE QUERY: SELECT AFIP LEGEND TITLE.
14. SAMPLE QUERY: VIEW JAPANESE ANNOTATIONS.
15. RESULTS.
16. CONCLUSION.
17. REFERENCES.
18. ZIPF DISTRIBUTION: JAPANESE TRANSLATION.


1. ABSTRACT.



      Background: Anatomic pathology images in a large archive must be recoverable both by pathologic diagnosis and by descriptive content. The Image Archive of The Johns Hopkins Autopsy Resource website (JHAR-IA), at http://www.netautopsy.org, consists of over five thousand uncopyrighted anatomic pathology images from the Armed Forces Institute of Pathology Electronic Fascicles (AFIP-EF). The images have been computer-indexed in the Unified Medical Language System (UMLS), based upon the corresponding English-language legend-texts. For Japanese speakers who use English as a second language, it is helpful to annotate this text in Japanese, so that images may be recalled by Japanese keywords. Japanese is a particularly challenging language for Internet annotation, since text may be written in any of three alphabets (Katakana, Hiragana, Kanji), and the Kanji system is ambiguous and non-phonetic.

      Design: All words and UMLS concepts in the pathology image legend-texts of the AFIP-EF posted on the JHAR-IA were pointed to phonetic Japanese transliterations in Katakana. Some words and concepts were pointed to Hiragana words or to Kanji ideograms, displayed using the Shift Japan Industrial Standard (SJIS) font, available on most computers marketed in Japan. Indexing software was written in M-language (formerly, MUMPS), and display software was written in the Practical Extraction and Reporting Language (PERL). Both software systems employed a unique English name to display each Kanji ideogram, as well as phonetic On-readings and Kun-readings, specified by Japanese Government Ministry of Education publications.

      Results: There were 5,465 pathology images posted on the JHAR-IA. Their legend-texts contained 5,364 distinct words and 3,016 distinct UMLS concepts, ranging in frequency from 5,465 occurrences apiece of two UMLS concepts to one occurrence apiece of 875 UMLS concepts. The Japanese annotations included 632 Kanji terms, each assigned a unique English name.

      Conclusion: English is the dominant language of the Internet, but non-native English speakers may need assistance in locating images based upon non-English keywords. The Johns Hopkins Autopsy Resource Image Archive website may be queried on the Internet with either English or Japanese query-words, and returns bilingual annotations.


2. INTRODUCTION.




  • ANATOMIC PATHOLOGY IMAGES RECOVERABLE BY DIAGNOSIS AND DESCRIPTIVE CONTENT.

  • IMAGE ARCHIVE OF JOHNS HOPKINS AUTOPSY RESOURCE WEBSITE (JHAR-IA):
    http://www.netautopsy.org


  • OVER FIVE THOUSAND UNCOPYRIGHTED ANATOMIC PATHOLOGY IMAGES FROM ARMED FORCES INSTITUTE OF PATHOLOGY ELECTRONIC FASCICLES (AFIP-EF).

  • IMAGES COMPUTER-INDEXED IN UNIFIED MEDICAL LANGUAGE SYSTEM (UMLS).

  • GOAL: TO ANNOTATE TEXT IN JAPANESE



3. ORGANIZATION OF WRITTEN JAPANESE.


    FOUR JAPANESE ALPHABETS.


  • ROMAJI: ROMAN ALPHABET, 46 BASIC KANA SYLLABLES.
         a    i    u    e    o
        ka   ki   ku   ke   ko
        sa  shi   su   se   so
        ta  chi  tsu   te   to
        na   ni   nu   ne   no   n
        ha   hi   fu   he   ho
        ma   mi   mu   me   mo
        ya        yu        yo
        ra   ri   ru   re   ro
        wa                  wo
      


  • KANJI: MEDICAL KEYWORDS.


  • INCLUDES MOST BODYSITE, MORPHOLOGY, AND DISEASE NAMES.

  • NON-PHONETIC.

  • PRONUNCIATION GUIDE FROM JAPANESE MINISTRY OF EDUCATION.

  • BORROWED FROM CHINESE (TANG DYNASTY, 8TH CENTURY).

  • ABOUT 1800 COMMON KANJI.


  • HIRAGANA: PHONETIC SYLLABARY FOR GRAMMATICAL PARTICLES.


  • 46 HIRAGANA, ADDITIONAL DIACRITICAL MARKS.

  •    a あ    i い    u う    e え    o お ....
       




  • KATAKANA: PHONETIC SYLLABARY FOR FOREIGN LOAN-WORDS.


  • 46 KATAKANA, ADDITIONAL DIACRITICAL MARKS.

  •    a ア    i イ    u ウ    e エ    o オ ....
       





4. MANAGING AMBIGUOUS JAPANESE KANJI.




  • UNIQUE ENGLISH NAME FOR EACH KANJI, REMEMBERED EASILY BY A NATIVE ENGLISH SPEAKER.

  • EXAMPLE: COW.


  • 139:141 牛 8B8D cattle, cow, bull, ox, RESTAURANT-COW
    (CHINESE RESTAURANT MENU)

  • 137:078 丑 894E cattle, cow, bull, ox, CHINESE-ZODIAC-COW
    (YEAR OF THE OX: 1937, 1949, 1961, 1973, 1985, 1997, 2009....)


  • ON-READING (O), KUN-READING (K), ENGLISH MEANING (E):


  • 147:140 東 938C O=toh K=higashi E=east, TOKYO-ONE

  • 139:158 京 8B9E O=kyoh E=capital, metropolis, TOKYO-TWO
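
The entries above pair the two Shift-JIS bytes of each Kanji in decimal (for example 139:141) with the same two bytes in hexadecimal (8B8D). The following is a minimal Python sketch of that correspondence, not the authors' M-language or PERL code. The function name `sjis_entry` is hypothetical, and the sketch assumes that Python's built-in `shift_jis` codec agrees with the SJIS font used on the website.

```python
def sjis_entry(decimal_pair: str) -> tuple[str, str]:
    """Convert a 'lead:trail' decimal byte pair into (hex code, kanji)."""
    lead, trail = (int(part) for part in decimal_pair.split(":"))
    raw = bytes([lead, trail])
    # Assumption: Python's shift_jis codec stands in for the SJIS font.
    return raw.hex().upper(), raw.decode("shift_jis")


# Decimal pairs taken from the Kanji entries listed above.
for pair in ("139:141", "147:140", "139:158"):
    code, kanji = sjis_entry(pair)
    print(pair, kanji, code)
```

Keeping the decimal pair, the hexadecimal code and the unique English name side by side lets an English-speaking editor identify a Kanji even on a machine that cannot display it.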



5. NEW WORD FORMATION FOR JAPANESE WORDS.




  • SAY IT IN ENGLISH WITH A JAPANESE ACCENT.

  • EXAMPLE: COMPUTER => KONNPUTA コ ン プ タ

  • TRANSLITERATE INTO KATAKANA.
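
The transliteration step can be sketched in Python as a greedy longest-match lookup in the table of 46 basic kana syllables. This is an illustration under stated assumptions rather than the authors' software: the function names are hypothetical, the entry for "pu" is an added syllable with a diacritical mark (needed for the COMPUTER example), and the Katakana form is derived from the Hiragana form by the fixed Unicode offset between the two syllabaries.

```python
# The 46 basic kana, in the order of the Romaji table.
ROMAJI = ("a i u e o ka ki ku ke ko sa shi su se so ta chi tsu te to "
          "na ni nu ne no ha hi fu he ho ma mi mu me mo ya yu yo "
          "ra ri ru re ro wa wo n").split()
HIRAGANA = "あいうえおかきくけこさしすせそたちつてとなにぬねのはひふへほまみむめもやゆよらりるれろわをん"
# Assumption: one extra syllable with a diacritical mark, for the example.
TABLE = dict(zip(ROMAJI, HIRAGANA), pu="ぷ")


def romaji_to_hiragana(text: str) -> str:
    """Greedy longest-match transliteration; raises ValueError if stuck."""
    text, out, pos = text.lower(), [], 0
    while pos < len(text):
        for size in (3, 2, 1):
            chunk = text[pos:pos + size]
            if chunk in TABLE:
                out.append(TABLE[chunk])
                pos += len(chunk)
                break
        else:
            raise ValueError(f"no kana for {text[pos:]!r}")
    return "".join(out)


def hiragana_to_katakana(text: str) -> str:
    """Shift each Hiragana code point by 0x60 to reach its Katakana twin."""
    return "".join(
        chr(ord(c) + 0x60) if "\u3041" <= c <= "\u3096" else c for c in text
    )


print(hiragana_to_katakana(romaji_to_hiragana("konputa")))
print(romaji_to_hiragana("hai"))
```

Trying the longest chunk first means "na" is read as one syllable, while an "n" followed by a consonant falls through to the syllabic "n".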



6. MATERIALS.




  • 6,241 LEGEND-TEXTS FROM ELECTRONIC FASCICLES OF AFIP.

  • 5,465 NON-COPYRIGHTED IMAGES, COMPRESSED 1:10 AS JPEG FILES.

  • IMAGES LOADED INTO THE INTERNET AUTOPSY DATABASE IMAGE ARCHIVE: http://www.netautopsy.org



7. UNIFIED MEDICAL LANGUAGE SYSTEM (UMLS).




  • UNIFIED MEDICAL LANGUAGE SYSTEM (UMLS) : DEVELOPED BY U.S. NATIONAL LIBRARY OF MEDICINE (USNLM) IN 1986.

  • PURPOSE: AID DEVELOPMENT OF SYSTEMS TO RETRIEVE ELECTRONIC BIOMEDICAL INFORMATION.

  • URL: http://www.nlm.nih.gov/research/umls/

  • LAST UPDATED: March 19, 1999.

  • SIZE: 96,412,092 BYTES.

  • CONCEPT UNIQUE IDENTIFIERS (CUIs): 625,530, MAX=C0700344.

  • SYNONYMS: 1,362,823.

  • LANGUAGE: PRIMARILY ENGLISH.

  • PARTIAL TRANSLATIONS: GERMAN, FRENCH, SPANISH, ITALIAN, RUSSIAN, DUTCH, PORTUGUESE, HUNGARIAN, FINNISH, SWEDISH, NORWEGIAN, DANISH. NO JAPANESE.

  • OVER 50 SOURCE-VOCABULARIES.



8. BARRIER WORD METHOD.




  • NATURAL-LANGUAGE MEDICAL TEXT: SEQUENCE OF MEDICAL CONCEPTS SEPARATED BY GRAMMATICAL OBJECTS.

  • THE GRAMMATICAL OBJECTS, OR BARRIER WORDS: NUMERALS, PUNCTUATION, SINGLE LETTERS, ARTICLES, PREPOSITIONS, AND COMMON VERBS AND MODIFIERS.

  • MEDICAL CONCEPTS, OR KEYWORDS: ONE-WORD OR MULTIPLE-WORD TERMS, CONSISTING OF MEDICALLY SIGNIFICANT WORDS.



9. BARRIER WORD METHOD: JAPANESE TRANSLATION.




    BARRIER WORD METHOD: SAMPLE TEXT.
    LENTIGINOUS COMPOUND NEVUS . this LESION is an EARLY COMPOUND NEVUS , because a NEST has MIGRATED from the EPIDERMIS into the DERMIS ( lower right of c ) . elsewhere , the HISTOLOGY is that of a SIMPLE LENTIGO .


  • barrier words displayed in lower case.

  • KEYWORDS DISPLAYED IN UPPER CASE.



    LEGEND NAME       UMLS CODE   UMLS NAME          JAPANESE
    LENTIGINOUS       C0023321    Lentigo
    COMPOUND NEVUS    C0259781    Compound Nevus
    LESION            C0012634    LESION             病 気
    EARLY             C0205085    Early
    COMPOUND NEVUS    C0259781    Compound Nevus
    NEST              C0205234    FOCAL              局 所
    MIGRATED          C0232902    Migration
    EPIDERMIS         C0014520    Epidermis
    DERMIS            C0011646    Dermis
    LOWER             C0205104    Inferior
    RIGHT             C0205090    RIGHT
    HISTOLOGY         C0019638    Histologic
    SIMPLE LENTIGO    C0302255    Lentigo Simplex
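
A keyword phrase such as LENTIGINOUS COMPOUND NEVUS is coded above as two UMLS concepts. One way to obtain that split is a longest-match lookup against the concept names, sketched below in Python. The longest-match rule is an assumption made for illustration (the source does not state how phrases are divided), the function name `code_phrase` is hypothetical, and the small table holds only rows copied from the legend table above.

```python
# Rows copied from the legend table: legend name -> (UMLS code, UMLS name).
CONCEPTS = {
    ("LENTIGINOUS",): ("C0023321", "Lentigo"),
    ("COMPOUND", "NEVUS"): ("C0259781", "Compound Nevus"),
    ("EARLY",): ("C0205085", "Early"),
    ("SIMPLE", "LENTIGO"): ("C0302255", "Lentigo Simplex"),
}
MAX_WORDS = max(len(key) for key in CONCEPTS)


def code_phrase(phrase: str) -> list[tuple[str, str, str]]:
    """Split a keyword phrase into (legend name, code, UMLS name) triples."""
    words, pos, coded = phrase.upper().split(), 0, []
    while pos < len(words):
        # Try the longest candidate first, then shorter ones.
        for size in range(min(MAX_WORDS, len(words) - pos), 0, -1):
            key = tuple(words[pos:pos + size])
            if key in CONCEPTS:
                coded.append((" ".join(key), *CONCEPTS[key]))
                pos += size
                break
        else:
            pos += 1  # no concept starts at this word; skip it
    return coded


for row in code_phrase("LENTIGINOUS COMPOUND NEVUS"):
    print(*row)
```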






10. SAMPLE QUERY: ENTER JAPANESE ROMAJI.


    Click on SUBMIT: The search engine will return a clickable listing of relevant images.

    English. English, Japanese annotations (SJIS)
    English Search Word. Search Word in Japanese.



11. SAMPLE QUERY: SELECT ENGLISH TRANSLATION.


    -------------------------------------------------------------------
    Search Requested at: Sun Oct 10 12:28:28 1999, Greenwich Mean Time.
    Search String Requested: HAI haii は い
    -------------------------------------------------------------------
    To begin a search, make a selection, then click on SUBMIT:

    BACK は い ( 飼 )
    GLUTARALDEHYDE ご る た ら る で は い ど
    HYBRID は い ぼ り ど
    HYBRIDS は い ぼ り ど
    HYDATIDIFORM は い だ ち ぢ ほ る む
    LUNG は い ( 肺 )
    LUNGS は い ( 肺 )
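
The listing above is consistent with a substring search over the kana annotation of each English word: GLUTARALDEHYDE is returned because its annotation contains the query in mid-word. The Python sketch below illustrates that reading. It is an assumption about the behaviour, not the website's PERL code; the name `search_by_kana` is hypothetical, and the lexicon holds only a handful of annotations taken from the sample output in this document, with the spaces between kana removed.

```python
# English word -> kana annotation, taken from sample output in this document.
LEXICON = {
    "BACK": "はい",
    "EARLY": "はゃい",
    "FROM": "から",
    "GLUTARALDEHYDE": "ごるたらるではいど",
    "HYBRID": "はいぼりど",
    "HYBRIDS": "はいぼりど",
    "HYDATIDIFORM": "はいだちぢほるむ",
    "LUNG": "はい",
    "LUNGS": "はい",
}


def search_by_kana(query: str) -> list[str]:
    """Return the English words whose kana annotation contains the query."""
    query = query.replace(" ", "")
    return sorted(word for word, kana in LEXICON.items() if query in kana)


for word in search_by_kana("は い"):
    print(word, LEXICON[word])
```

The user then picks the intended English word from the returned list, which resolves the ambiguity between homophones such as BACK and LUNG.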



12. SAMPLE QUERY: SELECT UMLS TERM.


    -------------------------------------------------------------------
    Search Requested at: Sun Oct 10 12:31:31 1999, Greenwich Mean Time.
    Search String Requested: LUNG は い ( 肺 ) HAI haii は い
    -------------------------------------------------------------------
    Please select the desired UMLS CONCEPT, and click on the SUBMIT button:

    1 C0004144 COLLAPSE LUNG
    C0004144 コルラプス 肺
    2 C0007121 LUNG CANCER BRONCHOGENIC CARCINOMA
    C0007121 肺 癌 ブロンコゼニコ カルシノマ
    3 C0024109 LUNG
    C0024109 肺
    4 C0024115 DISEASE LUNG
    C0024115 ヂジズ 肺
    5 C0024121 LUNG NEOPLASM
    C0024121 肺 ネョプラズム
    6 C0034063 EDEMA LUNG
    C0034063 ェデマ 肺
    7 C0034079 LUNG NODULE
    C0034079 肺 ノヅル
    8 C0149726 LUNG MASS
    C0149726 肺 マッス
    9 C0149782 EPIDERMOID CARCINOMA OF LUNG
    C0149782 ェピデルモィド カルシノマ の 肺
    10 C0149925 CARCINOMA SMALL CELL LUNG
    C0149925 カルシノマ 小 セル 肺
    11 C0152013 ADENOCARCINOMA LUNG
    C0152013 アデノカルシノマ 肺
    12 C0175632 LUNG
    C0175632 肺
    13 C0189381 LAVAGE LUNG
    C0189381 ラバジ 肺
    14 C0206062 DISEASE INTERSTITIAL LUNG
    C0206062 ヂジズ ィンテルスチチャル 肺
    15 C0220651 CANCER METASTATIC TO LUNG
    C0220651 癌 メタスタチコ え 肺
    16 C0229919 LYMPHATICS OF LUNG
    C0229919 リンハチコ の 肺
    17 C0235896 INFILTRATION LUNG
    C0235896 ィンヒルトラション 肺
    18 C0238398 EOSINOPHILIC GRANULOMA OF LUNG
    C0238398 ェ ゴラヌロマ の 肺
    19 C0242379 CA LUNG CANCER
    C0242379 癌 肺 癌
    20 C0242488 ACUTE LUNG INJURIES
    C0242488 アクト 肺 ィンジュリ
    21 C0345167 LUNG CYST
    C0345167 肺 シスト
    22 C0345958 LARGE CELL CARCINOMA OF LUNG
    C0345958 大 セル カルシノマ の 肺






13. SAMPLE QUERY: SELECT AFIP LEGEND TITLE.


    -------------------------------------------------------------------
    Search Requested at: Sun Oct 10 12:34:34 1999, Greenwich Mean Time.
    Search String Requested: C0024109 LUNG は い ( 肺 ) HAI haii は い
    -------------------------------------------------------------------
    Please select the desired LEGEND TITLES, and click on the SUBMIT button:

    1 ###444 METASTATIC SMALL CELL CARCINOMA (LUNG).
    メタスタチコ 小 セル カルシノマ ( 肺 ) .
    2 ###758 ADENOID CYSTIC CARCINOMA.
    アデノィド ぼこの カルシノマ .
    3 ###1328 MENINGIOMA METASTATIC TO LUNG.
    メニンジョマ メタスタチコ え 肺 .
    4 ###1753 METASTATIC CARCINOMA FROM LUNG.
    メタスタチコ カルシノマ から 肺 .
    5 ###1758 METASTATIC CARCINOMA FROM LUNG.
    メタスタチコ カルシノマ から 肺 .
    6 ###1759 METASTATIC CARCINOMA FROM LUNG.
    メタスタチコ カルシノマ から 肺 .
    7 ###2046 METASTATIC SQUAMOUS CELL CARCINOMA.
    メタスタチコ スクェモス セル カルシノマ .
    8 ###2779 UNDIFFERENTIATED CARCINOMA (SMALL CELL NEUROENDOCRINE TYPE).
    ゥヘレ カルシノマ ( 小 セル ノロェンドコリン 個 ) .
    9 ###3919 METASTATIC TUMOR CELLS ON A BLOOD SMEAR.
    メタスタチコ ッモル セル のに A blood SMEAR .
    10 ###3920 METASTATIC TUMOR CELLS ON A BLOOD SMEAR.
    メタスタチコ ッモル セル のに A blood SMEAR .
    11 ###4030 THYROID PAPILLARY CARCINOMA METASTATIC TO LUNG.
    ハィロィド パピルレリ カルシノマ メタスタチコ え 肺 .
    12 ###4066 POORLY DIFFERENTIATED (INSULAR) CARCINOMA.
    ポリ ヂフヘレンチェテド ( ィンスラル ) カルシノマ .
    13 ###4089 METASTATIC UNDIFFERENTIATED THYROID CARCINOMA.
    メタスタチコ ゥヘレ ハィロィド カルシノマ .
    14 ###4091 SEQUENTIAL MORPHOLOGIC VARIATIONS OF THYROID CARCINOMA WITH EVENTUAL TRANSFORMATION TO UNDIFFERENTIATED CARCINOMA.
    セクェンシャル モホロジコ バリ の ハィロィド カルシノマ つきの ェベ トラホ え ゥヘレ カルシノマ .
    15 ###4388 EARLY PSEUDOGLANDULAR PHASE OF LUNG DEVELOPMENT.
    はゃい ドヅラル そ の 肺 デベロ .
    16 ###4389 EARLY PSEUDOGLANDULAR PHASE OF LUNG DEVELOPMENT.
    はゃい ドヅラル そ の 肺 デベロ .
    17 ###4390 EARLY PSEUDOGLANDULAR PHASE OF LUNG DEVELOPMENT.
    はゃい ドヅラル そ の 肺 デベロ .
    18 ###4391 EXTRALOBAR SEQUESTRATION.
    ェコストラロバル セクェストレション .
    19 ###4392 EXTRALOBAR SEQUESTRATION.
    ェコストラロバル セクェストレション .
    20 ###4393 INTRALOBAR SEQUESTRATION.
    ィントラロバル セクェストレション .
    21 ###4394 INTRALOBAR SEQUESTRATION.
    ィントラロバル セクェストレション .
    22 ###4395 INTRALOBAR SEQUESTRATION.
    ィントラロバル セクェストレション .
    23 ###4396 INTRALOBAR SEQUESTRATION.
    ィントラロバル セクェストレション .
    24 ###4397 BRONCHOGENIC CYSTS.
    ブロンコゼニコ シスト .
    25 ###4398 BRONCHOGENIC CYSTS.
    ブロンコゼニコ シスト .
    26 ###4399 INTRATHORACIC BRONCHOGENIC CYST.
    ィントラ ブロンコゼニコ シスト .
    27 ###4400 INTRATHORACIC BRONCHOGENIC CYST.
    ィントラ ブロンコゼニコ シスト .
    28 ###4401 BRONCHOGENIC CYST.
    ブロンコゼニコ シスト .
    29 ###4402 BRONCHOGENIC CYST.
    ブロンコゼニコ シスト .
    30 ###4403 BRONCHOGENIC CYST.
    ブロンコゼニコ シスト .








14. SAMPLE QUERY: VIEW JAPANESE ANNOTATIONS.


    -------------------------------------------------------------------
    Search Requested at: Sun Oct 10 12:37:37 1999, Greenwich Mean Time.
    Search String Requested: ###000444 C0024109 LUNG は い ( 肺 ) HAI haii は い
    -------------------------------------------------------------------
    ###444
    METASTATIC SMALL CELL CARCINOMA (LUNG).
    メタスタチコ 小 セル カルシノマ ( 肺 ) .
    This dermis is infiltrated by irregular islands and cords
    と デルミス IS ィンヒルトラテド に ィ ィ と コルド
    of small malignant cells with scant cytoplasm
    の 小 マリ セル つきの ソカント サィトポラズム
    and an associated desmoplastic stromal response.
    と AN ア デスモプラスチコ ストロマル レセ .
    This histology raises a differential that includes:
    と ヒ ラ A ヂフヘレンチャル THAT ィンコルデス :
    primary neuroendocrine carcinoma, metatypical basal cell carcinoma,
    プリマリ ノロェンドコリン カルシノマ , メタチピカル バサル セル カルシノマ ,
    sclerosing lymphoma, and small cell adnexal carcinoma.
    スコレロシンゴ リンホマ , と 小 セル アドネコサル カルシノマ .
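
The interleaved display above pairs each English line with a word-by-word Japanese annotation, and words without an annotation appear in English capitals (IS, AN, THAT). A minimal Python sketch of that display step follows. It is an illustration only: the function name `annotate` is hypothetical, the small lexicon is copied from the annotated lines above, and treating every unmatched word as upper-case English is an assumption.

```python
# Word-level lexicon copied from the annotated lines above.
LEXICON = {
    "of": "の", "small": "小", "malignant": "マリ", "cells": "セル",
    "with": "つきの", "scant": "ソカント", "cytoplasm": "サィトポラズム",
    "and": "と", "associated": "ア",
}


def annotate(line: str) -> str:
    """Replace each word by its annotation; unknown words stay in upper case."""
    return " ".join(LEXICON.get(word.lower(), word.upper()) for word in line.split())


english = "of small malignant cells with scant cytoplasm"
print(english)
print(annotate(english))
```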



15. RESULTS.




  • UMLS-CODES ASSIGNED TO 5,465 AFIP IMAGE-LEGEND TEXTS.

  • 5,364 DISTINCT WORDS.

  • 3,016 DISTINCT UMLS CONCEPTS.

  • 5,465 OCCURRENCES APIECE OF TWO UMLS CONCEPTS (PATHOLOGY, PHOTOGRAPH).

  • ONE OCCURRENCE APIECE OF 875 UMLS CONCEPTS.

  • OTHER UMLS CONCEPTS ASSIGNED TO MULTIPLE IMAGE-LEGENDS.

  • JAPANESE LEXICON INCLUDES 632 KANJI TERMS.



16. CONCLUSION.




  • ENGLISH IS DOMINANT LANGUAGE OF THE INTERNET.

  • NON-NATIVE ENGLISH SPEAKERS MAY NEED ASSISTANCE.

  • IMAGE ARCHIVE WEBSITE WITH ENGLISH OR JAPANESE QUERY-WORDS.

  • BILINGUAL ANNOTATIONS.



17. REFERENCES.



  • 1. UMLS Knowledge Sources. 9th edition. 1998. DOCUMENTATION. National Institutes of Health. National Library of Medicine. Bethesda, Maryland 20894.

  • 2. College of American Pathologists. Systematized Nomenclature of Human and Veterinary Medicine (SNOMED International). College of American Pathologists, Northfield, IL, 1993.

  • 3. Berman JJ, Moore GW.
    SNOMED-encoded surgical pathology databases: A tool for epidemiologic investigation.
    Mod Pathol. 1996 Sep;9(9):944-950.

  • 4. Silverberg SG.
    SNOMED-encoded surgical pathology databases: 's no big deal - or is it?
    Mod Pathol. 1996 Sep;9(9):953-954.

  • 5. Moore GW, Berman JJ.
    Automatic SNOMED coding.
    Proc Annu Symp Comput Appl Med Care. 1994;18:225-229.

  • 6. Moore GW, Berman JJ.
    Performance analysis of manual and automated systematized nomenclature of medicine (SNOMED) coding.
    Am J Clin Pathol. 1994 Mar;101(3):253-256.

  • 7. Berman JJ, Moore GW.
    Object-oriented controlled-vocabulary translator using TRANSOFT + HyperPAD.
    Proc Annu Symp Comput Appl Med Care. 1991;15:973-975.

  • 8. Berman JJ, Moore GW, Donnelly WH, Massey JK, Craig B.
    A SNOMED analysis of three years accessioned cases (40,124) of a surgical pathology department: implications for pathology-based demographic studies.
    Proc Annu Symp Comput Appl Med Care. 1994;18:188-192.

  • 9. Moore GW, Berman JJ, Hanzlick RL, Buchino JJ, Hutchins GM.
    A prototype Internet autopsy database. 1625 consecutive fetal and neonatal autopsy facesheets spanning 20 years.
    Arch Pathol Lab Med. 1996 Aug;120(8):782-785.

  • 10. Berman JJ, Moore GW, Hutchins GM.
    Internet autopsy database.
    Hum Pathol. 1997 Apr;28(4):393-394.

  • 11. Moore GW, Miller RE, Hutchins GM. Indexing by MeSH titles of natural language pathology phrases identified on first encounter using the barrier word method. In: Scherrer JR, Côté RA, Mandil SH, eds. Computerized Natural Medical Language Processing for Knowledge Representation. Amsterdam: North-Holland; pp 29-39, 1989.

  • 12. Murphy GF, Elder DE. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Non-Melanocytic Tumors of the Skin, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 13. Elder DE, Murphy GF. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Melanocytic Tumors of the Skin, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 14. Murphy WM, Beckwith JB, Farrow GM. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Kidney, Bladder and Related Urinary Structures, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 15. Rosai J, Carcangiu ML, DeLellis RA. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Thyroid Gland, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 16. DeLellis RA. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Parathyroid Gland, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 17. Kurman RJ, Norris HJ, Wilkinson EJ. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Cervix, Vagina, and Vulva, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 18. Silverberg SG, Kurman RJ. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Uterine Corpus and Gestational Trophoblastic Disease, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 19. Rosen PP, Oberman HA. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Mammary Gland, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 20. Burger PC, Scheithauer BW. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Central Nervous System, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 21. McLean IW, Burnier MN, Zimmerman LE, Jakobiec FA. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Eye and Ocular Adnexa, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 22. Colby TV, Koss MN, Travis WD. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Lower Respiratory Tract, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 23. Brunning RD, McKenna RW. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Bone Marrow, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 24. Fechner RE, Mills SE. Armed Forces Institute of Pathology Atlas of Tumor Pathology. Tumors of the Bones and Joints, Electronic Fascicle version 2.0. Washington, D.C. Armed Forces Institute of Pathology.

  • 25. Hadamitzky W, Spahn M. Kanji & Kana. Revised Edition. A Handbook of the Japanese Writing System. Rutland,Vermont: Charles E. Tuttle Co, 1997.

  • 26. Kodansha's Pocket Kanji Guide. Tokyo: Kodansha International, 1994.

  • 27. O'Neill PG. Essential Kanji. 2,000 Basic Japanese Characters, Systematically arranged for learning and reference. New York: Weatherhill, 1973.

  • 28. Table of ON and KUN Readings for Common KANJI Characters. Tokyo: Japan Ministry of Education. 1981.


18. ZIPF DISTRIBUTION: JAPANESE TRANSLATION.



    RANK  FREQUENCY  UMLS CODE  UMLS NAME       JAPANESE
       1       5465  C0030664   PATHOLOGY       病 理
       2       5465  C0441468   PHOTOGRAPH      写 真
       3       2016  C0007634   CELL            細 胞
       4       1812  C0441469   PICTURE
       5       1140  C0027651   NEOPLASM        新 生 物
       6       1102  C0024109   LUNG
       7        644  C0012634   DISEASE         病 気
       8        617  C0030705   PATIENT         患 者
       9        581  C0205165   SMALL
      10        569  C0205234   FOCAL           局 所
      11        549  C0205164   LARGE
      12        528  C0233426   APPEAR
      13        522  C0006141   BREAST          乳 腺
      14        487  C0205397   OBSERVE         観 察 する
      15        466  C0038128   STAIN           染 色
      16        458  C0150312   PRESENT         存 在
      17        421  C0015392   EYE
      18        413  C0445247   SAME
      19        408  C0010834   CYTOPLASM       細 胞 質
      20        407  C0205392   SOME            いくらか
      21        401  C0205182   ATYPICAL        異 型
      22        387  C0022646   KIDNEY
      23        375  C0205402   PROMINENT       著 い
      24        365  C0449774   PATTERN         パターン
      25        347  C0205091   LEFT
      26        341  C0040132   THYROID         甲 状 腺
      27        336  C0205160   NEGATIVE
      28        324  C0205090   RIGHT
      29        323  C0042149   UTERUS          子 宮
      30        320  C0449470   TYPE
      31        316  C0007097   CANCER
      32        300  C0205172   MANY            多 くの
      33        300  C0370003   SPECIMEN        検 体
      34        294  C0014609   EPITHELIUM      上 皮
      35        292  C0262950   BONE
      36        285  C0332285   ARISING FROM
      37        285  C0444186   SMEAR           塗 抹
      38        279  C0005953   BONE MARROW     骨 髄
      39        275  C0017542   GIEMSA STAIN    ギムザ 染 色
      40        272  C0018964   HEMATOXYLIN     ヘマトコシリン
      41        270  C0205428   AFFECTING
      42        269  C0007874   CERVIX          頚 部
      43        268  C0431085   TUMOR CELLS     腫 細
      44        262  C0042591   VESSEL          血 管
      45        258  C0014448   EOSIN           ェョジン
      46        253  C0205250   ELEVATED        隆 起 した
      47        249  C0024264   LYMPHOCYTE      リンパ 球
      48        247  C0205308   OLD             古 い
      49        236  C0439508   YEAR
      50        234  C0392746   WELL            良 い

    FREQUENCY DISTRIBUTION OF 50 MOST FREQUENT UMLS CONCEPTS IN AFIP LEGEND-TEXTS.
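
A rank-frequency table of this kind can be produced by counting, for each UMLS concept, the number of legend-texts in which it appears, and then sorting by descending count. The Python sketch below shows the idea on toy data. The function name `rank_concepts` and the toy input are hypothetical, and counting each concept once per legend is an assumption, suggested by the top frequency being equal to the number of images.

```python
from collections import Counter


def rank_concepts(legend_codes: list[list[str]]) -> list[tuple[int, int, str]]:
    """Return (rank, frequency, code) rows, most frequent concept first."""
    # Assumption: a concept is counted once per legend, however often it recurs.
    counts = Counter(code for codes in legend_codes for code in set(codes))
    # Sort by descending frequency, then by code, so that ties are reproducible.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(rank, freq, code) for rank, (code, freq) in enumerate(ordered, start=1)]


# Hypothetical toy input: three legends, each given as a list of UMLS codes.
toy = [
    ["C0030664", "C0441468", "C0007634"],
    ["C0030664", "C0441468", "C0024109", "C0007634"],
    ["C0030664", "C0441468"],
]
for row in rank_concepts(toy):
    print(*row)
```

Translating concepts in rank order means that a small number of Japanese terms covers a large share of all concept occurrences in the legends.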