y = a + bx + cw + dv + eu + ft + ......where y is an OUTPUT-VALUE, and a, b, c, .... are coefficients. The values of the coefficients are determined by data analysis, most commonly by the LEAST SQUARES METHOD. In a typical regression, there may be a variety of output-values, and some of the input values may be squared, cubed, etc.
y = a + bx.
http://www.medparse.com/whatnett.htm
http://www.medparse.com/whatperl.htmPERL is not necessarily the best language for AI, but it is the cheapest (cost-free), and the most widely available for different systems, including MS-DOS and LINUX. There are both personal-computer and Internet versions of PERL, so that you can prototype a PERL program on your home computer, and then post it publicly on the Internet. Because PERL is cost-free, many computer programmers have contributed to the advancement of PERL, and the result is a very powerful computer language, with many of its serious bugs and deficiencies taken care of. You should have at least PERL VERSION FIVE for AI application programming. On the other hand, because PERL is free, and Bill Gates cannot monopolize it, and there is no Microsoft (R) version of PERL.
ITEM(0) is at location 123456;
ITEM(1) is at location 123457;
ITEM(2) is at location 123458;
ITEM(3) is at location 123459.....
http://www.medparse.com/whatperl.htmPERL is not necessarily the best language for AI, but it is the cheapest (cost-free), and the most widely available for different systems, including MS-DOS and LINUX. Because it is free, many computer programmers have contributed to the advancement of PERL, and the result is a very powerful computer language, with many of its serious bugs and deficiencies taken care of. You should have at least PERL VERSION FIVE for AI application programming.
A SET IS CHARACTERIZED entirely in terms of its members, i.e., EXTENSIONALLY. A set may NOT be uniquely characterized INTENSIONALLY, i.e., by the manner of its creation. If two sets are created differently, but end up with the same membership, then they are the same set. For example, the set of humans living on the moon in 1850 and the set of Chevrolets built during 1850 are the same set, namely, the null set, Ø.Smith believes that this classical formulation for sets is fundamentally flawed for describing ontologies, and has proposed using an alternate formulation, known as MEREOLOGY.
A SET IS DIFFERENT FROM WHAT IT CONTAINS. that is, s and {s} are different. This property of sets leads to the Russell-Frege paradox: does the set of all sets belong to itself or not? (Is Epimenides a liar or not? (see Appendix C)). The Russell-Frege paradox can be resolved by defining two types of sets: ordinary sets and classes. This double definition involves a lot of extra mathematical bookkeeping.
1. Set membership, denotedDEFAULT LOGIC. Production rules of the form: If A is true and B cannot be proved, then C is true. For example:C. We say that s is a member of S, denotedCS; or s is not a member of S, denoted ~CS;
2. The empty set, Ø, the set containing no members. There exists no s such that sCØ.
3. Set Union, U. The set A U B is the set of all members that belong either to set A or to set B or to both. Set union is analogous to inclusive-or (IOR) in first order propositional logic.
4. Set Intersection, /\. The set A /\ B is the set of all members that belong both to set A and to set B. Set intersection is analogous to logical AND in first order propositional logic.
5. Set Subtraction, -. The set A - B is the set of all members that belong to set A but NOT to set B. Set intersection is analogous to logical NOT in first order propositional logic.
6. Subset. ACB. We say that the set ACB, if the set of all members that belong to set A also belong to set B. Set intersection is analogous to IMPLIES in first order propositional logic.
1. All crows can fly.Obviously, statement #1 is not true all the time, but in many cases, one must REASON UNDER UNCERTAINTY, and some conclusion, even one that is occasionally wrong, is better than no conclusion. In default, or non-monotonic logic, one must have a contingency plan for rescuing the system when it encounters such predictable but unusual errors. In classical logic, the entire system collapses, which is usually not an acceptable result.
2. Jack is a crow.
3. What if Jack has a broken wing?
3+5×7 = 3+(5×7) = 38.whereas
(3+5)×7 = 56.In parenthesis-free notation, the operator is placed BEFORE NOT BETWEEN the two terms that it modifies. Therefore:
+3 × 5 7 = 38.In either case, no parentheses are needed. In fact, in Polish logic notation, an arbitrarily complex expression can always be expressed without parentheses. This notation is a great convenience in certain proofs and demonstrations in logic. The notation seems a little bit bizarre and unintuitive at first, but one can eventually become comfortable with it.
× +3 5 7 = 56.
x is NECESSARILY-TRUE if and only if x is NOT-NECESSARILY-NOT-TRUE
The following legal concepts (collocations), harvested by the barrier word method, are familiar to every adult U. S. citizen with a reasonable knowledge of our national values:
AMENDMENT 1 . CONGRESS shall make no LAW respecting an ESTABLISHMENT of RELIGION, or PROHIBITING the FREE EXERCISE thereof; or ABRIDGING the FREEDOM of SPEECH, or of the PRESS; or the RIGHT of the PEOPLE peaceably to ASSEMBLE, and to PETITION the GOVERNMENT for a REDRESS of GRIEVANCES.
AMENDMENT 2 . A well REGULATED MILITIA , being NECESSARY to the SECURITY of a FREE STATE , the RIGHT of the PEOPLE to keep and bear ARMS , shall not be INFRINGED.
AMENDMENT 3 . No SOLDIER shall, in TIME of PEACE be QUARTERED in any HOUSE , without the CONSENT of the OWNER, nor in TIME of WAR , but in a MANNER to be PRESCRIBED by LAW .
AMENDMENT 4 . the RIGHT of the PEOPLE to be SECURE in their PERSONS , HOUSES , PAPERS , and EFFECTS , against UNREASONABLE SEARCHES and SEIZURES , shall not be VIOLATED , and no WARRANTS shall ISSUE , but upon PROBABLE CAUSE , supported by OATH or AFFIRMATION , and particularly describing the PLACE to be searched , and the PERSONS or things to be seized .
AMENDMENT 5 . no PERSON shall be held to ANSWER for a CAPITAL , or otherwise INFAMOUS CRIME , unless on a PRESENTMENT or INDICTMENT of a GRAND JURY , except in cases arising in the LAND or NAVAL FORCES , or in the MILITIA , when in ACTUAL SERVICE in time of WAR or PUBLIC DANGER ; nor shall any PERSON be SUBJECT for the same OFFENCE to be twice put in JEOPARDY of LIFE or LIMB ; nor shall be COMPELLED in any CRIMINAL CASE to be a WITNESS against himself , nor be deprived of LIFE , LIBERTY , or PROPERTY , without DUE PROCESS of LAW ; nor shall PRIVATE PROPERTY be taken for PUBLIC USE , without JUST COMPENSATION .
AMENDMENT 6 . In all CRIMINAL PROSECUTIONS , the ACCUSED shall ENJOY the RIGHT to a SPEEDY and PUBLIC TRIAL , by an IMPARTIAL JURY of the STATE and DISTRICT wherein the CRIME shall have been COMMITTED , which DISTRICT shall have been previously ascertained by LAW , and to be informed of the NATURE and CAUSE of the ACCUSATION ; to be confronted with the WITNESSES against him ; to have COMPULSORY PROCESS for obtaining WITNESSES in his FAVOR , and to have the ASSISTANCE of COUNSEL for his DEFENCE .
AMENDMENT 7 . in SUITS at COMMON LAW , where the VALUE in CONTROVERSY shall exceed twenty DOLLARS , the RIGHT of TRIAL by JURY shall be PRESERVED, and no FACT tried by a JURY , shall be otherwise REEXAMINED in any COURT of the UNITED STATES , than according to the RULES of the COMMON LAW .
AMENDMENT 8 . EXCESSIVE BAIL shall not be REQUIRED , nor EXCESSIVE FINES imposed , nor CRUEL and UNUSUAL PUNISHMENTS inflicted .
AMENDMENT 9 . the ENUMERATION in the CONSTITUTION , of CERTAIN RIGHTS , shall not be CONSTRUED to DENY or DISPARAGE others RETAINED by the PEOPLE .
AMENDMENT 10 . the POWERS not DELEGATED to the UNITED STATES by the CONSTITUTION , nor PROHIBITED by it to the STATES , are RESERVED to the STATES respectively, or to the PEOPLE .
FREE EXERCISE
REGULATED MILITIA
FREE STATE
UNREASONABLE SEARCHES
PROBABLE CAUSE
INFAMOUS CRIME
GRAND JURY
NAVAL FORCES
ACTUAL SERVICE
PUBLIC DANGER
CRIMINAL CASE
DUE PROCESS
PRIVATE PROPERTY
PUBLIC USE
JUST COMPENSATION
CRIMINAL PROSECUTIONS
PUBLIC TRIAL
IMPARTIAL JURY
COMPULSORY PROCESS
UNITED STATES
COMMON LAW
EXCESSIVE BAIL
EXCESSIVE FINES
UNUSUAL PUNISHMENTS
CERTAIN RIGHTS
UNITED STATES
http://www.netautopsy.org/vhpsapsx.htmThe German-language Zipf Distribution for the GOETHE UNIVERSITY AUTOPSY REGISTER is shown at URL:
http://www.medparse.com/guarzipf.htmZIPF'S LAW formalizes the observation that the high-frequency words in a large free-text document, or text-corpus, are extremely common (Zipf, 1949; Mandelbrot, 1954; Fedorowicz, 1982; Giere, 1981; Zhang, 1981; Moore et al, 1988; Moore et al, 1989). The frequency of words in a free-text document may be ranked, with rank=1 for the most frequent word, rank=2 for the second-most frequent word, etc. According to Zipf's Law, word-rank is inversely proportional to word-frequency. That is, for some constant, k, r = k/f, for r=rank and f=frequency. Thus in a large text-corpus (over a million words), approximately one hundred barrier words (ranks 1 through 100) account for over half the word-occurrences in the document.
Limx -> c f(x) = L.to denote the
L = LIMIT OF f(x) AS x APPROACHES c.This means that the function, f(x), gets closer-and-closer to L, as x gets closer-and-closer to c. As long as x-getting-closer-and-closer-to-c doesn't involve dividing by zero, then typically L = f(c). The limit process gets much tricker if a division by zero (or division by a very small number) is involved in calculating L. Unfortunately, taking a derivative involves just such a process.
Limx -> infinity f(x) = L.denotes the
L = LIMIT OF f(x) AS x APPROACHES INFINITY.
L = LIMIT OF f(x) AS x APPROACHES cmeans that for every epsilon>0, and for every |x-c|<epsilon, there exists a delta>0 such that |f(x)-L|<delta. The analogous Weierstrass definition for x approaching infinity is:
Limx -> infinity f(x) = LIMIT OF f(x) AS x APPROACHES INFINITY.means that for every N>0, and for every x>N, there exists a delta>0 such that |f(x)-L|<delta. Proofs in calculus are then reduced tofinding a formula for calculating delta, given a value of epsilon or N.
x × y = z.For y=0, the answer is in two parts: IF y=0 AND z~=0, THEN THERE IS NO SUCH x. On the other hand, IF y=0 AND z=0, THEN EVERY POSSIBLE x SATISFIES THE MULTIPLICATION (Seife, 2000). Thus, z/0 is NONEXISTENT for z~=0; whereas 0/0 is ANYTHING. Unfortunately, taking a derivative in elementary calculus involves getting perilously close to division by zero (vide infra). This single fact is why reasoning in calculus is fundamentally more difficult than reasoning in algebra. In calculus, you must ALWAYS PAY CLOSE ATTENTION!
Limh->0(f(x+h)-f(x))/h = L.Think of a curve, f(x), and nudge x forward by a small amount, h. Then (f(x+h)-f(x)) is the amount that f() moves upward, while x advances by h. As h approaches zero, the (f(x+h)-f(x))/h becomes the slope of f(x).
L = Limh->0((x+h)2-x2)/h.or:
L = Limh->0(x2+2xh+h2-x2)/h.or:
L = Limh->0(2x+h).Now, let h=0 and complete the calculation, so that L=2x.
SAMPLE EXPERIMENT.
ROLE OF STATISTICS IN EVALUATING RESEARCH RESULTS.
PROBLEM OF VARIABILITY.
POPULATION VERSUS SAMPLES.
HYPOTHESIS TESTING.
DECISION THEORY.
ANALYSIS OF VARIANCE.
CONTINGENCY TABLES.
CORRELATION.
MULTIVARIATE ANALYSIS.
log22 = 1;For SEEKING from a list of 32 objects, one is talking about the difference between n=32 steps (linear) and log232 = 5 steps (logarithmic). For seeking from a list of 300,000,000 social security numbers, from a list of 32 objects, one is talking about the difference between n=300,000,000 steps (linear) and log2536,870,912 = 29 steps (logarithmic). Obviously, for large, practical problems, computational complexity methods can assume great importance.
log24 = 2;
log28 = 3;
log216 = 4;
log232 = 5....
(1) LOG, soluble in LOGn steps. For example, if you wish to FETCH a single record from a SORTED LIST of n records, then this can be accomplished in LOGn steps. The easiest way to understand this concept is to consider LOG2n steps, i.e., logarithm-to-the-base-2. Now suppose that you have a sorted list of 16 names, and you are looking for HARRY. In the first step, split the list in half. Harry belongs to the first half-list.Unfortunately, there are many interesting problems in AI, involving decision trees and routing diagrams, that are NP Complete. Although the status of the NP Complete problem is currently uncertain, despite the depradations of numerous brilliant and determined mathematicians over the past several decades, even if it is solved and turns out to be exponential, then this solution offers only despair to hopeful AIers.In the second step, split the remaining list in half. Harry belongs to the second half-list.Abner Bill Charley David Edward Frank George Harry _________________ Ike John Karl Larry Milton Norman Oliver PatrickIn the third step, split the remaining list in half. Harry belongs to the second half-list.Abner Bill Charley David _________________ Edward Frank George HarryIn the fourth step, split the remaining list in half. Harry belongs to the second half-list.Edward Frank _________________ George HarryThat is, LOG216 = 4 steps, compared to 16 steps for an unsorted list.George _________________ Harry
If you think that it's silly to go to all this mathematical sophistication to fetch from sorted lists, consider this: the U. S. Social Security Administration has a list of some 300,000,000 social security numbers, that it must, in principle, be able to fetch. Since 228=268,435,456 and 229=536,870,912, we are talking about the difference between 29 steps and 300,000,000 steps. Suppose that the computer software can perform 1000 steps per second. Then we are talking about the difference between 0.03 seconds and 300,000 seconds = 5,000 minutes = 83 hours = 3.5 days.
Likewise, a clinical pathology laboratory in a moderate-sized community hospital (250 beds) performs about 1,000,000 laboratory tests annually, not to mention a total patient census of, perhaps, 300,000, plus inpatient-admission-dates, encounter-dates, outpatient-visit-dates,..... Since 210=1,048,576, we are talking about the difference between 10 steps and 1,000,000 steps, 0.01 seconds and 1,000 seconds= 16 minutes. Perhaps not much, but these resources add up quickly, and there is no excuse for wasting resources when good solutions are already known.
(2) LINEAR, soluble in n steps. For example, if you wish to FETCH a single record from an UNSORTED LIST of n records, then this can be accomplished in n steps. In the worst case, the algorithm must examine every single record before it hits the desired record.
(3) LOG-LINEAR, soluble in nLOGn steps. For example, if you wish to SORT a LIST of n records, all the quality algorithms (QUICKSORT, HEAPSORT, etc.) employ a variant of the list-division method described above.
(3) QUADRATIC, soluble in n2 steps.
(4) POLYNOMIAL, soluble in nk steps, where k is a fixed integer, hopefully less than 4 or 5.
(5) NP-COMPLETE, mathematically uncertain, but at best polynomial and at worst exponential. There are many interesting problems in AI involving decision trees and routing diagrams, that fall into this category.
(6) EXPONENTIAL, soluble in 2n steps.
(7) SUPER-EXPONENTIAL, soluble only after greater than 2n steps. Includes Pressburger algebra.
(8) INSOLUBLE, not soluble in an infinite number of steps. Includes such Gödel-unprovable statements as: the consistency of ordinary arithmetic; the Generalized Continuum Hypothesis; and the Axiom of Choice.
mod120 = 0
mod121 = 1
mod122 = 2
mod123 = 3 .........
mod1212 = 0
mod1213 = 1
mod1214 = 2
mod1215 = 3
Inference and Reasoning.
Knowledge Representation.
Logic Programming.
Natural Language
Programming Languages.
Game Theory.
Pattern recognition, Image processing.
Robotics.
Search.
Theorem Proving.
Vision.
MR._JONES IS_A MALE.
MR._JONES HAS_AN ENLARGED_LIVER.
ENLARGED_LIVER HAS_DIFFERENTIAL_DIAGNOSIS METASTATIC_TUMOR, HEPATOCELLULAR_CARCINOMA, AMYLOIDOSIS,....
Required probabilistic independence of component probabilities. In many cases, there is a medical relationship between different components (parts) of a system of diseases.
Required knowledge of component probabilities. The probabilities for most diseases are unknown.
Requirement that all possible disease-categories be known in advance. There is no mechanism for DETECTING OR ADDING A NEW DISEASE, if the existing list of diseases appears unlikely.
where []=null-sentence, Nx=noun-phrase, N=noun, P=preposition, A=adjective, etc. In this simple BNF model, the null-sentence, [], points to a noun-phrase (Nx) (expression 1); and the noun-phrase, in turn, points to one of three choices: noun-only (N) (expression 2); adjective-noun (AN) (expression 3); or noun-preposition-noun (NPN) (expression 4). The term to the left of ==> is called the "argument" the term to the right of ==> is called the "value." This simplified BNF supports surgical pathology diagnoses such as "hemangioma" (=N, expression 2); "actinic keratosis" (=AN, expression 3); or "adenocarcinoma of colon" (=NPN, expression 4). Perhaps a majority of surgical pathology diagnoses, particularly in a pathology practice with predominantly biopsy specimens, can be parsed with such a simple BNF model. However, many surgical pathology reports, especially in academic institutions with large resection specimens, are much more complex than would be suggested by this simple model. The encoder Perl script supports a BNF grammar of arbitrary size and complexity.1. [] ==> [Nx] 2. [Nx] ==> [N] 3. [Nx] ==> [AN] 4. [Nx] ==> [NPN]
2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, .....Then, for example, the mathematical statement "not x and y" is Gödelized as:
2not×3x×5and×7yThat is:
21×33×52×74 = 3,241,350This plan for Gödelization was introduced in 1927 at a talk attended by Prof. John von Neumann, who used elements of this design for the von Neumann computer (ENIAC).
2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, .....It has been suggested, by nineteenth century German mathematician Ludwig Kroneker, that prime numbers are the FUNDAMENTAL BUILDING BLOCKS of all mathematics. In the motion picture, CONTACT, starring Jodie Foster, the artificial language used to make contact with extra-terrestrial intelligence was built up from prime numbers. Euclid's FUNDAMENTAL THEOREM OF NUMBER THEORY states that every whole number is a unique, ordered product of powers of prime numbers. Prime numbers form the basis for most modern ENCRYPTION ALGORITHMS used in electronic financial systems.
SMALL BLUE CELL TUMOR:Most of these diagnoses are absolutely specific to a certain bodysite or clinical history, but they cannot be distinguished by microscopic findings alone. For example, retinoblastoma always occurs in the eye of children; medulloblastoma always occurs in the brainstem of children; Merkel Cell carcinoma of skin begins in the skin, but may metastasize to other locations in the body. If a surgeon submits a small blue cell tumor to the pathologist for diagnosis, the surgeon doesn't want the 25-choice grocery list of possibilities. The surgeon wants the single choice that is most consistent with the clinical findings.
Small cell carcinoma of lung.
Merkel Cell carcinoma of skin.
Small cell adenocarcinoma of prostate.
Small cell carcinoma of larynx.
Medulloblastoma.
Esthesioneuroblastoma.
Retinoblastoma.
PAINLESS, SUBCUTANEOUS METACARPAL-PHALANGEAL JOINT MASS.
Ganglion cyst.
1. A robot shall never harm a human being, nor through his inaction, allow a human being to come to harm.Later in his life, Asimov found many problems with these laws, perhaps the most serious being the conflict between #2 and #3. Asimov envisioned highly complex robots with hundreds of years of experience. If a malicious human told a robot to harm itself, the robot would be obliged to do so, or else be in conflict with #2.
2. A robot shall always obey an order given by a human being, except as it conflicts with #1.
3. A robot shall attempt to protect himself, except as it conflicts with #1 or #2.
1. More computer processing power. Computers today cannot even capture and digest images in hours, that humans can work with in a matter of seconds. Example: microscopic slides in human anatomic pathology.
2. Some computer problems are insoluble, no matter what. For example: trisecting the angle with straightedge and compass (Euclid). finding a universal formula for quintic polynomials and above (Galois). the consistency of ordinary arithmetic (Gödel). The Axiom of Choice and the Generalized Continuum Hypothesis in ordinary (Zermelo-Fraenkel) set theory (Gödel, Cohen).
3. Some of these computer-insoluble problems may be irrelevant in the future. Who cares if you can't trisect an angle with straightedge and compass, when you can get as close to a trisection as you want, using calculus and limit theory? Same with finding the roots of quintic polynomials.