Article Text


Molecular mimicry in autoimmune disease
  1. Susan Leech
  1. Department of Child Health, King’s College School of Medicine and Dentistry, Bessemer Road, London SE5 9PJ, UK
  1. Dr Leech.

Statistics from

The origins of autoimmune disease are multifactorial. Environmental factors and a genetic predisposition result in tissue injury caused by autoreactive T cells or antibodies. Usually a single organ or individual cell type is affected in the absence of gross abnormalities of the immune system. Autoimmune diseases tend to have long, asymptomatic prodromal periods and the initiating events leading to loss of self tolerance occur long before the disease becomes clinically manifest. This makes the initiating factors harder to identify and they remain largely unknown in humans.

Key messages

  • Molecular mimicry is a mechanism by which immunological self tolerance can be broken, leading to autoimmune disease

  • Three way molecular mimicry has been shown to exist between:

    the MHC acting via T cell selection
    foreign antigens that upregulate antigen presentation and provide co-stimulation
    the host target antigen, recognised by self reactive T cells, driving autoimmune disease
  • Molecular mimicry provides a way of intervening against and preventing autoimmune disease

Several different pathological processes have the potential to break tolerance and cause autoimmune disease. Antigenic similarity between pathogenic organisms or foreign proteins and self proteins (molecular mimicry) is one of them.

The major histocompatibility complex (MHC) is a collection of genes on chromosome 6 that codes for the human leucocyte antigens (HLA). These are glycoproteins expressed on the surface of cells that bind short peptides, degraded or generated by the cell, and present them to T lymphocytes (figs 1 and 2). The term “molecular mimicry” was used in the 1970s to explain persistent viral infections. It was suggested that the MHC and viruses encoded similar peptide sequences, which allowed the host to regard an infecting virus as “self” and forego an immune response. More recently, it has been used as a hypothesis to explain autoimmune disease.1 Several pathogens share antigenic determinants with host proteins. Often, these are used to gain entry into the cell; rhinovirus binds to an adhesion molecule, ICAM-1, on epithelial cells,2 and human immunodeficiency virus (HIV) binds to CD4,3 and enters the cell using a chemokine receptor. Most infections result in a specific immune response against the infecting organism. Cross reactivity may occur between the clones of T and B lymphocytes generated against an infecting agent and a host protein sequence (immunological crossreactivity). This “hit and run” mechanism could be induced to persist long after the pathogen has disappeared, by the presence of autoantigens driving the immune response.

Figure 1

MHC class I: HLA A, HLA B, HLA C—for example, HLA B27. Expressed on the surface of virtually all cells except mature erythrocytes and trophoblast cells. Presents antigen (processed peptides) to CD8 T cells. Comprises an α polypeptide chain and β2 microglobulin (invariant).

Figure 2

MHC class II: HLA DP, HLA DQ, HLA DR—for example, HLA DR3. Constitutively expressed on the surface of macrophages, monocytes, dendritic cells, and B lymphocytes. Presents antigen (processed peptides) to CD4 T cells. Comprises an α and β polypeptide chain.

How does molecular mimicry occur?

During T cell development, primitive cells leave the bone marrow and enter the thymus (fig 3). During the subsequent three weeks they develop a T cell receptor (containing CD3) on the cell surface along with an accessory molecule; either CD4 or CD8. The primitive T cells undergo both positive selection, which selects cells able to interact usefully with peptide presented by MHC, and negative selection, which deletes cells that are strongly reactive with self or are unable to interact with presented peptide antigens. Positive and negative selection occur using MHC present on cells within the thymus; it is postulated that much of the peptide presented by this MHC is derived from MHC itself.4 The process of thymic selection eliminates 99% of precursor cells, which undergo apoptosis (programmed cell death), leaving only 1% to reach the periphery. The T cell receptor repertoire generated allows a certain degree of self reactivity in return for a reasonably comprehensive coverage of foreign antigens.

Figure 3

T cell development and thymic education.

If a foreign peptide (such as a virus) resembles these MHC derived peptides, it has the potential to activate such T cells. If a self antigen is also structurally similar, exposure to the foreign peptide results in these activated T cells becoming autoreactive.

“Three way molecular mimicry” is thus said to exist between peptide sequences of three different origins:

the MHC, which is critically involved in thymic T cell selection
foreign antigens (such as bacteria, viruses, or food), which upregulate antigen presentation and provide co-stimulation
self or target antigens, which are recognised by self reactive T cells (fig 4).
Figure 4

Three way molecular mimicry.

 Identification of the mechanism of molecular mimicry fuses the influences of genetics and the environment into a single pathogenic process and is a powerful hypothesis in diseases where both can be shown, epidemiologically, to have an influence.

Ankylosing spondylitis

For example, in diseases with a genetic predisposition, the contribution of environmental factors to the pathogenesis can be estimated by the incidence of the disease in identical twins. The drop in concordance from 100% is an index of the size of the influence of environmental factors. Ninety five per cent of patients with ankylosing spondylitis carry the HLA B27 allele compared with 7% of the general population. Fifty per cent discordance in monozygotic twins implies a significant environmental factor. It was suggested that this might be a result of mimicry between HLA B27 and a microorganism. Several microbial agents (klebsiella, yersinia, and shigella) were epidemiologically associated with the disease. In 1979, cross reactivity between HLA B27 positive lymphocytes and a klebsiella antigen was reported.5 The association has now been linked to the presence of an amino acid sequence, QTDRED (table 1), in the nitrogenase protein of klebsiella and in HLA B27 (amino acids 72–77).6

Table 1

Abbreviations used for amino acids

Insulin dependent diabetes

In humans the HLA associations of insulin dependent diabetes mellitus (IDDM) are not absolute and environmental factors are important. Early work suggested an association of the MHC class I molecules, HLA B8 and HLA B15, with diabetes. Later, stronger associations were found with the class II molecules HLA DR3 and HLA DR4. The class II associations were assumed to be a result of the B8−DR3 and B15−DR4 linkage disequilibrium (that is, that these alleles appear together on the same chromosome more often than their single gene frequencies suggest). The DR3−DR4 heterozygote had a greater risk and HLA DR2 was protective. Identical twins have a relatively low (45%) concordance rate for IDDM. There is strong evidence for an environmental trigger such as coxsackie virus, rubella, and Mycoplasma pneumoniae. Diabetes occurs in up to 20% of patients following congenital rubella.7 Three way molecular mimicry exists between a peptide of carboxypeptidase H (an enzyme expressed within islet cell granules and an autoantigen) and a peptide of the β chain of HLA DQ, which is processed and presented by HLA DR4,8 and peptides of coxsackie virus coat protein. Another sequence in the β chain of HLA DQ resembles the islet cell autoantigen jun B (a nuclear transcription factor against which T cell autoreactivity has been demonstrated),9 and the islet cell antigen, ICA 512,10 has similarities with peptides in human herpesvirus 1 (herpes simplex virus 1) and 4 (Epstein-Barr virus).


In the 1970s, epidemiological studies suggested an association between diabetes and viral infections. Several studies demonstrated raised titres of both IgM and IgG antibodies to coxsackie virus and later to coxsackie B4 in newly diagnosed type 1 diabetics. Initially, the association of coxsackie virus was thought to be a result of its tropism for β cells,11 which might cause damage to these cells and initiate autoimmunity. More recently, autoimmune reactivity of T cells to the β cell enzyme glutamic acid decarboxylase (GAD; an enzyme that catalyses glutamic acid to γ aminobutyric acid (GABA)) has been shown to be important in the pathogenesis of diabetes.12 The major peptide determinant of GAD recognised by patients with diabetes has a significant sequence similarity to a 15 amino acid sequence within the P2-C protein of coxsackie B virus. Antibodies to the P2-C protein cross react with GAD 65 and vice versa, suggesting that molecular mimicry might be responsible for the disease.13

Finland has one of the highest incidences of diabetes in the world (0.6%). In 1995, as part of Akerblom and Tuomilehto’s childhood diabetes in Finland study group, a prospective study suggested antenatal exposure to coxsackie enterovirus increased the risk of diabetes in the child (particularly if onset was before 3 years of age).14 Serologically verified enterovirus infections were twice as common in siblings who developed clinical diabetes compared with siblings who remained non-diabetic, several years before diagnosis. More recently, these antibodies have been shown to be generated in healthy children who then went on to develop diabetes (although they were also present in non-diabetic controls),15 and are higher in siblings of diabetic patients who then become islet cell antibody positive (particularly if they express the high risk HLA DQB1 genotype).16


There is epidemiological evidence that a short breast feeding time and early ingestion of cows’ milk is a risk factor for diabetes.17 In cows’ milk, the proportion of protein as casein is higher (85%) than in human milk (25%); a large proportion of this is β casein. Thirty seven per cent of patients with diabetes have autoantibodies to β casein at diagnosis. In about 50% of patients with recent onset diabetes, their lymphocytes show an enhanced proliferative response to β casein, which is virtually absent in normal controls and patients with autoimmune thyroid disease.18 Sequence differences between human and bovine β casein could be responsible for the generation of an autoimmune response if milk proteins are introduced within the first weeks of life when foreign proteins can stimulate the immune system.19Enhanced cellular and humoral immune responses have been reported with other components of cows’ milk, such as β lactoglobulin and bovine serum albumin. That multiple cows’ milk proteins generate an immune response in patients with diabetes may suggest a disturbance in gut mucosal immunity in presentation of oral antigens.

Several sequence homologies between bovine β casein and peptides expressed by pancreatic β cells have been identified, including the islet cell antigen, ICAp69,20 and carboxypeptidase H; both putative autoantigens in diabetes. The sequence in position 63–67 of β casein corresponds to a region of variation between the bovine and human forms, and the bovine form is identical to residues 415–419 of the β cell specific glucose transporter, GLUT-2. Autoantibodies to GLUT-2 have been described in 77% of patients with recent onset IDDM.21

Currently, the childhood diabetes in Finland study group is screening infants at birth and half of those with a high risk genotype (HLA DQ) are given a casein hydrolysate formula as supplementary milk and a diet free from dairy products and beef for the first 6–8 months of life. The other half will be given a normal diet. The children are to be followed up to 10 years of age and a reduction in the incidence of diabetes is expected to be seen in the dietary intervention group.

Adenovirus 12 in coeliac disease

Seventy per cent of monozygotic twins are concordant for coeliac disease, suggesting an environmental factor is also involved in the pathogenesis. In 1987, Kagnoff and colleagues22 found that 89% of untreated coeliac patients had antibodies to adenovirus 12, and that these antibodies were also raised in treated children with coeliac disease (31%) compared with controls (0–13%). Antibodies to adenovirus 18 or echovirus 11 were not raised. A similar amino acid sequence between A-gliadin and the E1b protein of adenovirus 12 was reported, and this sequence was found to be an antigenic determinant in active coeliac disease. This suggested that the viral protein might play a role in the pathogenesis of coeliac disease by virtue of immunological cross reactivity. In 1991, Mantzaris and Jewell23 demonstrated that intraduodenal insertion of the peptide produced mononuclear cell infiltration and microscopic changes that were not seen in controls. However, later studies using monoclonal antibodies against the epitope sequence24 and studies looking for persistent adenovirus DNA25 26 in small intestinal biopsies have failed to show evidence of an association.

It is possible that infection with adenovirus sensitises the genetically predisposed host to A-gliadin by molecular mimicry, with resultant gluten sensitive enteropathy, using the hit and run mechanism.

Clinical developments

Molecular mimicry plays a major role in the initiation of common autoimmune diseases. Control of the initiating factor (for example, by vaccination against coxsackie virus protein P2-C) or strategies to avoid contact with the environmental trigger (for example β casein in infant formula) in an at risk population could play an important role in reducing the incidence of the disease and both strategies deserve further attention.

In diabetes, other immune interventions have recently been tried with varying success. Treatment with immunosuppressive drugs, such as cyclosporin A and azathioprine, after clinical diagnosis is ineffective because 80–90% of the β cell mass is already destroyed at onset of the disease. Oral exposure to antigen at the time of sensitisation might direct the immune response from type 1 (cytotoxic) to type 2 (humoral), protecting against autoimmune disease. The American multicentre diabetes prevention trial is looking at the effect of immunomodulation with insulin in early diabetes.27 BCG vaccination, if given in the prediabetic state or shortly after induction of diabetes is protective in mice. It possibly acts via a switch from a type 1 to a type 2 response. However, epidemiological data failed to support a sustained protective role for BCG vaccination against juvenile onset IDDM.28 Nicotinamide inhibits the generation of free radicals and nitric oxide and increases the intracellular NAD pool and energy supply of the cell. To test the preventive effect of nicotinamide against IDDM in high risk individuals, a prospective, randomised, placebo controlled, multicentre study (European nicotinamide diabetes intervention trial; ENDIT) was started in 1993.

The identification of high risk haplotypes HLA DQA1*0301 and DQB1*0302, and an increased risk of diabetes in first degree relatives with antibodies to islet cells, insulin, and GAD has already opened up possibilities for prevention and immunomodulation. Once the sensitivity and specificity of screening tests are improved to allow for population screening, prevention will become practical for the general population. Primary prevention of IDDM by vaccination or exposure prophylaxis, acting via the prevention of the induction of autoimmune disease by molecular mimicry, becomes an exciting possibility.


I acknowledge the helpful comments of Drs J Rennie, C Buchanan, J Mahr, and Professors G Mieli-Vergani and J Price during the preparation of this review.

View Abstract

Request permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.