The Influence of Ecological Factors on the Distribution and the Genetic Structure of <i>Escherichia coli</i>
David M. Gordon
[Section Editor: Monica Riley]
Posted February 27, 2004
School of Botany and Zoology, Australian National University, Canberra, ACT 0200, Australia
Phone: 61 2 6125 3552, Fax: 61 2 6125 5573, E-mail:
This e-mail address is being protected from spambots. You need JavaScript enabled to view it
Since the first print version of Escherichia coli and Salmonella typhimurium: Cellular and Molecular Biology in 1987, the development of new molecular techniques and the decline in cost of most molecular methods have resulted in a rapidly expanding database covering the genetic structures of bacterial populations. Consequently, while in 1987 a single chapter sufficed to describe the genetic structure of Escherichia coli, our knowledge base has grown to such an extent that several chapters are now required.
This chapter focuses on recent data concerning the ecological factors that determine the distribution of E. coli and the genetic structures of naturally occurring E. coli populations. Much of the information presented is descriptive. It will become clear that ecological factors play a significant role in shaping the genetic structures of E. coli populations. However, we have little understanding of the precise nature of the selective forces involved and even less understanding of the natures of the adaptations that enable E. coli to persist in the variety of habitats in which it is found.
The characteristics and evolution of the various pathogenic lineages of E. coli are described elsewhere in the EcoSal website. However, some references to the pathogenic E. coli isolates are made in this chapter because the ecological factors that have been found to affect the distribution and genetic structure of commensal E. coli are relevant to our understanding of the epidemiologies of the various E. coli pathotypes. Consequently, this chapter summarizes some of the older literature concerning the dynamics of E. coli populations within a host and poses some questions that arise from our more recently acquired understanding of the factors affecting the genetic structures of E. coli populations.
Gordon and Cowling (19) reported the results of a survey of >2,300 nondomesticated vertebrates living in Australia. The authors defined the prevalence of E. coli as the fraction of hosts in whose feces E. coli could be detected. The method of isolation was such that E. coli would be detected if it represented, on average, at least 1% of the bacterial-cell population capable of growth on a MacConkey agar plate.
E. coli was found in 56% of the mammalian hosts examined. Mammal species live in different climates, and their body masses (log10) and diets differ. Each of these factors was found to be a statistically significant predictor of the presence of E. coli in an individual host. Overall, E. coli was unlikely to be isolated from hosts living in the desert, while hosts living in the tropics were less likely to harbor E. coli than hosts living in temperate or grassland environments. In general, E. coli is less likely to be isolated from carnivores than from omnivores or herbivores. The likelihood of isolating E. coli from a mammal increases with the body mass (log10) of the host (Fig. 1).
E. coli was detected in 23% of the birds examined. A significant number of the birds were collected from localities where they lived in close association with humans, while others were collected from localities without significant levels of human habitation. Diet varies substantially among species, and although not as extensive as in mammals, interspecies variation in body mass is considerable. Human association, diet, and log10 body mass were found to be significant predictors of the prevalence of E. coli in birds. Birds living in close association with humans were approximately twice as likely to harbor E. coli than birds living away from human habitation. The probability of a host harboring E. coli increased with its body mass. The prevalence of E. coli was lowest in exclusively seed- or fruit-eating species and higher in those species that include nectar, insects, or vertebrates in their diet.
In general, E. coli was unlikely to be detected in ectothermic vertebrates. E. coli was isolated from 10% of the fish and from 12% of the frogs examined. In reptiles, E. coli was isolated from 33% of the crocodiles, 4% of the turtles, 2% of the snakes, and 10% of the lizards examined. As with birds, association with humans also appeared to explain some of the variation in the prevalence of E. coli among frog and lizard species. E. coli was unusually common in individuals of two species of lizards living in inner Sydney, New South Wales, and two species of tree frog living in the suburbs of Cairns, Queensland.
E. coli is less likely to be isolated from hosts living in tropical or desert climates than from hosts living in grassland or temperate climates. This pattern of variation was consistent across host orders and does not appear to be a host effect, as the prevalence of E. coli in species of the same genus differed depending on the climatic zone. Why E. coli is less prevalent in mammals living in the tropics is unknown. Maximum summer temperatures in the desert regions of central Australia are typically in excess of 35°C, the relative humidity is <25%, and median rainfall is <20 mm a month. It seems likely that these conditions would adversely affect the survival of E. coli in the external environment and may, in turn, limit the transmission of E. coli among individual hosts.
Alternatively, the near absence of E. coli in the feces of desert-dwelling mammals may be a consequence of host biology. Mammals living in arid regions have adaptations that enable them to conserve water. While water loss through defecation represents a small fraction of an animal’s water balance, morphological modifications in the colon to maximize water retention do occur (38, 60, 65). It may be that the lower water content of material in the distal colon adversely affects the survival of E. coli cells in this region, and as a result, cell densities are reduced to such an extent that E. coli is unlikely to be detected in feces.
The likelihood of a host harboring E. coli depends primarily on three factors. The first is the frequency with which a host individual is exposed to E. coli. The second is the probability that an exposure event will result in the establishment of a population. The third is the average length of time the E. coli population can persist in the host, or the rate at which the host loses its population of E. coli.
The observation that some hosts are more likely to harbor E. coli if they are commensally associated with humans than if they are not indicates that exposure rates vary among hosts. Many species of birds, such as house sparrows (Passer domesticus) and starlings (Turdus merula), are fed on household food scraps or forage in compost heaps that are likely to contain E. coli. In addition, these higher exposure levels may also be due to elevated levels of environmental contamination by E. coli due to the feces of domestic pets and livestock, hosts in which E. coli is prevalent. Elevated levels of environmental contamination may also lead to increased carriage of E. coli by invertebrates, which would explain the atypically high prevalence of E. coli in frogs and lizards living near human habitation.
It is unlikely that the extent to which a host is exposed to E. coli is the sole factor responsible for the variation in prevalence observed among different species. The variation may also be a consequence of host characteristics that affect the ability of E. coli to establish a population in the gastrointestinal tract.
Carnivorous mammals have relatively simple guts, because animals represent an energy-rich and easily digested food source. The digestive tract of carnivores is dominated by the small intestine, while the colon is short and often poorly differentiated from the small intestine. Among Australian carnivores, a cecum is usually lacking or poorly developed (25). Short intestines, coupled with high-quality food, result in rapid gut transit times. For example, in the carnivorous marsupials (Dasyuridae), gut transit times increase with body size and vary from about 1 h for the 18-g Sminthopsis crassicaudata to 13 h for the 1,000-g Dasyurus viverrinus (25).
By contrast, plant tissue represents an energy-poor food source that is difficult to digest. Microbial fermentation enables mammals consuming plant tissue to increase the nutritional value of the material they ingest, but it requires the presence of a fermentation chamber. In kangaroos and wallabies (Macropodidae), as well as the rat kangaroos (Potoroidae), the foregut provides the primary site of microbial fermentation, although these species also have a cecum (25). The hindgut is the site of microbial fermentation in the balance of Australian herbivores(25). In the majority of small (<10-kg) herbivores, the cecum forms the fermentation chamber, while in the larger species, fermentation takes place in the colon (25). Retention times are significantly longer in herbivores than in carnivores and, as in carnivores, increase with body mass. For example, food transit times are about 35 h in the 1-kg common ringtail possum (Pseudocheirus peregrinus) and about 100 h in the 10-kg koala (Phascolarctos cinereus) (25).
The digestive tracts of Australian omnivores are intermediate between those of carnivores and herbivores (25). In the small, primarily insectivorous mountain pygmy possum (Burramys parvus), the cecum is very small and the colon is short, while the sugar glider (Petaurus breviceps) has a well-developed cecum. The bandicoots have a modest-sized cecum and colon (25). All Australian rodents possess a cecum (9). In omnivores, food transit rates also depend on body mass and fall between those of carnivores and herbivores (25).
Several mathematical models have been developed with the purpose of exploring the dynamics of bacterial populations in the gastrointestinal tract. These models have investigated the process of bacterial establishment using chemostat models (2, 3, 16, 57, 59, 61) and plug flow reactor models (1, 2, 32). The chemostat models describe a physical system most closely resembling that of a cecum, while the plug flow reactor models describe a typical carnivore gut. These models have shown that establishment is more likely when the bacterial population exhibits wall growth than if no wall growth can occur. Nutrient concentration is also a factor, and there is a threshold concentration of nutrients entering the model gut below which the bacterial population cannot establish itself and above which the population will be established and persist. Further, the rate at which material moves through the gut (transit time) can also determine whether a bacterial population can become established: decreasing transit times lead to populations being less likely to colonize.
The quantity and type of nutrients entering the lower gastrointestinal tract will vary with the host diet. Gut turnover rates and transit times will also vary with diet, gut morphology, and body mass. The theoretical models suggest that these factors may well influence the likelihood of E. coli being able to establish a population in a host and hence determine the prevalence of E. coli in a particular host species. Further experimental and theoretical work is required in order to determine if one of the factors—host diet, gut morphology, or body mass (food transit rate)—is more likely to dominate the dynamics of bacterial colonization in the gastrointestinal tract.
To what extent the residence time of an E. coli population varies with diet, gut morphology, or body mass is unknown. There have been few studies that have examined the persistence time of an E. coli strain in a host, and these studies have largely been restricted to humans (see below). The results of these studies indicate that a strain may persist for months, if not years. However, the data also suggest that, inevitably, the resident strain is eventually lost. The factors responsible for the turnover of strains in a host are unclear. Long-term longitudinal studies of the persistence of E. coli in different host species are required.
The results of the survey of the distribution of E. coli in Australian vertebrates (19) indicate that, as a species, E. coli is adapted to mammals with hindgut microbial fermentation chambers, or in the absence of a hindgut fermentation chamber, the host must be "large." The contrast between rodents and carnivores highlights the apparent importance of a hindgut fermentation chamber. E. coli is common in most rodent species, all of which posses a cecum. E. coli is uncommon in bats and many marsupial carnivores, all of which lack a cecum. Yet, many species of rodent are at least as small as bats and smaller than many of the species of marsupial carnivore in which E. coli is uncommon. The observation that the prevalence of E. coli in carnivores increases with host body mass indicates that a large body mass can compensate for the lack of a hindgut fermentation chamber.
Multilocus enzyme electrophoresis (MLEE) studies indicate that E. coli, relative to other members of the family Enterobacteriaceae, exhibits a moderate degree of genetic diversity. For example, in a study of isolates from native Australian mammals, E. coli exhibited less MLEE diversity than Citrobacter freundii, Enterobacter cloacae, Hafnia alvei, or Klebsiella oxytoca and a level of diversity comparable to that of Klebsiella pneumoniae (21, 41). However, such comparisons are fraught with difficulties, as the degree of genetic variation observed can depend very much on the precise nature of the samples being compared due to host and geographic structures (21, 41).
Asexual reproduction (vertical gene transmission) gives rise to bacterial lineages that are related by descent (58). New lineages arise as a consequence of mutation. Bacteria also have the ability to exchange genetic material with cells of different strains or species through the processes of conjugation, transformation, and transduction (58). Gene transfer results in genetic diversification through the introduction of new genetic material into a lineage, but gene transfer can also lead to genetic homogenization as genetic material becomes distributed among lineages. Thus, the genetic structure of a bacterial population or species is the net result of the processes of mutation and gene transfer coupled with the outcomes of selection. When mutation is the predominant process, bacterial populations are highly clonal, as evidenced by significant levels of linkage disequilibrium (the nonrandom association of alleles among loci). By contrast, when gene transfer predominates (10 to 20 times the mutation rate [35]), the genetic structure of the population is said to be panmictic and linkage disequilibrium is not observed.
It is now well established that there is extensive variation in genetic structure among bacterial species (15). Species like Neisseria gonorrhoeae and Helicobacter pylori show little indication of clonality. Neisseria meningitidis and Streptococcus pneumoniae are species that are considered to be weakly clonal. By contrast, species such as Salmonella enterica are highly clonal, and mutation is the dominant force shaping this species’ genetic structure. E. coli also has a clonal genetic structure, although not to the extent exhibited by S. enterica (58).
Early MLEE studies demonstrated the existence of "subspecific" structure in E. coli (64). More extensive MLEE analysis, together with other techniques of genetic analysis, has confirmed the existence of this subspecific structure (12, 24, 33, 39). At present, there are four recognized subspecies of E. coli, which have been designated A, B1, B2, and D. A and B1 are considered to be sister groups, and group B2 is considered to represent the "ancestral lineage" of E. coli (33). It appears that the great majority of E. coli strains isolated from feces can be assigned to one of these four groups. Recently, a PCR-based method has been developed that enables an E. coli isolate to be assigned to one of the four E. coli groups by using a dichotomous-key approach based on the presence or absence of two genes (chuA and yjaA) and an anonymous DNA fragment (10).
The existence of subspecific structure in E. coli has for the most part been determined by assessing genetic variation that is thought to be largely neutral in its effects on the fitness of a strain (54). However, strains of the four groups of E. coli also differ in their phenotypic characteristics, including their abilities to exploit different sugars, their antibiotic resistance profiles, and their growth rate-temperature relationships (D. M. Gordon, unpublished).Genome size varies among the four E. coli groups, with A and B1 strains having smaller genomes than B2 or D strains (4). The distributions (presence or absence) of a range of putative virulence factors thought to be involved in the ability of a strain to cause extraintestinal disease also vary among the four E. coli groups (30).
Host diet and the climate where the host was collected were found to account for a significant amount of the variation in the distribution of A, B1, B2, and D strains among nondomesticated mammalian hosts (19). Host body mass also explained a portion of the variation, but the nature of the relationship between body mass and the distribution of the four E. coli groups depended upon the host diet. In birds, no statistically significant effects of host body mass, diet, or human association on the distribution of A, B1, B2, and D strains could be detected, and too few strains were isolated from ectothermic vertebrates for any meaningful statistical analysis (19).
In mammalian herbivores, the frequency of group A strains declined with increasing body mass (Fig. 2). B2 strains predominated in herbivores weighing <6 kg, but as body size increased beyond 6 kg, there was a sharp decline in the frequency of B2 and a concomitant increase in the frequency of B1 strains. In the omnivorous mammals, the frequency of B2 strains increased with increasing host body mass and the frequencies of all other groups declined (Fig. 2). In carnivores, the frequency of group A strains increased with host body mass, and there was a decline in the frequencies of all other groups (Fig. 2). However, the significant effect of carnivore body mass was entirely due to a single species, the Tasmanian devil (Sarcophilus harrisii). Although Tasmanian devils represented only 37% of the carnivores from which E. coli were recovered, 76% of all group A strains isolated from carnivores were from this species. Thus, compared to other carnivores, group A strains were significantly overrepresented in Tasmanian devils.
The relative frequencies of A, B1, B2, and D strains in the different vertebrate host groups are depicted in Fig. 3 (19). The relative abundances of the four groups were similar in mammalian herbivores and omnivores, but the patterns seen in herbivores and omnivores differed from that found in carnivores. The distributions of the four groups in birds and carnivorous mammals were similar. Ectothermic vertebrates had a distribution of the four E. coli groups very different from the patterns seen in birds and carnivorous mammals or from the distribution observed in mammalian omnivores or herbivores.
In ectotherms, birds, and carnivorous mammals, hosts which all have relatively simple tube-like gastrointestinal tracts, group B1 strains predominate. In hosts with a cecum, the omnivorous and herbivorous mammals, group B2 strains predominate, while in the mammals, the relative frequencies of A, B1, B2, and D strains vary with the host body mass and diet. Gordon and Cowling (19) argue that these observations support the conclusion that the observed interspecies variation in the prevalence of E. coli is due in part to the interaction between gut dynamics and bacterial establishment and is not solely a consequence of interspecies differences in exposure to E. coli.
The results of MLEE studies suggest that there is little geographic differentiation among E. coli populations. Geographic differentiation accounted for about 2% of the allelic variation observed in a collection of E. coli isolates from humans living on four continents (64). In E. coli isolates from nondomesticated animals collected throughout Australia, 5% of the variation could be attributed to geographic effects (21), while 1% of the variation observed in E. coli populations isolated from humans living in different North American cities could be attributed to geographic differentiation (8). Similarly, locality accounted for only 2% of the variation observed in E. coli isolates from house mice living in two populations 15 km apart (17).
By contrast, large differences in the relative frequencies of A, B1, B2, and D strains have been observed in E. coli populations isolated from humans living in different regions (13; Gordon, unpublished). The frequencies of D strains in human feces appear to be relatively constant in different regions, but there are large differences in the relative abundances of group A, B1, and B2 strains (Fig. 4). The reasons for the differences are unknown, although climate may play some role. In a study of E. coli isolates from Australian mammals, the relative abundances of A, B1, B2, and D strains were found to vary significantly with the climate of the localities where the hosts were collected (19). Does diet matter? Certainly there are substantial differences among the diets of a typical French person, Australian, Croatian, or member of the Dogon people in Mali.
Savageau (49) observed that the life cycle of E. coli involves a transition between two distinct environments. He considered the primary habitat of E. coli to be the gastrointestinal tract, while soil, water, and sediment represent the species’ secondary habitats. It has been estimated that a typical E. coli cell spends one-half of its life in an environment external to the host (23, 49). Two hypotheses have been proposed to describe the manner in which E. coli responds to the transition from its primary to secondary habitats. Savageau (49) suggested that E. coli cells cope with the transition by possessing a dual regulation system, in which genes with products in high demand are under positive control while genes with products in low demand are under negative control. The control systems are hypothesized to alternate depending on the specific demands imposed by the primary and secondary environments. Savageau (49) presented numerous examples of physiological functions in enteric bacteria that are consistent with the demand theory, and it has been demonstrated that in E. coli some genes are preferentially expressed in the external environment (14).
Whittam (63) suggested that selection is the dominant force acting on E. coli during the transition from primary to secondary habitats. That is, there are strains of E. coli primarily adapted to the host environment, while other strains are better adapted to the external environment. Whittam based this conclusion on the results of a study of E. coli isolated from domestic birds and the litter on which these birds were raised. Of the 113 distinct MLEE types identified, only 10% were recovered from both habitats. Further, the data revealed two genetically distinct groups of strains: the majority of isolates taken from the birds made up one cluster, while the majority of strains isolated from the environment made up a second cluster. Gordon et al. (22) observed a similar outcome. E. coli was sampled from two two-person households and their associated septic tanks. The only fecal inputs to the septic tanks were from the household members. In one of the households, a single MLEE type represented >90 % of the isolates recovered from the septic tank, and this strain could not be detected in either household member. Further, the growth rate-temperature responses of strains isolated from this household’s septic tank and strains isolated from the household members differed. The septic tank isolates grew better at low temperatures, but more slowly at high temperatures, than the strains isolated from humans.
The consequences for E. coli of the transition between its primary and secondary habitats are of considerable practical significance for water quality assessment (18) and disease transmission (47). Although the nature of the external environment in which E. coli finds itself influences its distribution and genetic structure, we have a very poor understanding of how this affects the transmission dynamics of E. coli or how transmission dynamics vary among strains of the different groups of E. coli.
E. coli causes a significant fraction of human bacterial disease (56) and is responsible for two main types of disease in humans and domestic animals: diarrheal disease and extraintestinal infections. The strains responsible for diarrheal disease have been divided into different pathotypes: enterotoxigenic, enteropathogenic, enterohemorrhagic, enteroinvasive, enteroaggregative, and diffusely adherent E. coli and the various "Shigella" pathotypes. E. coli causes extraintestinal disease when it invades a normally sterile body site, and the range of clinical syndromes caused by E. coli is largely related to the part of the body that has been invaded. The kinds of extraintestinal disease caused by E. coli include cystitis, pyelonephritis, septicemia, and meningitis.
The great majority of strains responsible for extraintestinal infection are group B2 strains, and to a lesser extent group D strains (30, 31, 33, 34, 43). By contrast, strains from groups A and B1 are seldom responsible for extraintestinal disease (42, 43). Strains responsible for intestinal disease are most often derived from groups A and B1, and to a lesser extent group D, and do not appear to be derived from group B2 strains (44, 45, 48).
There is little understanding of the epidemiological factors that lead to infections that cause disease. The E. coli cells responsible for urinary tract infection are thought to originate in the intestinal E. coli community of the infected host. One view is that strains responsible for urinary tract infections are those that are numerically dominant in the intestine at the time of infection (37). Another view is that urinary tract infections are caused by a subset of strains present in the gastrointestinal tract which express a particular suite of virulence traits (37). As previously stated, strains of the B2 group of E. coli are far more frequently found to be the cause of extraintestinal infection than strains of the other E. coli groups. Group B2 strains are also more likely to possess those traits thought be responsible for a strain’s ability to cause extraintestinal disease (6, 28, 31). However, the frequency with which group B2 strains represent the dominant strain in a human host varies considerably among populations (Fig. 4). Do populations with a high frequency of B2 strains suffer from an elevated risk of urinary tract infections?
Many of the E. coli pathotypes responsible for intestinal disease can cause disease at very low infective doses, yet not all hosts exposed to these pathotypes develop disease. The factors responsible for this variation are largely unknown. However, there is a growing body of evidence that some bacterial species can protect the host from intestinal disease, presumably by reducing the likelihood that the pathogenic strain will successfully establish a population in the gastrointestinal tract of the host (62). There is some evidence to suggest that E. coli can limit the establishment of other members of the Enterobacteriaceae. Gordon and FitzGibbon (20) presented data showing that there was a negative association between the presence of E. coli in a host and the presence of most other species of Enterobacteriaceae. Are there commensal strains of E. coli that can limit the establishment of E. coli diarrheal pathotypes?
In a human host, E. coli typically occurs at a density of 107 to 108 cells per g of feces (23). When multiple isolates from a single human fecal sample are examined, the majority of the isolates are identical (7, 8, 22, 50). However, from 1 to 14 electrophoretic types can be recovered from a single sample, and the relative abundances of the electrophoretic types can vary substantially. For example, the probability of choosing two isolates and finding them to be different can vary from 0 to 0.91. Similar results have been seen when multiple isolates from individual nonhuman hosts have been characterized (36, 46). The most abundant clone present in a fecal sample has been called the dominant clone. However, the precise definition of what constitutes a dominant clone varies between studies. For example, Schlager et al. (50) defined the dominant clone as the clone which represented >50% of the isolates, while Gordon et al. (22) defined the dominant clone as any clone that could be recovered through nonselective plating. Is there a relationship between the clonal diversity observed in a host and the E. coli group represented by the dominant strain? For example, is clonal diversity typically lower in hosts in which a B2 strain is dominant and higher in hosts in which a group A strain predominates?
Even when a relatively large number (e.g., 100) of isolates are selected from a sample plated on nonselective agar, it is unlikely that a variant that represents <1% of the total cell population will be detected. Other techniques are required to detect variants occurring at lower frequencies (22, 29). Strains occurring at low frequencies have been designated minor, or minority, variants, although again, the definition varies among studies. Schlager (50) defined any variant that represented <50% of the isolates as a minor variant, while Gordon et al. (22) defined a minority variant as any variant recovered through selective plating. In the study of Gordon et al. (22), minority strains were recovered from both male and female human hosts. However, analysis of the MLEE data showed that in female human hosts, minority strains represented a population of strains distinct from the dominant strains. In the two males, the minority strains recovered were not distinct from the dominant strains found in these hosts. Are strains present at low frequencies in a host more likely to be members of one of the four groups of E. coli?
Few studies have examined the manner in which the clonal composition of an individual host changes with time (7, 26, 51, 52, 53, 55). Overall, variation through time accounts for about 25% of the observed variation in the clonal composition of a host (8, 22, 63). However, the magnitudes of temporal effects can vary considerably among individuals (22). For example, in four hosts examined twice, at either 7- or 11-day intervals, between-sample variation account for 0 and 14% of the observed clonal diversity in the two adult males examined, while temporal effects accounted for 26 and 85% of the observed clonal diversity in the two adult females. Variants recovered repeatedly over several samples have been called resident strains, while those variants recovered only once or from a few consecutive samples have been called transient strains. Does the typical residence time of a strain vary depending on whether the strain is a member of group A, B1, B2, or D? Does the rate at which transient strains appear at detectable frequencies over time vary depending on the E. coli group membership of the resident strain? If this is the case, can these strains prevent or limit the establishment of strains responsible for diarrheal disease?
The theoretical and empirical evidence suggests that the dynamics of the gastrointestinal tract influence both the probability of isolating E. coli from a host and the type of E. coli found in the host. The dynamics of the gastrointestinal tract differ between human males and females. For example, gut transit times are about 50% longer in females than in males (11). Although there is very limited evidence, there is some suggestion in the available data (22) that the clonal composition and dynamics of E. coli populations in the gut differ between human males and females. If this is in fact true, are the traits that enable a strain it to inhabit a female’s gut also traits which might "predispose" the strain to cause urinary tract infection?
The observed distribution of strains from the different E. coli genetic groups indicates that they have different life history tactics and ecological niches. A and B1 strains appear to be generalists, as they can be recovered from any vertebrate group. Group B2 and D strains appear to be more specialized, as they are largely restricted to endothermic vertebrates. At least in nonhuman mammals, group B2 strains appear to be the most specialized, as they predominate in hosts with a hindgut fermentation chamber. It is likely that there are many factors responsible for the different distributions of E. coli genetic groups among host groups. The observed distribution of A, B1, B2, and D strains may reflect intergroup differences in the ability to establish a population in the gut, as appears to be the case for B2 and D strains in ectotherms. Competitive interactions among strains within a host may also play a role. B1 strains can be isolated from hosts of any vertebrate group but predominate in hosts with a simple hindgut, while B2 strains predominate in hosts with hindgut fermentation. Does this outcome occur because B1 strains are primarily adapted to vertebrates with a simple hindgut, or are they competitively inferior to group B2 strains in hosts with hindgut fermentation? What are the attributes of D strains that apparently restrict them to homeotherms, and are they the same traits that restrict B2 strains to homeotherms? Why do D strains typically represent only 15 to 20% of the E. coli strains found in homeotherms? Why does the relative abundance of D strains remain the same regardless of the host species or geographic locality while the relative abundances of A, B1, and B2 strains vary substantially?
To what extent do the transmission biologies of A, B1, B2, and D strains differ? B2 strains are predominantly responsible for extraintestinal infection, and particularly in the case of urinary tract infections, the strains responsible for infection are thought to originate from the intestinal E. coli community of the infected host. By contrast, food and water are thought to be the main sources of strains responsible for intestinal infection, and most of these diarrheal pathotypes are derived from A, B1, and D strains. Are B2 strains primarily adapted for host-to-host transmission while strains of the other groups are more likely to be transmitted from host to environment to host?
Although E. coli can be isolated from a wide range of organisms, the data suggest that the preferred niche of E. coli is a mammal with a hindgut modified to facilitate microbial fermentation or, in the absence of a fermentation chamber, a large host. The morphology of the gastrointestinal tract, food transit time, and the nature of the nutrients available to E. coli resulting from different host diets are all interrelated factors. Further theoretical and empirical studies are needed if we are to determine whether one of the factors—gut structure, transit time, or nutrients—dominates the colonization dynamics of E. coli in a host.
E. coli exhibits subspecific genetic structure, and it appears that each of the subspecies has its own preferred ecological niche. The exact natures of the niches of A, B1, B2, and D strains need to be determined. Further studies are also required in order to discover the extent to which the within-host dynamics of each of the subspecies differ among different host groups, as well as the extent to which each of the subspecies differs in its transmission dynamics. In addition to defining the ecological niche and epidemiology of each of the subspecies of E. coli, studies to identify the natures of the adaptations responsible for these differences among subspecies are required. It is likely that acquiring this knowledge will enhance efforts to manage diseases caused by the various pathotypes of E. coli in humans and domestic animals.
References
1. Ballyk, M., and H. L. Smith. 1999.A model of microbial growth in a plug-flow reactor with wall attachment. Math. Biosci. 158:95–126. [CrossRef]
2. Ballyk, M., D. Le, D. A. Jones, and H. L. Smith. 2001. Microbial competition in reactors with wall attachment: a comparison of chemostat and plug flow models. Microb. Ecol. 41:210–221.
3. Baltzis, B. C., and A. G. Fredrickson. 1983. Competition of two microbial populations for a single resource in a chemostat when one of them exhibits wall attachment. Biotechnol. Bioeng. 25:2419–2439. [CrossRef]
4. Bergthorsson, U., and H. Ochman. 1998. Distribution of chromosome length variation in natural isolates of Escherichia coli. Mol. Biol. Evol. 15:6–16.
5. Bingen, E., S. Bonacorsi, N. Brahimi, E. Denamur, and J. Elion. 1997. Virulence patterns of Escherichia coli K1 strains associated with neonatal meningitis. J. Clin. Microbiol. 35:2981–2982.
6. Boyd, E. F., and D. L. Hartl. 1998. Chromosomal regions specific to pathogenic isolates of Escherichia coli have a phylogentically clustered distribution. J. Bacteriol. 180:1159–1165.
7. Caugant, D. A., B. R. Levin, and R. K. Selander. 1981. Genetic diversity and temporal variation in the E. coli populations of a human host. Genetics 98:467–490.
8. Caugant, D. A., B. R. Levin, and R. K. Selander. 1983. Distribution of multilocus genotypes of Escherichia coli within and between host families. J. Hyg. 92:377–384.
9. Chivers, D. J., and P. Langer. 1994. The Digestive System in Mammals: Food, Form and Function. Cambridge University Press, Cambridge, United Kingdom.
10. Clermont, O., S. Bonacorsi, and E. Bingen. 2000. Rapid and simple determination of Escherichia coli phylogenetic group. Appl. Environ. Microbiol. 66:4555–4558. [CrossRef]
11. Cummings, J. H., S. A. Bingham, K. W. Heaton, and M. A. Eastwood. 1992. Faecal weight, colon cancer risk, and dietary intake of nonstarch polysaccharides (dietary fibre). Gastroenterology 103:1783–1789.
12. Desjardins, P., B. Picard, B. Kaltenböck, J. Elion, and E. Denamur. 1995. Sex in Escherichia coli does not disrupt the clonal structure of the population: evidence from randomly amplified polymorphic DNA and restriction-fragment-length polymorphism. J. Mol. Evol. 41:440–448. [CrossRef]
13. Duriez, P., O. Clermont, S. Bonacorsi, E. Bingen, A. Chaventre, J. Elion, R. Picard, and E. Denamur. 2001.Commensal Escherichia coli are phylogentically distributed among geographically distinct human populations. Microbiology 147:1671–1676.
14. Espinosa-Urgel, M., and R. Kolter. 1998. Escherichia coli genes expressed preferentially in an aquatic environment. Mol. Microbiol. 28:325–332. [CrossRef]
15. Feil, E. J., and B. G. Spratt. 2001. Recombination and the population structures of bacterial pathogens. Annu. Rev. Microbiol. 55:561–590. [CrossRef]
16. Freter, R., S. Brickner, and S. Temme. 1986.An understanding of colonization resistance of the mammalian large intestine requires mathematical analysis. Microecol. Ther. 16:147–155.
17. Gordon, D. M. 1997. The genetic structure of Escherichia coli populations in feral house mice. Microbiology 143:2039–2046. [CrossRef]
18. Gordon, D. M. 2001.Geographical structure and host specificity in bacteria and the implications for tracing the source of coliform contamination. Microbiology 147:1079–1085.
19. Gordon, D. M., and A. Cowling. 2003. The distribution and genetic structure of Escherichia coli in Australian vertebrates: host and geographic effects. Microbiology 149:3575–3586. [CrossRef]
20. Gordon, D. M., and F. FitzGibbon. 1999. The distribution of enteric bacteria among Australian mammals. Microbiology 145:2663–2671.
21. Gordon, D. M., and J. Lee. 1999. The genetic structure of enteric bacteria from Australian mammals. Microbiology 145:2673–2682.
22. Gordon, D. M., S. Bauer, and J. R. Johnson. 2002. The genetic structure of Escherichia coli populations in primary and secondary habitats. Microbiology 148:1513–1522.
23. Hartl, D. L., and D. E. Dykhuizen. 1984. The population genetics of Escherichia coli. Annu. Rev. Genet. 18:31–68. [CrossRef]
24. Herzer, P. J., S. Inouye, M. Inouye, and T. S. Whittam. 1990. Phylogenetic distribution of branched RNA-linked multicopy single-stranded DNA among natural isolates of Escherichia coli. J. Bacteriol. 172:6175–6181.
25. Hume, I. D. 1999. Marsupial Nutrition. Cambridge University Press, Cambridge, United Kingdom.
26. Jenkins, M. B., P. G. Hartel, T. J. Olexa, and J. A. Stuedemann. 2003. Putative temporal variability of Escherichia coli ribotypes from yearling steers. J. Environ. Qual. 32:305–309.
27. Johnson, J. R., and M. Kuskowski. 2000.Clonal origin, virulence factors, and virulence. Infect. Immun. 68:424–425. [CrossRef]
28. Johnson, J. R., and A. L. Stell. 2000.Extended virulence genotypes of Escherichia coli strains from patients with urosepsis in relation to phylogeny and host compromise. J. Infect. Dis. 181:261–272. [CrossRef]
29. Johnson, J. R., J. J. Brown, U. B. Carlino, and T. A. Russo. 1998. Colonization with and acquisition of uropathogenic Escherichia coli as revealed by polymerase chain reaction-based detection. J. Infect. Dis. 177:1120–1124. [CrossRef]
30. Johnson, J. R., P. Delavari, M. Kuskowski, and A. L. Stell. 2001. Phylogenetic distribution of extraintestinal virulence-associated traits in Escherichia coli. J. Infect. Dis. 183:78–88. [CrossRef]
31. Johnson, J. R., A. L. Stell, P. Delavari, A. C. Murray, M. Kuskowski, and W. Gaastra. 2001. Phylogenetic and pathotypic similarities between Escherichia coli isolates from urinary tract infections in dogs and extraintestinal infections in humans. J. Infect. Dis. 183:897–906. [CrossRef]
32. Jones, D., H. V. Kojouharov, D. Le, and H. Smith. 2002. Bacterial wall attachment in a flow reactor. SIAM J. Appl. Math. 62:1728–1771. [CrossRef]
33. Lecointre, G., L. Rachdi, P. Darlu, and E. Denamur. 1998. Escherichia coli molecular phylogeny using the incongruence length difference test. Mol. Biol. Evol. 15:1685–1695.
34. Manges, A. R., J. R. Johnson, B. Foxman, T. T. O’Bryan, K. E. Fullerton, and L. W. Riley. 2001. Widespread distribution of urinary tract infections caused by a multidrug-resistant Escherichia coli clonal group. N. Engl. J. Med. 345:1007–1013. [CrossRef]
35. Maynard Smith, J., N. H. Smith, M. O’Rourke, and B. G. Spratt. 1993.How clonal are bacteria? Proc. Natl. Acad. Sci. USA 90:4384–4388. [CrossRef]
36. McLellan, S. L., A. D. Daniels, and A. K. Salmore. 2003. Genetic characterization of Escherichia coli populations from host sources of fecal pollution by using DNA fingerprinting. Appl. Environ. Microbiol. 69:2587–2594. [CrossRef]
37. Mobley, H. L. T., and J. W. Warren. 1996. Molecular pathogenesis and clinical management. Urinary Tract Infect. 3:67–94.
38. Murray, B. R., I. D. Hume, and C. R. Dickman. 1995.Digestive tract characteristics of the spinifex hopping-mouse, Notomys alexis and the sandy inland mouse, Pseudomys hermannsburgensis in relation to diet. Aust. Mammal 18:93–97.
39. Ochman, H., and R. K. Selander. 1984. Standard reference strains of Escherichia coli from natural populations. J. Bacteriol. 157:690–693.
40. Okada, O., and D. M. Gordon. 2003. Genetic and ecological structure of Hafnia alvei in Australia. Syst. Environ. Microbiol. 26:585-693. [CrossRef]
41. Okada, S., and D. M. Gordon. 2001.Host and geographical factors influence the thermal niche of enteric bacteria isolated from native Australian mammals. Mol. Ecol. 10:2499–2513. [CrossRef]
42. Picard, B., C. Journet-Mancy, N. Picard-Pasquier, and P. Goullet. 1993. Genetic structure of B2 and B1Escherichia coli strains responsible for extraintestinal infection. J. Gen. Microbiol. 139:3079–3088.
43. Picard, B., J. Sevali Garcia, S. Gouriou, P. Duriez, N. Brahimi, E. Bingen, J. Elion, and E. Denamur. 1999. The link between phylogeny and virulence in Escherichia coli extraintestinal infection. Infect. Immun. 67:546–553.
44. Pupo, G. M., D. K. R. Karaolis, R. Lan, and P. R. Reeves. 1997. Evolutionary relationships among pathogenic and nonpathogenic Escherichia coli inferred from multilocus enzyme electrophoresis and mdh sequence studies. Infect. Immun. 65:2685–2692.
45. Pupo, G. M., R. Lan, and P. R. Reeves. 2000. Multiple independent origins of Shigella clones of Escherichia coli and convergent evolution of their many characteristics. Proc. Natl. Acad. Sci. USA 97:10567–10572. [CrossRef]
46. Pupo, G. M., R. Lan, P. R. Reeves, and P. R. Baverstock. 2000. Population genetics of Escherichia coli in a natural population of native Australian rats. Environ. Microbiol. 2:594–610. [CrossRef]
47. Rasmussen, M. A., and T. A. Casey. 2001. Environmental and food safety aspects of Escherichia coli O157:H7 infections in cattle. Crit. Rev. Microbiol. 27:57–73. [CrossRef]
48. Rolland, K., N., Lambert-Zechovsky, B. Picard, and E. Denamur. 1998. Shigella and enteroinvasive Escherichia coli strains are derived from distinct ancestral strains of E. coli. Microbiology 144:2667–2672. [CrossRef]
49. Savageau, M. A. 1983. Escherichia coli habitats, cell types, and molecular mechanisms of gene control. Amer. Nat. 122:732–744. [CrossRef]
50. Schlager, T. A., J. O. Hendley, A. L. Bell, and T. S. Whittam. 2002. Clonal diversity of Escherichia coli colonizing stools and urinary tracts of young girls. Infect. Immun. 70:1225–1229. [CrossRef]
51. Sears, H. J., and I. Brownlee. 1952. Further observations on the persistence of individual strains of Escherichia coli in the intestinal tract of man. J. Bacteriol. 63:47–57.
52. Sears, H. J., I. Brownlee, and J. K. Uchiyama. 1950. Persistence of individual strains of Escherichia coli in the intestinal tract of man. J. Bacteriol. 59:293–301.
53. Sears, H. J., H. Janes, R. Saloum, I. Brownlee, and L. F. Lamoreaux. 1956. Persistence of individual strains of Escherichia coli in man and dog under varying conditions. J. Bacteriol. 71:370–372.
54. Selander, R. K., D. A. Caugant, and T. S. Whittam. 1987.Genetic structure and variation in natural populations of Escherichia coli, p. 1625–1648. In F. C. Neidhardt et al. (ed.), Escherichia coli and Salmonella typhimurium, Cellular and Molecular Biology. American Society for Microbiology, Washington, D.C.
55. Shooter, R. A., K. A. Bettelheim, S. M. J. Lenox-King, and S. O’Farrell. 1977. Escherichia coli serotypes in the faeces of healthy adults over a period of several months. J. Hyg. 78:95–98. [CrossRef]
56. Siitonen, A. 1994. What makes Escherichia coli pathogenic? Ann. Med. 4:229–231. [CrossRef]
57. Smith , H. L., and P. Waltman. 1995. The Theory of the Chemostat. Cambridge University Press, Cambridge, United Kingdom.
58. Sprat, B. G., and M. C. Maiden. 1999. Bacterial population genetics, evolution and epidemiology. Phil. Trans. R. Soc. Lond. B 354:701–710. [CrossRef]
59. Stemmons, E. D., and H. L. Smith. 2000.Competition in a chemostat with wall attachment. SIAM J. Appl. Math. 61:567–595. [CrossRef]
60. Stevens, C. E. 1988. Comparative Physiology of the Vertebrate Digestive System. Cambridge University Press, Cambridge, United Kingdom.
61. Topiwala, H., and G. Hamer. 1971. Effect of wall growth in steady state continuous culture. Biotechnol. Bioeng. 13:919–922. [CrossRef]
62. Vollard, E. J., and H. A. Clasener. 1994. Colonization resistance. Antimicrob. Agents Chemother. 38:409–414.
63. Whittam, T. S. 1989. Clonal dynamics of Escherichia coli in its natural habitat. Antonie Leeuwenhoek J. Microbiol. 55:23–32. [CrossRef]
64. Whittam, T. S., H. Ochman, and R. K. Selander. 1983. Geographic components of linkage disequilibrium in natural populations of Escherichia coli. Mol. Biol. Evol. 1:67–83.[PubMed]
65. Woodall, P. F., and J. D. Skinner. 1993. Dimensions of the intestine, diet and faecal water loss in some African antelope. J. Zool. Lond. 229:457–471. [CrossRef]