Can we use mendelian randomization to learn about the causal effects of chemical exposures on our health?

Author
Published

October 31, 2025

Doi

With the expected continued scaling of high-throughput mass spectrometry, many participants of cohort studies will not only have genetic information available, but also have exposure-related information. Can we use high-throughput exposure data to apply mendelian randomization (MR) for environmental chemicals? In this blog post I briefly go over MR, the assumptions most relevant to chemical exposures, and what exposure information can add.

dag {
bb="0,0,1,1"
G [pos="0.1,0.6"]
U [pos="0.6,0.3"]
X [exposure,pos="0.4,0.6"]
Y [outcome,pos="0.8,0.6"]
G -> X
U -> X
U -> Y
X -> Y
}

MR

MR uses instrumental variables to estimate causal effects for exposures on health and other personal outcomes. More specifically, MR uses the fact that conditional on parental genotypes, alleles are randomly allocated from parents to offspring.1 If we can find alleles that are reliably related to our exposure of interest, we can – under strict assumptions (Box 1) – use the random assignment of this gene as an instrument to infer the causal effect of an exposure on an outcome.

Assumptions classical IV

  1. Relevance: the genetic instrument (G) must be robustly associated with the exposure (X).
  2. Exchangeability (also known as independence): the genetic instrument (G) must not share any common causes (confounders) with the outcome (Y).
  3. Exclusion restriction: the genetic instrument (G) must affect the outcome (Y) only through its effect on the exposure (X). Any pathway from the instrument to the outcome that bypasses the exposure (e.g., via horizontal pleiotropy, where one gene influences multiple traits) violates this assumption.

Additional assumptions for estimation & interpretation of MR

  1. Gene-environment equivalency: hypothetically changing exposure through a change in genotype or a change in environment should produce the same effect on the outcome. Conceptually comparable to the consistency assumption in the standard causal inference literature.
  2. Homogeneity or monotonicity: no potential differences in the causal effect among individuals. Homogeneity assumes the exposure’s effect on the outcome is uniform across the entire population. Monotonicity is a less strict assumption as it only requires that the genetic instrument’s effect on the exposure goes in one direction (e.g., it increases the exposure for everyone it affects; it doesn’t decrease it for some).

Please see Sanderson et al. (2022) and Richmond and Smith (2022) for details.

MR has proved its value for evidence synthesis in studies of high-density lipoprotein cholesterol (HDL-C) and coronary heart disease. In these studies, a range of genetic variants that affect the level of HDL-C were used to test whether HDL-C decreases the risk of coronary heart disease.2 Many of the exposures we study in environmental epidemiology, however, are a different kind of exposure than HDL-C: the body does not produce these exposures; genetic variation can only influence how the body handles the exposure upon entry. Can we still use MR with such exposures?

Yes, this difference is not relevant to MR as researchers have famously used the variation in two genes involved in alcohol metabolism – ALDH2 and ADH1B – to instrument the effect of alcohol consumption on cardiovascular diseases. Variation in these genes reduces consumption (by causing an unpleasant physical reaction to alcohol) and as such there is a clear gene-environment equivalency3. This means that hypothetically changing exposure through a change in genotype or a change in environment should produce the same effect on the outcome. Such exact mimicking may be less clear for many other environmental exposures as there is no genetic variant for using (or being externally exposed to) environmental chemicals such as PFAS.4 This may complicate their causal effect interpretation, but that does not mean all is lost: we can still use MR to generate (qualitative) evidence by performing tests instead of estimating effects.5

And this has also been done numerous times for our chemical exposures of interest. For example, Cherry et al. (2002) found that the paraoxonase (PON1) enzyme hydrolyses diazinonoxon – a potentially toxic component found in an organophosphate used in sheep dip. Importantly, they also found a genetic variant associated with PON1 and thus lower detoxification. This variant could then serve as proxy for internal exposure to sheep dip and help us investigate the causal effect of sheep dip on health. Similarly, researchers have used genetic variation in the NAT2 gene that is involved in the detoxification pathway of carcinogens to triangulate the evidence of smoking on bladder cancer (García-Closas et al. (2005)), and two SNPs impacting arsenic metabolism efficiency were used to test the causal effect of arsenic exposure on skin lesions (Pierce et al. (2013)).

Gene-environment interactions

The papers on sheep dip and smoking did not mention MR. If placed in a general framework, it’s usually gene-environment interaction, not MR. However, we can think of many of these genetic instruments in MR as a gene-environment interaction.6 Ideally, this would be a gene-environment interaction without an independent gene effect. In other words, the association of the gene with the outcome is only visible in the presence of the exposure. These gene-environment interactions affect metabolism of exposure7 by affecting absorption, distribution, metabolism, and excretion (ADME) characteristics of the targeted exposure (Ritz et al. (2017)). Thereby, they affect the susceptibility of an individual and can serve as a proxy for internal exposure to the environmental exposure of interest which – in exposure assessment and epidemiology – is ultimately what we care about.

The body’s strategy for “handling” an environmental chemical (a xenobiotic) depends on its chemical nature. Substances that are already water-soluble, such as many metals, generally rely on rapid transport and excretion through the kidneys or intestinal tract. Environmental chemicals such as PCBs and PAHs are fat-soluble, however, allowing them to pass through cell membranes and bioaccumulate in fatty/lipid-rich tissues. Because they aren’t water-soluble, they cannot be easily excreted by the kidneys. In a three-phase process they are made water-soluble that allows excretion:

  1. Phase I (modification): enzymes, most famously the Cytochrome P450 (CYP450) superfamily (abundant in the liver), perform oxidation to add a reactive group. However, this initial modification can sometimes activate the xenobiotic, converting it into a reactive electrophile or free radical that is more toxic than the original compound.
  2. Phase II (conjugation): enzymes like NAT2 or GST attach a large water-soluble “tag” (e.g., glutathione) to the reactive group, neutralizing it.
  3. Phase III (transport – not always included in classical model, but polymorphisms in transporters may also contribute to differences in metabolism): transporters (like on the cell’s surface) then actively pump this new, large, water-soluble metabolite out of the cell and into the bile or blood, where it can be eliminated by the kidneys.

Water-soluble substances, like many heavy metals and metalloids (e.g., arsenic) – which are inducers of oxidative stress and cellular damage – don’t need Phase I to dissolve, but they still need to be detoxified and actively transported. They often use the Phase II (conjugation) and Phase III (transport) mechanisms directly. For example, arsenic is made less toxic and is “tagged” for excretion by adding methyl groups (a Phase II conjugation reaction).

This three-phase system (for fat-soluble chemicals) is part of a broader environmental response machinery. Motsinger-Reif et al. (2024), table 3 highlights other gene classes that respond to the presence of environmental chemicals or the damage they cause. These damage-response pathways are highly relevant for all environmental exposures – both the fat-soluble chemicals and water-soluble substances like heavy metals.

When we only want to test for and not estimate an effect though, we do not even need exposure information!8 We only need genetic variant and outcome data to implicate specific exposures in the etiology of a disease. In fact, this was what Katan proposed in his early writing on MR (e.g. Katan (1986)) where he described an approach that examined the association between genetic variants and the outcome to test for an effect of the exposure on the outcome. This is precisely what Carol Ann Gross-Davis et al. (2015) did. Namely, they conducted a case-control study of myeloproliferative neoplasms (MPNs) where they genotyped cases and controls for variants in ‘environmentally sensitive genes’. They found that certain genotypes, such as the NAT2 slow acetylator genotype and variants in CYP1A2, GSTA1, and GSTM3, were associated with a 3–5-fold increased risk of MPNs. Based on these “gene-only” associations, they concluded that the environmental exposures whose toxicity is modified by these specific genes, like aromatic compounds, may play a role in the etiology of MPNs.

However, importantly, they could not implicate a specific chemical, because the chemicals they studied share metabolism. In MR terms we would say that there is horizontal pleiotropy9 where the variant affects the outcome through other variables than a specific, single exposure (see below for an illustration). In general, this is a big hurdle for MR for environmental chemicals.

dag {
bb="0,0,1,1"
"metabolism of 2,4'-DDT" [pos="0.400,0.175"]
"metabolism of Tetradioxin" [exposure,pos="0.400,0.600"]
CYP1A1 [pos="0.100,0.600"]
Y [outcome,pos="0.800,0.600"]
"metabolism of 2,4'-DDT" -> Y
"metabolism of Tetradioxin" -> Y
CYP1A1 -> "metabolism of 2,4'-DDT"
CYP1A1 -> "metabolism of Tetradioxin"
}


Nevertheless, I think such approaches can play a role in evidence synthesis of environmental chemicals: the sources of bias are generally different from the more common epidemiological studies and as such they could be used in triangulation. Put differently, the confounding effect of the metabolizing gene will likely be different than if we were to directly study the environmental exposure. Moreover, sometimes it just takes a while to find suitable instruments10. For example, it took years to find a likely valid instrument for circulating vitamin C levels (Smith (2010)). Although it’s important to note that functional polymorphisms may not exist in many cases (Weiss and Terwilliger (2000)).

The paper from Carol Ann Gross-Davis et al. (2015) is from a decade ago and it seems that the Environmental Genome Project (EGP) resource they used to retrieve 647 known environmentally sensitive genes has been (spiritually) succeeded by projects like the Environmental Polymorphisms Registries (EPR), the Personalized Environment and Genes Studies (PEGS), and the Comparative Toxicogenomics Database. The latter seems like an especially useful resource to retrieve genes involved in metabolism of an environmental chemical. It can also provide insight into polymorphisms by its grouping of all chemicals that have been associated with a gene in past (mostly in vitro) studies.

What exposure information can add

So strictly speaking we do not even need measurement of exposures to do MR, but I think such measurements will nonetheless be useful in understanding the environmental response machinery as MR is most useful in evidence synthesis if the mechanism of association of the polymorphisms with the exposure (metabolism) is well-understood. Namely, biological knowledge is needed to know that the genes themselves do not cause disease, and if metabolism of other exposures is related to the gene or disease. Measurements of exposures can be useful in both cases: we can test for an association in the absence of exposure11, and we can of course use the exposure data to search for new candidate instruments, thereby generating useful biological information beyond the dominant in vitro studies used to select many environmentally sensitive candidate genes in the past.12

Past studies that incorporated chemical exposure have used classical job exposure matrices (JEMs) (e.g. Quintero Santofimio et al. (2025)) and existing studies could use high-throughput biomarkers. Of note here is the difference in measurement error structure between these exposure assessment methods: biomarkers typically have a classical error structure that dilutes effects towards the null and reduces statistical power, whereas JEMs – that assess exposure at the group level – mostly involve Berkson’s error, which does (in the general case) not bias estimates but instead reduces their precision. These differences could have implications for the design and merit of these studies for evidence synthesis.

With biomarkers, it’s worth distinguishing what type is used. Here, I want to briefly distinguish between the utility of the parent compound, the Phase I metabolite, or the Phase II metabolite for MR studies.

Using the parent compound in blood can provide a good proxy for the long-term, total-body burden of exposure. Measurement of a Phase I metabolite can be important as it sometimes is the more toxic, reactive form of the chemical, while measurement of a Phase II metabolite can help identify slow metabolizers as they would have lower levels of this final metabolite.

However, as a proxy for genetic metabolic function these can all be noisy because they are influenced by external exposure levels which are, of course, independent of the gene’s function.

Metabolic ratios (e.g. Phase II metabolite / Phase I metabolite) can help by normalizing for these differences in external exposure and as such can be more suitable for searching for genetic instruments. The Pierce et al. (2013) study on arsenic is one example of this, where they calculate a ratio as a measure of an individual’s methylation capacity.

I should note, though, that ratios in the presence of measurement error can bring some additional statistical challenges.

Conclusion

Gene-environment interactions for chemicals seem to mostly get attention with respect to targeted prevention by identifying susceptible individuals. I think it would be worthwhile to also increase their use in qualitative testing because their sources of bias will be different from more common epidemiological study designs and as such they will be useful for the synthesis of evidence.13 The MR framework then provides us with useful guidance to critically inspect these tests which is key with, for example, the challenging pleiotropy of environmental chemicals. Increased availability of exposure data can assist in the biological knowledge underpinnings of MR though measurement error will make this less trivial than we had perhaps hoped.

References

Argos, Maria et al. 2018. “Screening for Gene–Environment (G×E) Interaction Using Omics Data from Exposed Individuals: An Application to Gene-Arsenic Interaction.” Mammalian Genome 29 (1): 101–11. doi:10.1007/s00335-018-9737-8.
Brennan, Paul. 2004. “Commentary: Mendelian Randomization and Gene–Environment Interaction.” International Journal of Epidemiology 33 (1): 17–21. doi:10.1093/ije/dyh033.
Burstyn, I. et al. 2009. “The Virtues of a Deliberately Mis-Specified Disease Model in Demonstrating a Gene-Environment Interaction.” Occupational and Environmental Medicine 66 (6): 374–80. doi:10.1136/oem.2008.039081.
Carol Ann Gross-Davis et al. 2015. “The Role of Genotypes That Modify the Toxicity of Chemical Mutagens in the Risk for Myeloproliferative Neoplasms.” International Journal of Environmental Research and Public Health 12 (3): 2465–85. doi:10.3390/ijerph120302465.
Cherry, Nicola et al. 2002. “Paraoxonase (PON1) Polymorphisms in Farmers Attributing Ill Health to Sheep Dip.” Lancet (London, England) 359 (9308): 763–64. doi:10.1016/s0140-6736(02)07847-9.
Davey Smith, George, and Shah Ebrahim. 2003. Mendelian Randomization’: Can Genetic Epidemiology Contribute to Understanding Environmental Determinants of Disease?*.” International Journal of Epidemiology 32 (1): 1–22. doi:10.1093/ije/dyg070.
Davies, Neil M et al. 2019. “Within Family Mendelian Randomization Studies.” Human Molecular Genetics 28 (R2): R170–79. doi:10.1093/hmg/ddz204.
Dick, Danielle M et al. 2015. “Candidate GeneEnvironment Interaction Research: Reflections and Recommendations.” doi:10.1177/1745691614556682.
García-Closas, Montserrat et al. 2005. NAT2 Slow Acetylation, GSTM1 Null Genotype, and Risk of Bladder Cancer: Results from the Spanish Bladder Cancer Study and Meta-Analyses.” doi:10.1016/S0140-6736(05)67137-1.
Hutter, Carolyn M et al. 2013. “Gene-Environment Interactions in Cancer Epidemiology: A National Cancer Institute Think Tank Report.” doi:10.1002/gepi.21756.
Katan, M. B. 1986. “Apolipoprotein E Isoforms, Serum Cholesterol, and Cancer.” Lancet (London, England) 1 (8479): 507–8. doi:10.1016/s0140-6736(86)92972-7.
Kodali, Hanish P, Brian T Pavilonis, and C Mary Schooling. 2018. “Effects of Copper and Zinc on Ischemic Heart Disease and Myocardial Infarction: A Mendelian Randomization Study.” The American Journal of Clinical Nutrition 108 (2): 237–42. doi:10.1093/ajcn/nqy129.
Lee, Derrick G. et al. 2022. “Interactions Between Exposure to Polycyclic Aromatic Hydrocarbons and Xenobiotic Metabolism Genes, and Risk of Breast Cancer.” Breast Cancer 29 (1): 38–49. doi:10.1007/s12282-021-01279-0.
Luo, Hao, Igor Burstyn, and Paul Gustafson. 2013. “Investigations of GeneDisease Associations: Costs and Benefits of Environmental Data.” Epidemiology 24 (4): 562–68. doi:10.1097/EDE.0b013e3182944dd5.
Motsinger-Reif, Alison A. et al. 2024. “Gene-Environment Interactions Within a Precision Environmental Health Framework.” Cell Genomics 4 (7): 100591. doi:10.1016/j.xgen.2024.100591.
Pierce, Brandon L et al. 2013. “Arsenic Metabolism Efficiency Has a Causal Role in Arsenic Toxicity: Mendelian Randomization and Gene-Environment Interaction.” International Journal of Epidemiology 42 (6): 1862–72. doi:10.1093/ije/dyt182.
Quintero Santofimio, Valentina et al. 2025. “Gene–Occupational Exposure Interactions in Small Airways Obstruction in the UK Biobank: A Cross-Sectional Study.” Occupational and Environmental Medicine, October, oemed-2025-110112. doi:10.1136/oemed-2025-110112.
Richmond, Rebecca C., and George Davey Smith. 2022. “Mendelian Randomization: Concepts and Scope.” Cold Spring Harbor Perspectives in Medicine 12 (1): a040501. doi:10.1101/cshperspect.a040501.
Ritz, Beate R. et al. 2017. “Lessons Learned From Past Gene-Environment Interaction Successes.” American Journal of Epidemiology 186 (7): 778–86. doi:10.1093/aje/kwx230.
Sanderson, Eleanor et al. 2022. “Mendelian Randomization.” Nature Reviews Methods Primers 2 (1): 1–21. doi:10.1038/s43586-021-00092-5.
Schulte, P. A., C. Whittaker, and C. P. Curran. 2015. “Considerations for Using Genetic and Epigenetic Information in Occupational Health Risk Assessment and Standard Setting.” Journal of Occupational and Environmental Hygiene 12 (sup1): S69–81. doi:10.1080/15459624.2015.1060323.
Smith, George Davey et al. 2007. “Clustered Environments and Randomized Genes: A Fundamental Distinction Between Conventional and Genetic Epidemiology.” PLOS Medicine 4 (12): e352. doi:10.1371/journal.pmed.0040352.
Smith, George Davey. 2010. “Mendelian Randomization for Strengthening Causal Inference in Observational Studies: Application to Gene × Environment Interactions.” Perspectives on Psychological Science 5 (5): 527–45. doi:10.1177/1745691610383505.
Swerdlow, Daniel I et al. 2016. “Selecting Instruments for Mendelian Randomization in the Wake of Genome-Wide Association Studies.” International Journal of Epidemiology 45 (5): 1600–1616. doi:10.1093/ije/dyw088.
Tudball, Matthew J., George Davey Smith, and Qingyuan Zhao. 2023. “Almost Exact Mendelian Randomization.” arXiv. doi:10.48550/arXiv.2208.14035.
VanderWeele, Tyler J. et al. 2014. “Methodological Challenges in Mendelian Randomization:” Epidemiology 25 (3): 427–35. doi:10.1097/EDE.0000000000000081.
Weiss, K. M., and J. D. Terwilliger. 2000. “How Many Diseases Does It Take to Map a Gene with SNPs?” Nature Genetics 26 (2): 151–57. doi:10.1038/79866.
Weisskopf, Marc G., and Thomas F. Webster. 2017. “Trade-Offs of Personal Versus More Proxy Exposure Measures in Environmental Epidemiology.” Epidemiology 28 (5): 635. doi:10.1097/EDE.0000000000000686.
Westerman, Kenneth E., and Tamar Sofer. 2024. “Many Roads to a Gene-Environment Interaction.” The American Journal of Human Genetics 111 (4): 626–35. doi:10.1016/j.ajhg.2024.03.002.

Footnotes

  1. In non-familial studies (which are dominant in the literature due to data limitations) this is better described as ‘approximate MR’ (Davey Smith and Ebrahim (2003)). And the (implicit) verdict seems to be that this is good enough, although it is at risk of potential biases from dynastic effects, population structure, and assortative mating (Davies et al. (2019), Tudball, Smith, and Zhao (2023), Smith et al. (2007)).↩︎

  2. This showed a null association and in general, MR has been most useful to show null effects whereas classical observational studies showed consistent (confounded) effect sizes.↩︎

  3. Another instance of a violation of gene-environment equivalence is canalization where the body develops compensatory pathways to mitigate the effects of genetic or environmental variations during development. For example, an individual with a genetic predisposition to high levels of a particular substance from birth might develop resistance to its effects, which would not be the case for someone whose levels of that substance increase later in life due to environmental factors. Whether canalization occurs widely is unknown (Kodali, Pavilonis, and Schooling (2018)).↩︎

  4. GD Smith referred to many studies that carry out MR for GWAS hits for such exposures without gene-environment equivalency as ’noodles’ : they will find genetic correlates of noodle eating but this does not instrument a specific taste for noodles that causes them to eat more noodles. You’re just looking at upstream factors instead. (Related GD Smith presentation in better quality available here)↩︎

  5. These tests can be seen as an analogue to intent-to-treat analysis in RCTs.↩︎

  6. As VanderWeele et al. (2014) notes, gene-environment interactions depend on the definition of exposure. For example, the genes used to study bladder cancer cause you to extract more toxins per cigarette and as such amplify the effect of each cigarette smoked. If we define the exposure as pack-years there is a gene-environment interaction. However, if we define the exposure as all aspects of smoking behavior including the metabolism of toxins there is not a gene-environment interaction. Note also how the paper on arsenic talks about arsenic metabolism instead of arsenic.↩︎

  7. Other terms I encountered in the literature that describe the same or closely related phenomena (or which are useful terms for finding related literature): environmental response machinery, environmental response/sensitive genes, reduced detoxification, slow-metabolizer variants, poor/extensive metabolizer, toxicogenetics, precision environmental medicine.↩︎

  8. VanderWeele et al. (2014) also mention that an additional advantage of testing (instead of estimating) is that you do not need to fully specify what the exposure exactly is; the IV assumptions just need to hold under some definition of the exposure.↩︎

  9. In IV terms this is the exclusion restriction.↩︎

  10. Not discussed in detail in this post, but suitable instruments also implies that they are strong enough. Instruments that are only weakly associated with the exposure are known as “weak instruments” and can suffer from low power and introduce significant bias. This bias pulls the estimate toward the confounded observational association in one-sample MR and toward the null in two-sample MR. See e.g. Sanderson et al. (2022) and Richmond and Smith (2022) for more information.↩︎

  11. Absence of exposure can be very difficult with ubiquitous exposures such as PFAS though.↩︎

  12. Directly modeling the gene-environment interaction seems attractive when you have both exposure, disease, and gene data available. But Burstyn et al. (2009) note that deliberately mis-specifying your gene-environment interaction model by using a “gene-only” model can be beneficial if we can assume the gene only affects disease risk in exposed subjects and exposure measurements are assessed with error.↩︎

  13. If I had to counter my own position I would say that the need for this is limited because in the study of chemical exposures, by environmental and occupational epidemiology, confounding (i.e. what MR overcomes) is much less of a problem than in other parts of epidemiology. Namely, we often assess exposures at the group-level and the relationship of these group-level measurements to individual confounders is often very limited (see Weisskopf and Webster (2017) for example). The great challenge is instead assessing exposure. Weisskopf and Webster (2017) also describe a very cool general (non-genetic) instrumental variable in environmental epidemiology as the group-level exposure assessments can be seen as an instrumental variable for personal-level exposure measurements.↩︎

Citation

BibTeX citation:
@online{oosterwegel2025,
  author = {Oosterwegel, Max J.},
  title = {Can We Use Mendelian Randomization to Learn about the Causal
    Effects of Chemical Exposures on Our Health?},
  date = {2025-10-31},
  url = {https://maxoosterwegel.com/blog/mr-chemical-agents/},
  doi = {placeholder},
  langid = {en}
}
For attribution, please cite this work as:
Oosterwegel, Max J. 2025. “Can We Use Mendelian Randomization to Learn about the Causal Effects of Chemical Exposures on Our Health?” October 31. doi:placeholder.