The CAMDA Contest Challenges

For 2023, we present:

  • The Synthetic Clinical Health Records Challenge provides a rich set of highly realistic Electronic Health Records (EHR) tracing the diagnosis trajectories of diabetic patients, created with dual-adversarial auto-encoders trained on data from 1.2 million real patients in the Population Health Database of the Andalusian Ministry of Health. Predict relevant diabetes endpoints like blindness or cardiopathy from past diagnosis trajectories!

CAMDA encourages an open contest, where all analyses of the contest data sets are of interest, not limited to the questions suggested here. There is an online forum for the free discussion of the contest data sets and their analysis, in which you are encouraged to participate.

We look forward to a lively contest!

Synthetic Clinical Health Records

Although data protection is necessary to protect patients' intimacy, privacy regulations are also an obstacle to biomedical research. An interesting alternative is the use of synthetic patients. However, conventional synthetic patients are useless for discovery given that they are built out of known data distributions. Interestingly, Generative Adversarial Networks (GANs) and related developments have emerged as powerful tools to generate synthetic data in a way that captures relationships between the variables produced even if such relationships were previously unknown. GANs became popular in the generation of highly realistic synthetic pictures but have been applied in many fields, including in the generation of synthetic patients with applications such as medGAN and others.

This dataset includes an ordered list of pathologies for 999,936 synthetic patients. These synthetic patients have been generated from a real cohort with 979,308 diabetes patients, retrieved from the Health Population Database (Base Poblacional de Salud, BPS) at the Andalusian Health System (Spain). The synthetic dataset has been generated with a Dual Adversarial AutoEncoder (DAAE) approach. Two challenges are suggested on these data, although any other original analysis you may think will alse be wellcome:

1) Finding some strong relationships in diabetes-associated pathologies that allows to predict any pathology before this is diagnosed. Some well-known pathological diabetes consequences, which can be considered relevant endpoints to predict, can be: a) Retinopathy (Code “703”), b) Chronic kidney disease (Code “1401”), c) Ischemic heart disease (Code “910”), d) Amputations (Code “1999”)

2) Another proposed challenge is the prediction of disease trajectories in diabetes patients (see for example: Jensen et al. Nat Commun. 2014)

Prediction proposals which are submitted with the model trained and the code required to run the model can be tested on the real dataset by the organisers and participate in a collective publication.

Data download
Please read and accept the data download agreement for access.

Please sign up to announcements from the CAMDA general discussion forum for alerts.

We thank the Institute of Advanced Research in Artificial Intelligence for its support.

Anti-Microbial Resistance Prediction and Forensics

Antimicrobial resistance is one of the biggest challenges facing modern medicine. Because the management of COVID-19 was increasingly becoming dependent on pharmacological interventions, there is greater risk for accelerating the evolution and spread of antimicrobial resistance [1]. A study in a tertiary hospital environment revealed concerning colonisation patterns of microbes during extended periods [2]. It also highlighted the diversity of antimicrobial resistance gene reservoirs in hospitals that could facilitate the emergence and transmission of new modes of antibiotic resistance. This year we would like CAMDA community to look into AMR related challenges.

The goal will be to explore a metagenomic surveillance data from a selection of about 400 samples provided by MetaSUB International Consortium collected during global City Sampling Day 2016 and 2017 in several cities in US (Baltimore, Denver, Mineapollis, New York, Sacramento, San Antonio) and worldwide (Berlin, Bogota, Doha, Ilorin, Lisbon, Sao Paulo, Tokyo, Vienna, Zurich) to trace the AMR patterns.

A focus should be placed especially on AMR markers and resistance groups identified in about 150 isolates from hospital in one of abovementioned US cities collected in similar time. Can you tell which one?
As it was shown in the past CAMDA challenges an antibiotic resistance as functional biomarkers can accurately predict the origin of urban metagenomics samples.

Of course, you are welcome to use your imagination to carry out any side analysis that you would like, using the provided datasets.

Data download
Please read and accept the data download agreement for access.

Please sign up to announcements from the CAMDA metagenomics forum for alerts.

[1] E Afshinnekoo et al. COVID-19 drug practices risk antimicrobial resistance evolution. The Lancet Microbe (2021), 10.1016/S2666-5247(21)00039-2
[2] KR Chng et al. Cartography of opportunistic pathogens and antibiotic resistance genes in a tertiary hospital environment. Nature Medicine (2020), 1-11

PAST KEYNOTE SPEAKERS

Atul Butte, MD, PhD
Atul Butte, MD, PhD
Stanford University School of Medicine

Nikolaus Rajewsky, PhD
Nikolaus Rajewsky, PhD
Max-Delbrück-Center for Molecular Medicine

Terry Speed, PhD
Terry Speed, PhD
The Walter and Eliza Hall Institute of Medical Research

Sandrine Dupoit, PhD
Sandrine Dudoit, PhD
University of California, Berkeley

John Quackenbush, PhD
John Quackenbush, PhD
Harvard School of Public Health

Eran Segal, PhD
Eran Segal, PhD
Weizmann Institute of Science

John Storey, PhD
John Storey, PhD
Princeton University

Chris Sander, PhD
Chris Sander, PhD
Memorial Sloan Kettering Cancer Center

Temple F. Smith, PhD
Temple F. Smith, PhD
Boston University

Curtis Huttenhower, PhD
Curtis Huttenhower, PhD
Harvard School of Public Health

Christopher E. Mason, PhD
Christopher E. Mason, PhD
Weill Cornell Medicine

Lodewyk Wessels, PhD
Lodewyk Wessels, PhD
Netherlands Cancer Institute

Cesare Furlanello, PhD
Cesare Furlanello, PhD
Fondazione Bruno Kessler

Rita Colwell, PhD
Rita Colwell, PhD
University of Maryland

Nikos Cyrpides, PhD
Nikos Cyrpides, PhD
DOE Joint Genome Institute

Francesca Ciccarelli, PhD
Francesca Ciccarelli, PhD
Francis Crick Institute

IMPORTANT DATES
Call for Abstracts Opens23 January 2023
CAMDA Extended Abstracts Deadline18 May 2023
Late Posters Submissions Open18 May 2023
Poster Acceptance Notifications25 May 2023
CAMDA Acceptance Notification29 May 2023
CAMDA Conference26-27 July 2023
Click to save the dates!
CAMDA PARTNERS

Frontiers in Genetics

F1000Research

FDA

MetaSUB

Smashing Studio

ISMB 2023 MAIN EVENT
STAY CONNECTED