AI-guided detection of antibiotic-resistantbacteria using resistance genes

dc.contributor.authorAerts, Erik
dc.contributor.departmentChalmers tekniska högskola / Institutionen för matematiska vetenskapersv
dc.contributor.examinerKristiansson, Erik
dc.contributor.supervisorKristiansson, Erik
dc.contributor.supervisorJohnning, Anna
dc.date.accessioned2024-07-03T11:54:39Z
dc.date.available2024-07-03T11:54:39Z
dc.date.issued2024
dc.date.submitted
dc.description.abstractAntibiotic resistance is threatening advancements made in modern medicine. Understanding the genomics behind multi-resistant profiles can assist in planning the correct treatment which can lower the abundance of antibiotic usage and hamper the vicious resistance cycle. Transformer-based AI models have shown state-of-the-art performance in understanding complex patterns in data. The thesis aimed to create a framework on how to implement transformers to predict bacterial resistance profiles by training on genomic data. The framework consisted of a transformer-based encoder and parallel classification networks for predicting antibiotic susceptibility. Each model trained on antibiotic resistance genes (ARGs) from Escherichia coli where a subset of isolates had recorded resistance profiles. The results showed that having a high complexity in the encoder is key for the model to accurately predict resistance to antibiotics where the occurrence of resistance is rare. This is relevant for any clinical setting, as models with less than 12 encoder blocks could not find these resistance profiles. The framework benefited from pretraining on unlabeled genomic data as performance generally increased. However, the type of masked language model pre-training which benefited the system more was situational and no conclusion was drawn. Finally, the thesis also found features in the data on which the models were basing decisions off on. The number of ARGs of an isolate was deemed the most influential feature in the data which relates to how much information the transformer can process. Following, relations between ARGs gyrA-D87N / parC-S80I and aph(3”)-Ib / aph(6’)-Id were shown to be an important decision basis for the models. Likewise, two point mutations of the pmrB gene also stood out as important ARGs in the decision-making processes for the models. The reasons why these ARGs are weighted highly by the models are currently unknown but are of interest to be studied further for a better understanding of underlying factors to multi-resistance.
dc.identifier.coursecodeMVEX03
dc.identifier.urihttp://hdl.handle.net/20.500.12380/308225
dc.language.isoeng
dc.setspec.uppsokPhysicsChemistryMaths
dc.subjectArtificial intelligence, antibiotic resistance, transformer, self-attention, embedding, encoder, pre-training, fine-tuning, masked language modelling.
dc.titleAI-guided detection of antibiotic-resistantbacteria using resistance genes
dc.type.degreeExamensarbete för masterexamensv
dc.type.degreeMaster's Thesisen
dc.type.uppsokH
local.programmeBiomedical engineering (MPBME), MSc
Ladda ner
Original bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
Master_Thesis_Erik Aerts_2024.pdf
Storlek:
7.09 MB
Format:
Adobe Portable Document Format
Beskrivning:
License bundle
Visar 1 - 1 av 1
Hämtar...
Bild (thumbnail)
Namn:
license.txt
Storlek:
2.35 KB
Format:
Item-specific license agreed upon to submission
Beskrivning: