Research Project Title:
Semi-Supervised Learning for Clinical Phenotyping
abstract:My project aims to learn the phenotype of a patient, consisting of hundreds of different clinical state variables based on routinely collected, unlabeled, and “noisy” clinical data available in Electronic Medical Records (EMRs). These state variables help describe the patient and provide answers to high-level questions, such as whether the patient has an infection, an altered mental state, or a history of alcoholism, which are not obvious or directly observable from the clinical records. Obtaining these labeled descriptions for a supervised learning algorithm is often expensive and requires extreme efforts from clinicians. We hope to be able to efficiently and accurately infer these phenotypes from unlabeled clinical data in a semi-supervised manner, incorporating specialist expertise as guidance.
“Through this SuperUROP project, I want to gain more experience in health care research and make a positive contribution to my group. I have taken machine-learning courses and I want to expand on that knowledge with real-world applications, creating a social impact with my work. To me, even a 1 percent improvement in the field means a lot. I hope to publish a paper by the end of the SuperUROP project, if I have meaningful results to display.”