Abstract:
Disease computable phenotype is a data model designed to identify specific clinical conditions or characteristics, which automatically extracts information from clinical databases such as electronic health records through algorithms. Phenotypic data for rare diseases often reside in unstructured text. Due to the scarcity of rare disease cases, atypical symptoms, and insufficient physician experience, misdiagnosis and underdiagnosis rates remain high. In this context, the application of computable phenotype technology holds promise for improving the accuracy and efficiency of rare disease diagnosis. This article reviews the current research status, challenges, and opportunities of computable phenotype technology in biomedicine, particularly in the field of rare diseases, and proposes a development and validation framework for rare disease computable phenotypes, aiming to provide research and development insights for computable phenotypes to empower the diagnosis and treatment of rare diseases.