年份 | 2017 |
学科 | 计算生物与生物信息学 Computational Biology and Bioinformatics |
国家/州 | United States of America |
SiteKey: A Novel Binding Site Predictor for Ordered Proteins Interacting with Intrinsically Disordered Proteins
Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) are segments of proteins that lack a defined tertiary structure, giving them the ability to interconvert among a range of conformations and folds. This structural plasticity adds to the complexity of predicting protein behavior and treating numerous diseases. For example, a key tumor suppressor protein p53 has significant regions of disorder and has been associated with lung, breast, and brain cancer. While we now have effective algorithms for predicting ordered binding partners of IDRs, an algorithm for identifying and characterizing binding sites on such proteins has remained elusive, despite some effort. Here we present a novel machine learning algorithm SiteKey — a random forest classifier, trained on features derived from both protein sequence and structure, capable of identifying these binding sites with 88.4% accuracy and an area under the ROC curve of 0.9441. These results should provide a new approach to rational drug design in which binding regions can specifically be targeted to prevent major diseases, including cancer.
高中生科研 英特尔 Intel ISEF
资讯 · 课程 · 全程指导
请扫码添加微信好友
英特尔国际科学与工程大奖赛,简称 "ISEF",由美国 Society for Science and the Public(科学和公共服务协会)主办,英特尔公司冠名赞助,是全球规模最大、等级最高的中学生的科研科创赛事。ISEF 的学术活动学科包括了所有数学、自然科学、工程的全部领域和部分社会科学。ISEF 素有全球青少年科学学术活动的“世界杯”之美誉,旨在鼓励学生团队协作,开拓创新,长期专一深入地研究自己感兴趣的课题。
>>> 实用链接汇总 <<<
· 数学 · 物理 · 化学 · 生物 · 计算机 · 工程 ·
Studies that primarily focus on the discipline and techniques of computer science and mathematics as they relate to biological systems. This includes the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavior, and social systems.
Computational Biomodeling (MOD): Studies that involve computer simulations of biological systems most commonly with a goal of understanding how cells or organism develop, work collectively and survive.
Computational Epidemiology (EPD): The study of disease frequency and distribution, and risk factors and socioeconomic determinants of health within populations. Such studies may include gathering information to confirm existence of disease outbreaks, developing case definitions and analyzing epidemic data, establishing disease surveillance, and implementing methods of disease prevention and control.
Computational Evolutionary Biology (EVO): A study that applies the discipline and techniques of computer science and mathematics to explore the processes of change in populations of organisms, especially taxonomy, paleontology, ethology, population genetics and ecology.
Computational Neuroscience (NEU): A study that applies the discipline and techniques of computer science and mathematics to understand brain function in terms of the information processing properties of the structures that make up the nervous system.
Computational Pharmacology (PHA): A study that applies the discipline and techniques of computer science and mathematics to predict and analyze the responses to drugs.
Genomics (GEN): The study of the function and structure of genomes using recombinant DNA, sequencing, and bioinformatics.
Other (OTH): Studies that cannot be assigned to one of the above subcategories. If the project involves multiple subcategories, the principal subcategory should be chosen instead of Other.
翰林课程体验,退费流程快速投诉邮箱: yuxi@linstitute.net 沪ICP备2023009024号-1