|
Professor, University of California, Berkeley
Bio: Haiyan Huang received her BS in Mathematics at Peking University in 1997, her PhD in Applied Mathematics at the University of Southern California in 2001. She did postdoc at Harvard University from 2001-2003. She is currently a Professor in the Department of Statistics at UC Berkeley. She served as Chair of the Statistics Department from July 2022 to June 2025, and as Director of the Center for Computational Biology at UC Berkeley from July 2019 to June 2022. She is a Fellow of the American Statistical Association (ASA) and the Institute of Mathematical Statistics (IMS).
Synthetic random heteropolymers (RHPs), composed of a predefined set of monomers, offer a promising strategy for creating protein mimicking materials with tailored biochemical functions. When designed appropriately, RHPs can replicate protein behavior, enabling applications in drug delivery, therapeutic protein stabilization, biosensing, tissue engineering, and medical diagnostics. However, designing RHPs that achieve specific biological functions in a time- and cost-effective manner remains a major challenge. In this talk, I will review this problem and discuss several successful efforts we have made to address it, using statistical, computational, and AI approaches. These include a generalized semi-hidden markov model (GSHMM) and a modified variational autoencoder (VAE) within a semi-supervised framework, designed to capture the structures of critical chemical features as well as individual RHP sequence patterns. These studies highlight the potential of computational approaches to accelerate the rational design of RHPs for a wide range of biological, medical, and healthcare applications.
Professor, Northeastern University
Bio: Professor David Madigan is the Provost and Senior Vice President for Academic Affairs at Northeastern University, where he is also a Professor of Statistics. Previously he served as Executive Vice President of Arts and Sciences and Dean of the Faculty at Columbia University, and as Chair of Columbia’s Department of Statistics. He has also held positions at AT&T Inc., Rutgers University, and the University of Washington.
His research spans Bayesian statistics, text mining, Monte Carlo methods, pharmacovigilance, and probabilistic graphical models. He holds a B.Sc. in Mathematical Sciences and a Ph.D. in Statistics from Trinity College Dublin. He is a Fellow of the American Statistical Association, the Institute of Mathematical Statistics, and the AAAS, and has served as Editor-in-Chief of Statistical Science.
The Bayesian approach to data analysis has become routine and the fierce Bayesian-frequentist debates of old have fizzled out. Authors have often lauded the simplicity and interpretability of the Bayesian approach and this has proven to be compelling in countless real-world applications. There are, however, some contexts in which Bayesian thinking provides extraordinary advantages - Bayesian magic if you will. I will describe three such contexts in some detail - zero-profiling for location estimation in wireless networks, automated informative priors in text categorization, and hierarchical association rule mining in healthcare.
US Food and Drug Administration (FDA)
Bio: For the past 27 years, Gene A. Pennello has been a review and research statistician at the US Food and Drug Administration (FDA), Center for Devices and Radiological Health (CDRH). At FDA, Gene’s official area of expertise is design and analysis of studies concerning diagnostic devices, with particular emphasis on drug-diagnostic co-development and evaluation of imaging systems. In 2020, Gene joined the Division of Imaging, Diagnostics, and Software Reliability (DIDSR), whose research includes addressing emerging challenges in regulatory assessment of artificial-intelligence(AI)-enabled medical devices. Gene is a fellow of the American Statistical Association (ASA).
Artificial intelligence (AI) is transforming health care by accelerating advances in medical product development and tools for augmenting the capabilities of health care practitioners. In particular, the number of FDA-approved, AI-enabled medical devices has exploded from a handful in 2010 to nearly 1250 today, with nearly 1000 in radiology and over 100 for cardiovascular use (https://www.fda.gov/medical-devices/software-medical-device-samd/artificial-intelligence-and-machine-learning-aiml-enabled-medical-devices). Approved AI-enabled medical devices include those intended for imaging acquisition and processing (e.g., segmentation, denoising, reconstruction), classification and diagnosis, early detection of life-threatening, time sensitive conditions (e.g., intracranial hemorrhage or sepsis), and long-term risk assessment (e.g., 5-year breast cancer risk). AI-enabled devices on the horizon include multi-modal or other large language model (LLM) software as a medical device (SaMD) for generating clinical reports intended for medical decision support, individualized decision support via digital twins, and robotics to assist with endoscopy or other critical and time sensitive medical procedures. Evaluation of the development and performance of AI-enabled medical devices can appear daunting for many reasons. For example, AI models underlying devices may be developed on a vast amount of clinical data and / or synthetic data, have a huge effective number of parameters, be unexplainable and uninterpretable, and be designed to generate complex output for an unfamiliar intended use. In this talk I will review recurring statistical issues with AI-enabled device development and performance evaluation. For these issues, I will attempt to simplify matters by applying general statistical principles with the hope of identifying either an existing statistical method for addressing the issue or a needed statistical innovation. Opportunities for statistical innovation in medical device AI, if seized, may lead to more predictable, transparent, and efficient practices and tools for development and evaluation, which in turn may shorten the expected review times of AI-enabled devices by regulators, thereby allowing patients and providers more timely access to these products.
DahShu 2025 ContactFor all general questions about the symposium, including program details, registration, and logistics: Email: dahshu2025@gmail.com | Our Social Networks |