All Times EDT

Legend:
* = applied session       ! = JSM meeting theme

Activity Details


1
Sun, 8/2/2020, 12:30 PM - 3:30 PM Virtual
Invited E-Poster Session — Invited Poster Presentations
ASA, Text Analysis Interest Group
Chair(s): Xiaoyue Maggie Niu, Penn State University
01: An Innovative Model for Establishing Data Science Units in Academic Medical Centers
Manisha Desai, Stanford University; Mary Boulos, Stanford University
02: Partial Identification for Capture-Recapture Surveys
Jinghao Sun, Yale University; Forrest W. Crawford, Yale University
03: Methods for Inference of Microbial Interactions Using Copula Models with Mixture Margins
Rebecca Deek, University of Pennsylvania; Hongzhe Li, University of Pennsylvania
04: Explained Variance Decompositions for Mediation Effect Sizes with Multiple Exposures
Shanshan Zhao, NIEHS; Yue Jiang, Duke University; Jason Fine, University of North Carolina Chapel Hill
05: A Bayesian Model to Infer Contact Network Structure from HIV Epidemic Data
Nicole Carnegie, Montana State University
06: A Minimax Optimal Ridge-Type Set Test for Global Hypothesis with Applications in Whole Genome Sequencing Association Studies
Xihong Lin, Harvard TH Chan School of Public Health; Yaowu Liu, Harvard University
07: Probabilistic Forecasts of Arctic Sea Ice Thickness
Peter Gao, University of Washington; Adrian Raftery, University of Washington; Cecilia Bitz, University of Washington
08: Modeling the Marked Presence-Only Data: A Case Study of Estimating the Female Sex Worker Size in Malawi
Ian Laga, Penn State University; Le Bao, Penn State University; Xiaoyue Maggie Niu, Penn State University
09: Revealing Spatial Gene Patterns and Interactions in Mouse Brain via Stability-Driven NMF
Yu Wang, UC Berkeley; Reza Abbasi-Asl, UC San Francisco; Nathan Gouwens, Allen Institute; Zizhen Yao, Allen Institute; Bosiljka Tasic, Allen Institute; Hongkui Zeng, Allen Institute; Anton Arkhipov, Allen Institute; Bin Yu, UC Berkeley
10: Neyman-Pearson Classification Algorithms and NP Receiver Operating Characteristics
Jingyi Jessica Li, University of California, Los Angeles; Xin Tong, University of Southern California; Yang Feng, New York University
11: Bayesian Meta-Analysis of Censored Rare Events
Shouhao Zhou, Penn State University; Xinyue Qi, UT-MD Anderson Cancer Center; Christine B. Peterson, The University of Texas MD Anderson Cancer Center
12: Estimation of the Intensity Function of an Inhomogeneous Poisson Process with a Change-Point
Brendan Murphy, University College Dublin; Tin Lok James Ng, University of Wollongong
13: Achieving Differential Privacy with Elliptical Perturbations
Matthew Reimherr, Penn State University; Jordan Awan, Penn State University
14: A Bayesian Semiparametric Approach to Estimating a Bacterium’s Wild-Type Distribution and Prevalence Estimation: Accounting for Contamination and Measurement Error (BayesACME)
Will Eagan, Purdue University; Bruce A. Craig, Purdue University
15: Ethical Academic Collaboration from the Outside In
Kim Love, K. R. Love Quantitative Consulting and Collaboration
16: Addressing Measurement Error in Microbiome Data
David Clausen, Department of Biostatistics, University of Washington; Amy Willis, Department of Biostatistics, University of Washington
17: Sparse Functional Principal Component Analysis in High Dimensions
Xiaoyu Hu, Peking University; Fang Yao, Peking University
18: Consistently Estimating Graph Statistics Using Aggregated Relational Data
Tyler McCormick, University of Washington; Arun Chandrasekhar, Stanford University; Emily Breza, Harvard University; Mengji Pan, Mengjie Pan
19: Unsupervised Multi-Granular Word Segmentation and Medical Term Discovery via Graph Partition
Zheng Yuan, Tsinghua University; Yuanhao Liu, University of Michigan; Qiuyang Yin, Tsinghua University; Boyao Li, Tsinghua University; Sheng Yu, Tsinghua University
20: The Future of Precision Health
Michael R. Kosorok, University of North Carolina at Chapel Hill
21: The Role of Stratification in Sequential Monte Carlo
Wenshuo Wang, Harvard University; Jun S. Liu, Harvard University; Ke Deng, Tsinghua University; Yichao Li, Tsinghua University
22: Latent Space Hawkes Processes
Owen Ward, Columbia University; Jing Wu, Columbia University; Tian Zheng, Columbia University
23: GWEB-Anno: An Empirical-Bayes-Based Approach for Genetic Risk Prediction Using GWAS Summary Statistics and Functional Annotations
Wei Jiang, Yale University; Hongyu Zhao, Yale University
24: Teaching Data Science Using Team-Based Learning
Eric Alfred Vance, University of Colorado Boulder
25: Modeling Reciprocity: Designing a Collaborative Undergraduate Research Course
Emily H Griffith, North Carolina State University; Stephany Dunstan, North Carolina State University
26: Adaptive Testing of the Composite Null Hypotheses in Mediation Analysis
Yinqiu He, University of Michigan
27: Incorporating Prior Knowledge on Phenotyping Accuracy for Bias Reduction in Association Studies Using Electronic Health Records Data
Yong Chen, University of Pennsylvania
28: Learning Healthcare Delivery Network with Longitudinal Electronic Health Records Data
Jiehuan Sun, University of Illinois at Chicago; Katherine Liao, Harvard Medical School; Tianxi Cai, Harvard University
29: Estimating Causal Effects of Continuous Exposure: Comparing the Relative Performance of Commonly Used Methods
Donna L Coffman, Temple University; Beth Ann Griffin, RAND; Daniel F McCaffrey, ETS; Brian G Vegetabile, RAND Corporation
30: Estimating Association Between Baseline Covariates and Outcome Under Additive Nonignorable Missingness
Mauricio Sadinle, University of Washington; Chloe Krakauer, University of Washington
31: ISLE: An Integrated Learning (And Research) Environment for Statistics and Data Science
Rebecca Nugent, Carnegie Mellon University; Philipp Burckhardt, Carnegie Mellon University; Christopher R Genovese, Carnegie Mellon University
32: A Random Effects Stochastic Block Model for Joint Community Detection in Multiple Networks with Applications to Neuroimaging
Yuguo Chen, University of Illinois at Urbana-Champaign
33: A Study of Police Use of Force Using Novel Spatial Point Process Methodology
Claire E. Kelling, Penn State; Murali Haran, Pennsylvania State University; Aleksandra Slavkovic, Pennsylvania State University; Corina Graif, Penn State
34: Leveraging Auxiliary Information in Missing Data
Jerry Reiter, Duke University
 
 

17 * !
Mon, 8/3/2020, 10:00 AM - 11:50 AM Virtual
Technology Impact on Total Survey Error — Invited Papers
Survey Research Methods Section, Social Statistics Section, Government Statistics Section, Text Analysis Interest Group
Organizer(s): Stanislav Kolenikov, Abt Associates
Chair(s): Antje Kirchner, RTI International
10:05 AM Detecting Housing Units from Satellite Imagery Using Computer Vision
Stephanie Eckman, RTI International; Qiang Qiu, Purdue University; Tien-Yu Liu, Duke University
10:25 AM Automatic Coding of Open-Ended Questions: Does Double Coding of the Training Data Reduce the Error of Automatic Coding?
Zhoushanyue He, University of Waterloo; Matthias Schonlau, University of Waterloo
10:45 AM Total Survey Error and Geographic Information Systems
Ned English, NORC; Kevin Brown, NORC; Chang Zhao, NORC
11:05 AM Passive Data Collection Using Smartphones
Bella Struminskaya, Utrecht University
11:25 AM Discussant: Stanislav Kolenikov, Abt Associates
11:45 AM Floor Discussion
 
 

49
Mon, 8/3/2020, 10:00 AM - 2:00 PM Virtual
Statistical Measurements of Social Issues and Trends — Contributed Papers
Social Statistics Section, Text Analysis Interest Group
Chair(s): Antje Kirchner, RTI International
A Study of Spatial Misalignment with an Application to Police Use of Force
Claire Kelling; Murali Haran, Pennsylvania State University; Aleksandra Slavkovic, Pennsylvania State University; Corina Graif, Penn State
Estimating the Effect of Violence on Internal Displacement in Afghanistan Using Mobile Phone Metadata
Xiao Hui Tai, U.C. Berkeley; Joshua Blumenstock, U.C. Berkeley; Shikhar Mehra, U.C. Berkeley
Temporal Importance and Resilience of Topics: Inferring Polarity Through Topic Models
Shane Bookhultz, Virginia Tech; Scotland Leman, Virginia Tech; Shyam Ranganathan, Virginia Tech; James Hawdon, Virginia Tech; Tanushree Mitra, Virginia Tech
Racial Disparities in Policing: A Variance Decomposition Approach
Mikaela Meyer, Carnegie Mellon University; Amelia M Haviland, Carnegie Mellon University
Global Solutions to Host, Protect and Resettle Refugees
Allison Conners, ISR Foundation; Woosang Chang, ISR Foundation; Asaph Young Chun, Statistics Research Institute - Statistics Korea
 
 

53
Mon, 8/3/2020, 10:00 AM - 2:00 PM Virtual
Applications of Data Linkage and Machine Learning Techniques — Contributed Papers
Survey Research Methods Section, Text Analysis Interest Group
Chair(s): Jeffrey Gonzalez, Economic Research Service
Visualizing Text Mining Results of the Consumer Feedback Data
Shankang Qu, PepsiCo
Machine-Learning Algorithms to Improve Payment Imputation in the Medical Expenditure Panel Survey (MEPS)
Emily Mitchell, Agency for Healthcare Research and Quality (AHRQ); Chandler McClellan, Agency for Healthcare Research and Quality (AHRQ); Jerrod Anderson, Agency for Healthcare Research and Quality (AHRQ); Samuel H Zuvekas, Agency for Healthcare Research and Quality (AHRQ)
Estimating Linkage Errors Under Regularity Conditions
Abel Dasylva; Arthur Goussanou, Statistics Canada
Side Effect Reduction of Prior and Processed Information on Survey Design (Parts 1, 2 and 3)
Abdellatif Demnati, Independent Researcher
Addressing Optimization Challenges in Health Survey Research Using Multi-Objective Constrained Binary Particle Swarm Optimization
Di Xiong, UCLA SPH; Honghu Liu, UCLA
Using Data Science to Build Survey Sampling Frames from Scratch
Joseph Rodhouse, National Agricultural Statistics Service; Tyler Wilson, National Agricultural Statistics Service
 
 

67
Mon, 8/3/2020, 10:00 AM - 2:00 PM Virtual
Section on Statistical Computing: Data Science — Contributed Papers
Section on Statistical Computing, Text Analysis Interest Group
Chair(s): Samuel W.K. Wong, University of Waterloo
Evaluation Error Requirements for Generating Random Variates Using Dominated Rejection Algorithms
Timothy Hall, PQI Consulting
The Modified Directed Likelihood in High-Dimensions
Yanbo Tang, University of Toronto; Nancy Reid, University of Toronto
Defining Areas of Interest for Eye-Tracking Data: Implementing a Systematic Approach
Joanna Coltrin, Utah State University; Eric McKinney, Utah State University; Breanna Studenka, Utah State University; Juergen Symanzik, Utah State University
Tests of Equality of Several High-Dimensional Contingency Tables
Silvia Sharna, Bowling Green State University; Mian Adnan, Bowling Green State University; Asif Shams Adnan, Dhaka, Bangladesh; Rahmatullah Imon, Ball State University
Clinical Trials Data Sharing: Streamlined Process for Deidentification
Amaanti Sridhar, RTI International; Marie Gantz, RTI International
Positive Orthant Hyperspherical Distribution and Applications
Jose Guardiola, Texas A&M University-Corpus Christi
A Comparative History of Programming in Statistics and Data Science
Ben Barnard, Wells Fargo; Gabriel Odom, Florida International University
A Simple and Scalable Algorithm for Anomaly Identification with an Application in Credit Card Fraud Detection
Cheng Peng, West Chester University of Pennsylvania
 
 

74
Mon, 8/3/2020, 10:00 AM - 2:00 PM Virtual
Text Analysis in Machine Learning and Statistical Models — Contributed Papers
Section on Statistics in Defense and National Security, Text Analysis Interest Group, Section on Statistical Computing
Chair(s): Daniel Ries, Sandia National Laboratories
Finding the Source of Grandma’s Chili: Investigative Text Mining
Scott Wise, SAS Institute, Inc.
Zero-Inflated Beta Distribution Applied to Word Frequency and Lexical Dispersion in Corpus Linguistics
Brent Burch, Northern Arizona University; Jesse Egbert, Northern Arizona University
Dynamically Evolving Transformer Models for Article Tagging for Biosurveillance
Karl Pazdernik, Pacific Northwest National Laboratory; Samuel Dixon, Pacific Northwest National Laboratory; Daniel Farber, Pacific Northwest National Laboratory; Aaron Tuor, Pacific Northwest National Laboratory; Andrew Barker, Pacific Northwest National Laboratory; Elise Saxon, Pacific Northwest National Laboratory; Lauren Charles, Pacific Northwest National Laboratory
Naive Dictionary on Musical Corpora: From Knowledge Representation to Pattern Recognition
Qiuyi Wu, University of Rochester; Ernest Fokoue, Rochester Institute of Technology and SAMSI
Uncovering Biases in Off-The-Shelf Natural Language Processing Tools
Elizabeth Cary, Pacific Northwest National Laboratory; Lee Burke, Pacific Northwest National Laboratory; Madelyn Dunning, Pacific Northwest National Laboratory; Jill Brandenberger, Pacific Northwest National Laboratory; Michael Henry, Pacific Northwest National Laboratory; Karl Pazdernik, Pacific Northwest National Laboratory
Detecting Pharmaceutical Innovations in Text-Based Data Using Machine Learning
Devika Mahoney-Nair, University of Virginia; Gizem Korkmaz, University of Virginia; Gary Anderson, National Science Foundation; Neil Alexander, University of Virginia; Aaron Schroeder, University of Virginia; Sallie Ann Keller, Distinguished Professor in Biocomplexity, U of Virginia
 
 

75
Mon, 8/3/2020, 10:00 AM - 2:00 PM Virtual
Contributed Poster Presentations: Biometrics Section — Contributed Poster Presentations
Biometrics Section, Text Analysis Interest Group
1: Comparing Samples of Partially Paired Categorical Data
Philip Dixon, Iowa State University
2: A State-Space Approach for Analyzing Longitudinal Outcomes
Alicia Chua, Boston University School of Public Health; Yorghos Tripodis, Boston University School of Public Health
3: Drug Safety Evaluation Using Panel Count Model
Yizhao Zhou, Georgetown University; Ao Yuan, Georgetown University Medical Center; Ming Tan, Georgetown University
4: Arctangent Augmented Square Root Transformation
Mitchell J Rosen, Covance, Inc.
5: A Comparison of Ordinary Least Squares and Reduced Major Axis Regression
Krista Watts, United States Military Academy; Nicholas Clark, United States Military Academy; Diana Thomas, United States Military Academy
6: Designing an Intra-Subject Dose Escalation Clinical Trial for Evaluation of Acute Intermittent Hypoxia in Stroke Survivors
Elizabeth Gray, Northwestern University, FSM; Masha Kocherginsky, Northwestern University, FSM; Milap Sandhu, Northwestern University, AbilityLab; William Rymer, Northwestern University, AbilityLab
7: Predicting Ordinal Outcomes Incorporating Nonparametric Interactions
Yuting Lu, New York University, Department of Environmental Medicine; Yongzhao Shao, New York University School of Medicine
8: Reactogenicity Adverse Events: Collection, Analysis and Reporting Process in Vaccine Clinical Trials
Tulin Shekar, Merck
9: Detection Limits by Semiparametric Cumulative Probability Models
Yuqi Tian, Vanderbilt University; Bryan E Shepherd, Vanderbilt University; Chun Li, University of Southern California, Department of Preventive Medicine; Nathan James, Vanderbilt University
10: Modeling Simultaneous Responses with Nested Working Correlation and Bayesian Estimates for Longitudinal Data with Time-Dependent Covariates
Elsa Aimara Vazquez Arreola, Arizona State University; Jeffrey R Wilson, Arizona State University
11: Mining for Health: A Comparison of Word Embedding Methods for Electronic Health Records
Emily Getzen, University of Pennsylvania; Qi Long, University of Pennsylvania
12: Comparing Association of Preoperative Transrectal Ultrasound Prostate Weight with Prostate Weight Obtained After Radical Prostatectomy
Irene Helenowski, Northwestern University; Sean Sachdev, Northwestern University; Borko D Jovanovic, Northwestern University; Michael J Gurley, Northwestern University; Robin G Leikin, Northwestern University; William J Catalona, Northwestern University; Timothy M Kuzel, Rush University
13: Parameter Estimation for Stochastic Differential Equations Driven by Wiener and Poisson Noise
Charles Eugene Smith, North Carolina State University; Loren Cobb, Univ. of Colorado Denver
14: Copula Directional Dependence for Analyzing Metabolic Responses in Obese Rodents to Nutritional Strategies
Yoonsung Jung, Prairie View A&M University; Javad Barouei, Prairie View A&M University
15: Classification of Adverse Events from the Apple Heart Study Using Natural Language Processing
Rebecca Gardner, Stanford University, Quantitative Sciences Unit; Santosh Gummidipundi, Stanford University, Quantitative Sciences Unit; Mai Nguyen, Stanford Center for Clinical Research; Sushmitha Tallapalli, Stanford Center for Clinical Research; Vidhya Balasubramanian, Stanford University, Quantitative Sciences Unit; Justin Lee, Stanford University, Quantitative Sciences Unit; Ariadna Garcia, Stanford University, Quantitative Sciences Unit; Haley Hedlin, Stanford University, Quantitative Sciences Unit; Christoph Olivier, Stanford Center for Clinical Research; Justin Parizo, Stanford Center for Clinical Research; Kenneth Mahaffey, Stanford Center for Clinical Research; Marco Perez, Stanford Division of Cardiovascular Medicine; Mintu Turakhia, Stanford Division of Cardiovascular Medicine; Manisha Desai, Stanford University
16: An R Shiny Web Application Useful for Aiding Researchers in the Analysis of Eye Tracking Data
Nathan Varberg; Nathaniel Thom, Wheaton College
17: Effect of Different Error Rate Adjustments on Reproducibility
Scott Richter, UNC Greensboro; Melinda McCann, Oklahoma State University
18: Stochastic Curtailment Methods for Futility Testing in Single-Arm Clinical Trials with Time-to-Event Endpoint Using Weibull Distribution
Muhammad Waleed, University of Kansas Medical Center; Jianghua He, University of Kansas Medical Center; Milind A Phadnis, University of Kansas Medical Center
19: Neural Networks for Clustered and Longitudinal Data Using Mixed Effects Models
Francesca Mandel, University of Pennsylvania; Ian Barnett, University of Pennsylvania
20: A SAS Macro for Implementing a Group Sequential Design with Time-to-Event Endpoint Using the Generalized Gamma Distribution
Nadeesha Thewarapperuma, University of Kansas Medical Center ASA Student Chapter; Milind A Phadnis, University of Kansas Medical Center
22: Relaxing the Independence Assumption in Relative Survival Analysis: A Parametric Approach
Reuben Adatorwovor, University of North Carolina at Chapel Hill; Aurelien Latouche, Conservatoire National des Arts et Métiers; Jason Fine, University of North Carolina Chapel Hill
23: Survival Outcome Related High-Throughput Gene Expression Analysis Under Case-Cohort Design
Huining Kang, University of New Mexico; Lidong Wang, University of New Mexico
 
 

126 * !
Mon, 8/3/2020, 1:00 PM - 2:50 PM Virtual
Recent Advances in Bayesian Mixed Membership Modeling for Network, Longitudinal, and Multivariate Data — Topic Contributed Papers
Section on Bayesian Statistical Science, International Society for Bayesian Analysis (ISBA), IMS, Text Analysis Interest Group
Organizer(s): Elena A Erosheva, University of Washington; Gongjun Xu, University of Michigan
Chair(s): Tanzy Love, University of Rochester
1:05 PM Exponential Family Mixed Membership Models for Soft Clustering of Multivariate Data
Brendan Murphy, University College Dublin; Arthur White, Trinity College Dublin
1:25 PM A New Class of Mixed Membership Models for Educational Testing: Partial-Mastery Cognitive Diagnosis Models
Elena A Erosheva, University of Washington; Zhuoran Shang, University of Michigan; Gongjun Xu, University of Michigan
1:45 PM Multilevel Mixed Membership Stochastic Block Models
Tracy Sweet
2:05 PM Longitudinal Structural Mixed Membership Models for Estimating Latent Health Trajectories Using Administrative Claims Data
Zhenke Wu, University of Michigan; Mengbing Li, University of Michigan
2:25 PM Detectability Limits in Dynamic Networks with Link Persistency
Amir Ghasemian
2:45 PM Floor Discussion
 
 

145 * !
Tue, 8/4/2020, 10:00 AM - 11:50 AM Virtual
Statistics of Social Media — Invited Papers
Section on Statistical Computing, Social Statistics Section, Text Analysis Interest Group
Organizer(s): Juha Alho, University of Helsinki
Chair(s): Stanislav Kolenikov, Abt Associates
10:55 AM What Authors Reveal of Themselves in Internet Discussions?
Juha Alho, University of Helsinki
11:20 AM Discussant: Frauke Kreuter, University of Maryland, University of Mannheim & IAB
11:40 AM Floor Discussion
 
 

198
Tue, 8/4/2020, 10:00 AM - 2:00 PM Virtual
Census and Survey Data Analysis — Contributed Papers
Government Statistics Section, Text Analysis Interest Group
Chair(s): Ernest Davenport, University of Minnesota-College of Education & Human Development
Cost Analysis of Experimental Data Collection Methods Used by National Household Education Survey (NHES)
Kayla Varela, U.S. Census Bureau; Allison Zotti, U.S. Census Bureau
Coverage Estimation in the 2021 Census of England and Wales
Viktor Racinskij, Office for National Statistics
A Network-Based Analysis of International Refugee Migration Patterns Using GERGMs
Natallia Katenka, University of Rhode Island; Katherine Abramski, World Food Programme; Marc Hutchison, University of Rhode Island
 
 

200
Tue, 8/4/2020, 10:00 AM - 2:00 PM Virtual
New Developments in the Design of Experiments — Contributed Papers
Section on Physical and Engineering Sciences, Text Analysis Interest Group, Quality and Productivity Section
Chair(s): David Collins, Los Alamos National Laboratory
Optimal Sensor Placement for Finite Element Model Validation
Ethan Davis, North Carolina State University; Jonathan Stallrich, North Carolina State University; Munir Winkel, North Carolina State University; Peter Parker, NASA Langley Research Center; Ken Toro, NASA Langley Research Center
New Priors for Bayesian Analysis of Screening Designs
Michael McKibben, North Carolina State University; Jonathan Stallrich, North Carolina State University
Stop Treating Supersaturated Designs Like Other Screening Designs
Maria Weese, Miami University; Jonathan Stallrich, North Carolina State University; Byran Jay Smucker, Miami University (Ohio); David Edwards, Virginia Commonwealth University
The A-Criterion Is Better Than the D-Criterion for Screening Designs
Jonathan Stallrich, North Carolina State University; Katherine Moyer, North Carolina State University; Bradley Jones, JMP
Design Optimization Based on D-Optimality for Multiple Responses
Damola Akinlana, University of South Florida; Lu Lu, University of South Florida
Construction of Parallel Flats Designs
Robert Mee, University of Tennessee; Chunyan Wang, Nankai University
A Multitaper Spectrum Estimator for Unevenly Sampled Time Series
Aaron Springford
 
 

203
Tue, 8/4/2020, 10:00 AM - 2:00 PM Virtual
Contemporary Machine Learning — Contributed Papers
Section on Statistical Learning and Data Science, Text Analysis Interest Group
Chair(s): Michael Lavine, Army Research Office
Risk Minimization Under Sampling Bias Arising from Customer Interactions
Scott Rome, Comcast; Michael Kreisel, Comcast
An Optimal Approach in Adaptive Collection Reducing the Bias
Tong Wang
Transfer Learning for Auto-Coding Free-Text Survey Responses
Peter Baumgartner, RTI International; Amanda Smith, RTI International; Murrey Olmsted, RTI International; Dawn Ohse, RTI International; Bucky Fairfax, RTI International
Post-Hoc Mixture Models of the Best Linear Unbiased Predictors from Linear Mixed Effects Models to Classify Longitudinal Data with Haphazardly Spaced Intervals: A Simulation Study
Md Hossain, Nemours Biomedical Research, A.I. DuPont Children's Hospital; Benjamin E Leiby, Division of Biostatistics
Modeling Temporary Shocks with Latent Processes for High-Dimensional Demand Time Series
Benedikt Sommer, Maersk; Klaus Kähler Holst, Maersk Research & Data; Pierre Pinson, Technical University of Denmark
Real-Time Regression Analysis of Streaming Clustered Data Sets
Lan Luo, University of Michigan; Peter X.K. Song, University of Michigan
 
 

205
Tue, 8/4/2020, 10:00 AM - 2:00 PM Virtual
Applications of Machine Learning — Contributed Papers
Section on Statistical Learning and Data Science, Text Analysis Interest Group
Chair(s): Jennifer Green, Montana State University
Summarizing and Extracting Insights from Consumer Review Data
Jingting Hui, PepsiCo; Jason Parcon, PepsiCo
Comparison of Machine Learning Methods with Traditional Models for Use of Public Trial Registry Data to Predict Sites Needed and Time from Study Start to Primary Completion Date
Linghui Li, AstraZeneca; Gabriela Feldberg, AstraZeneca; Faisal Khan, AstraZeneca; Sandra Smyth, AstraZeneca; Karin Schiene, AstraZeneca
Comparing Machine Learning and Penalized Regression for Predicting Diabetic Kidney Disease Progression: Evidence from the Chronic Renal Insufficiency Cohort (CRIC) Study
Jing Zhang, Moores Cancer Center, University of California, San Diego; Tobias Fuhrer, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland; Brian Kwan, University of California, San Diego; Daniel Montemayor, Department of Medicine, University of Texas Health Science Center at San Antonio; Kumar Sharma, Department of Medicine, University of Texas Health Science Center at San Antonio; Loki Natarajan, University of California, San Diego
Opportunities and Challenges in the Use of Smartphone and Smartwatch-Based Step Count Measures in Studies of Physical Activity and Health
Briana Cameron, 23andMe; Teresa Filshtein Sonmez, 23andMe; Stella Aslibekyan, 23andMe; Robert Gentleman, 23andMe
Improving Productionized Insights in Machine Learning Models Through Data-Quality Quantification
Christopher Barbour, Atrium; Paul Harmon, Atrium; Eric Loftsgaarden, Atrium
 
 

301 *
Wed, 8/5/2020, 10:00 AM - 11:50 AM Virtual
Natural Language Processing Applications in Defense and National Security — Topic Contributed Papers
Section on Statistics in Defense and National Security, Text Analysis Interest Group, Section on Statistical Learning and Data Science, Section on Statistical Computing
Organizer(s): Joseph D Warfield, Johns Hopkins University Applied Physics Laboratory
Chair(s): Joseph D Warfield, Johns Hopkins University Applied Physics Laboratory
10:05 AM Neural Language Processing to Detect, Attribute, Characterize and Defend Against Digital Deception
Svitlana Volkova, Pacific Northwest National Laboratory
10:25 AM A Sliding Information Distance for Change Point Detection in Text or Audio
Richard Field, Sandia National Laboratories; Christina Ting, Sandia National Laboratories; Travis Bauer, Sandia National Laboratories
10:45 AM Few-Shot Learning for Text Applications: Exploring Authorship Identification with Small Data
Lauren Phillips, Pacific Northwest National Laboratory; Sarah Reehl, Pacific Northwest National Laboratory; Ana Usenko, Western Washington University
11:05 AM Classifying Documents Through the Use of Artificial Intelligence
Kelly Townsend, Johns Hopkins University, Applied Physics Laboratory; Alex Firpi, Johns Hopkins University Applied Physics Lab
11:25 AM Discussant: David Marchette, US Naval Surface Warfare Center Dahlgren Division
11:45 AM Floor Discussion
 
 

352 *
Wed, 8/5/2020, 10:00 AM - 2:00 PM Virtual
Section on Statistical Consulting CPapers 1 — Contributed Papers
Section on Statistical Consulting, Text Analysis Interest Group
Chair(s): Summer Han, Stanford University
Using Publicly-Available Data to Assess an Organization’s Research Scope
Peter John De Chavez, PepsiCo; Jingting Hui, PepsiCo
A Framework for Successful Statistical Consulting: A Concise, Field-Specific Handbook to Help Scientists Make Effective Use of Statistical Methods in Their Field
Wanchunzi Yu, Bridgewater State University; Irina Seceleanu, Bridgewater State University; Kevin Rion, Bridgewater State University
Methodological Challenges That Keep Us Statistically Sharp: Modeling Associations Between Repeated Patient and Physician Reported Outcomes
Naomi Brownstein, Moffitt Cancer Center
“But What About Interactions, Are Any of Those Significant?” Resolving the Collaborative Nightmares of Covariate Interactions Through Regularization
Ryan A. Peterson, University of Colorado; Joseph Cavanaugh, University of Iowa
Goals for Statistics and Data Science Collaborations
Eric Alfred Vance, University of Colorado Boulder
Evolving the Applied Statistics Value Proposition
Randy Bartlett
People Are “Buggy”: Challenges in Working with Other Humans
Michiko Wolcott, Msight Analytics
 
 

353 * !
Wed, 8/5/2020, 10:00 AM - 2:00 PM Virtual
Visual Stories That Count! — Contributed Papers
Section on Statistical Graphics, Text Analysis Interest Group
Chair(s): Shailaja Suryawanshi, Merck & Co., Inc.
Shutter Plot: A Visual Display of Summary Statistics Over a Scatter Plot
Mamunur Rashid, DePauw University, Mathematics Dept.; Jyotirmoy Sarkar, Indiana University-Purdue University Indianapolis
A Visual Story of the COVID-19 Outbreak and How Misinformation Spreads
Zoe Liu, Boston Scientific
Two-Sample Testing for Latent Distance Graphs with Unknown Link Functions
Yiran Wang, North Carolina State University; Minh Tang, NC State University; Soumendra Lahiri, Washington University in St. Louis
Creating an Interactive Visualization of General Social Survey Data: From Start to Finished Product
Nola du Toit, NORC at the University of Chicago; Edward Mulrow, NORC at the University of Chicago; Rene Bautista, NORC at the University of Chicago; Mu-Hsien Lee, NORC at the University of Chicago
Measuring the Significance of Text Source Using Visual Statistical Inference
Mahbubul Majumder, University of Nebraska at Omaha; Jim Rogers, University of Nebraska at Omaha
Interactive Spatiotemporal Data Visualization of the U.S. Public Library Performance
Xiaoyue Cheng, University of Nebraska at Omaha; Josie Gatti Schafer, University of Nebraska at Omaha; Amanda Vander Wal, University of Nebraska at Omaha; Ly Le, University of Nebraska at Omaha
APCoA: Covariate Adjusted Principal Coordinates Analysis
Yushu Shi, University of Texas MD Anderson Cancer Center; Liangliang Zhang, M.D. Anderson Cancer Center; Kim-Anh Do, The University of Texas MD Anderson Cancer Center; Christine B. Peterson, The University of Texas MD Anderson Cancer Center; Robert Jenq, The University of Texas MD Anderson Cancer Center
 
 

356
Wed, 8/5/2020, 10:00 AM - 2:00 PM Virtual
Statistical Learning: Methods and Applications — Contributed Papers
Section on Statistical Learning and Data Science, Text Analysis Interest Group
Chair(s): Scott Rome, Comcast
Combination of Optical Character Recognition and Natural Language Processing to Identify Patients with Sleep Apnea in EHR Data
Enshuo Hsu, University of Texas Medical Branch; Yong-Fang Kuo, University of Texas Medical Branch; Rizwana Sultana, University of Texas Medical Branch; Gulshan Sharma, University of Texas Medical Branch
Flexible Feature Selection and Cluster Analysis for Heterogeneous Data with Application to a Diffusion Tensor Imaging Study
Wanying Ma, Novartis Pharmaceuticals Company; Luo Xiao, North Carolina State University; Jaroslaw Harezlak, Indiana University
Genetic Algorithms for Feature Selection
Huanjun Zhang, Texas A&M University; Edward Jones, Texas A&M University
Towards an Adaptive Algorithm for Online Substance Use Episode Detection
Joshua Rumbut, University of Massachusetts Dartmouth, University of Massachusetts Medical School; Hua Fang, University of Massachusetts Dartmouth, University of Massachusetts Medical School
Common and Distinctive Pattern Analysis Between High-Dimensional Data Sets
Hai Shu, New York University; Zhe Qu, Tulane University
Achieving Impact with Data Science and Machine Learning in Drug Development
David Ohlssen, Novartis Pharmaceuticals
Automatic Identification and Classification of Different Types of Otitis from Free-Text Pediatric Medical Notes: A Deep-Learning Approach
Corrado Lanera, University of Padova
 
 

398 * !
Wed, 8/5/2020, 1:00 PM - 2:50 PM Virtual
Beyond Traditional Approaches: Evolving Artificial Intelligence and Machine Learning to Advance Clinical Research and Drug Development — Topic Contributed Papers
Biometrics Section, Biopharmaceutical Section, Section on Statistical Learning and Data Science, Text Analysis Interest Group
Organizer(s): Demissie Alemayehu, Pfizer Inc.
Chair(s): Birol Emir, Pfizer Inc.
1:05 PM Adaptive Online Machine Learning for Real-Time Individualized Forecasting in Clinician-AI Team
Rachael Phillips, University of California, Berkeley; Mark Van der Laan, University of California, Berkeley
1:25 PM Recent Advances in the Application of Natural Language Processing to Unstructured and Semi-Structured Data in the Pharmaceutical Industry
Peter Henstock, Pfizer Inc
1:45 PM Learning Decision Rules with Observational Data
Xinkun Nie, Stanford University; Stefan Wager, Stanford University
2:05 PM “Data Nuggets” Tools for Analyzing Big Data
Javier Cabrera, Rutgers University
2:25 PM Floor Discussion
 
 

400 * !
Wed, 8/5/2020, 1:00 PM - 2:50 PM Virtual
Multiple Aspects of Bayesian Model Selection and Variable Selection in Linear and Nonlinear Models — Topic Contributed Papers
Section on Bayesian Statistical Science, International Society for Bayesian Analysis (ISBA), International Indian Statistical Association, Text Analysis Interest Group
Organizer(s): Arnab K Maity, Pfizer
Chair(s): Arnab K Maity, Pfizer
1:05 PM High-Dimensional Bayesian Model Averaging for Gene Network Inference
Adrian Raftery, University of Washington
1:25 PM Generative Modeling of Compositional Count Data Based on the Dirichlet-Tree Model with Application to Microbiome Studies
Li Ma, Duke University; Jialiang Mao, Duke University
1:45 PM Bayesian Hierarchical Modeling for Process Optimization
Min Wang, The University of Texas at San Antonio; Linhan Ouyang, Nanjing University of Aeronautics and Astronautics; Chanseok Park, Pusan National University; Yan Ma, Nanjing University of Science and Technology; YiZhong Ma, Nanjing University of Science and Technology
2:05 PM Probabilistic Machine Learning for Uncertainty Quantification of Neutron Star Radius
Debdeep Pati, Texas A&M University; Anirban Bhattacharya, Texas A&M University; Yeunhwan Lim, Max Planck Institut für Kernphysik, Technische Universität Darmstadt; Jeremy Holt, Texas A&M University
2:25 PM Floor Discussion
 
 

455 * !
Thu, 8/6/2020, 10:00 AM - 11:50 AM Virtual
Big Data, Technology Platform and Digital Innovation with Measurable Impact — Topic Contributed Panel
Text Analysis Interest Group, Section on Statistical Learning and Data Science, Quantum Computing in Statistics and Machine Learning, Section on Statistical Computing
Organizer(s): Kelly Zou, Pfizer Inc
Chair(s): Kelly Zou, Pfizer Inc
10:05 AM Big Data, Technology Platform and Digital Innovation with Measurable Impact
Panelists: Siddhartha Dalal, Columbia University
Mike Henderson, SAS
Joseph Imperato, Pfizer
Stanislav Kolenikov, Abt Associates
Lourenco Miranda, Société Générale
Mike Porath, The Mighty
May Yamada-Lifton, SAS
11:45 AM Floor Discussion
 
 

576 * !
Thu, 8/6/2020, 3:00 PM - 4:50 PM Virtual
Matching Methods for Causal Inference with Emerging Data and Statistical Challenges — Topic Contributed Papers
Biometrics Section, ENAR, Section on Statistics in Epidemiology, Text Analysis Interest Group
Organizer(s): Shu Yang, North Carolina State University
Chair(s): Shu Yang, North Carolina State University
3:05 PM Individualized Matched Design of Observational Studies
Jose Zubizarreta
3:25 PM Double Score Matching Estimators of Average and Quantile Treatment Effects
Yunshu Zhang, North Carolina State University; Shu Yang, North Carolina State University
3:45 PM Estimation for Imputed Survey Data
Xiaofei Zhang, Iowa State University; Wayne A Fuller, Iowa State University
4:05 PM Causal Inference with Free-Text in Both Randomized and Observational Settings
Michael Baiocchi, Stanford University; Jordan Rodu, University of Virginia; Joshua Kravitz, Stanford University
4:25 PM Machine Learning Methods for Causal Inference from Complex Observational Data
Alexander Volfovsky, Duke University
4:45 PM Floor Discussion
 
 

585
Thu, 8/6/2020, 3:00 PM - 4:50 PM Virtual
Bayesian Neural Networks — Topic Contributed Papers
International Society for Bayesian Analysis (ISBA), Section on Bayesian Statistical Science, Section on Statistical Learning and Data Science, Text Analysis Interest Group
Organizer(s): Deborshee Sen, Duke University
Chair(s): Rudradev Sengupta, Janssen Pharmaceutical Companies of Johnson and Johnson, Beerse, Belgium
3:05 PM Bayesian Dimension Reduction Using Neural Networks
Deborshee Sen, Duke University; David Dunson, Duke University; Theodore Papamarkou, Oak Ridge National Laboratory
3:25 PM Bayesian Deep Net GLM and GLMM
Robert Kohn, University of New South Wales; Minh Ngoc Tran, The University of Sydney
3:45 PM Challenges in Bayesian Inference via Markov Chain Monte Carlo for Neural Networks
Theodore Papamarkou
4:05 PM Practical Bayesian Inference for Shallow CNNs in NLP
Jacob Hinkle, Oak Ridge National Lab; Devanshu Agrawal, Oak Ridge National Lab; Theodore Papamarkou, Oak Ridge National Laboratory
4:25 PM Floor Discussion
 
 

220205
Thu, 8/6/2020, 6:00 PM - 7:00 PM Virtual
Text Analysis Interest Group Business Meeting — Other Cmte/Business
Text Analysis Interest Group
Chair(s): Stanislav Kolenikov, Abt Associates
Zoom Web: https://qualtrics.zoom.us/j/8022580518

Dial-in One-Tap:
1-646-876-9923
8022580518#