Online Program Home
  My Program

All Times EDT

Legend:
* = applied session       ! = JSM meeting theme

Activity Details


220764
Tue, 8/3/2021, 3:00 PM - 4:00 PM
Text Analysis Interest Group's (TAIG) Annual Business Meeting — Other Cmte/Business
Text Analysis Interest Group
Chair(s): Kelly H. Zou, Viatris

Text Analysis Interest Group's (TAIG) Annual Business Meeting - Virtual
Start Time: 3:00 pm EDT
Join from PC, Mac, Linux, iOS or Android: https://qualtrics.zoom.us/j/92044195481
 
 

65 * !
Mon, 8/9/2021, 10:00 AM - 11:50 AM Virtual
Causal Inference with Latent Variables — Invited Papers
Mental Health Statistics Section, Health Policy Statistics Section, Section on Statistics in Epidemiology, Text Analysis Interest Group
Organizer(s): Booil Jo, Stanford University School of Medicine
Chair(s): Xiao-Li Meng, Harvard University
10:05 AM A Statistical Test to Reject the Structural Interpretation of a Latent Factor Model
Tyler J VanderWeele, Harvard University; Stijn Vansteelandt, Ghent University
10:25 AM Natural Language Processing Algorithms and Their Relationship to Latent Variable Modeling
Brian Lee Egleston, Fox Chase Cancer Center; Slobodan Vucetic, Temple University
10:45 AM The Deconfounder: What Is It? What Is Its Theory? Is it Useful?
David Blei, Columbia University; Yixin Wang, U.C. Berkeley
11:05 AM Discussant: Mark Van Der Laan, University of California
11:25 AM Discussant: Kosuke Imai, Harvard University
11:45 AM Floor Discussion
 
 

90
Mon, 8/9/2021, 10:00 AM - 11:50 AM Virtual
Novel Statistical Methods for COVID Pandemic and Other Current Health Policy Issues — Contributed Speed
Health Policy Statistics Section, Text Analysis Interest Group
Chair(s): Christine Mauro, Columbia University
10:05 AM Predicting Local Demand for COVID-19 Inpatient Care with a Bayesian Susceptible-Infectious-Hospitalized-Ventilated-Recovered Model
Stella Self, University of South Carolina; Rongjie Huang, University of South Carolina; Shrujan Amin, Care Coordination Institute; Joseph Ewing, Prisma Health; Caroline Rudisill, University of South Carolina; Alexander McLain, University of South Carolina
10:10 AM COVID-19 Spatiality: Regional Surrogate Synchrony in Case Infection Rates
Samantha Robinson, University of Arkansas
10:15 AM Assessment of 12 Months of COVID-19 Outcomes: Africa and Other Continents
Obafemi Adegoke. Keshinro, University of Lagos; Kazeem Osuolale, Nigerian Institute of Medical Research; Olukemi Abiodun Keshinro, University of Lagos
10:20 AM Predictive Model for Hospitalization Due to COVID-19 Presentation
Fei Han, University of Maryland Baltimore County; Ian Stockwell , University of Maryland Baltimore County; Morgan Henderson, University of Maryland Baltimore County; Lucy Wilson, University of Maryland Baltimore County; Zach Dezman, University of Maryland, Baltimore
10:25 AM Network Autoregression of the COVID Burden
Marten Thompson, University of Minnesota
10:30 AM Adherence to Medications Used to Manage Noncommunicable Diseases During and After the COVID-19 Era Presentation
Tarek A Hassan, Viatris; Jorge Enrique Saenz, Viatris; Danute Ducinskiene, Viatris; Joseph P Cook, Viatris; Joseph S. Imperato, Viatris (Former Employee); Kelly H. Zou, Viatris
10:35 AM Boeing Confident Travel Initiative Passenger Screening Model
Lindsay Waite Jones, Boeing; Ahmad Nahhas, Boeing; Jan Irvahn, Boeing; Grace S Garden, Boeing; William Ferng, Boeing; Elizabeth Killelea, Boeing; Jason W Armstrong, Boeing; Thomas Austin, Boeing; Stephen Jones, Boeing; Joshua J Cummins, Boeing
10:40 AM Using Text Mining, Natural Language Processing, and Text Networks to Describe Content and Sentiment of Organization Emails from a Children’s Hospital
Figaro Loresto, Children's Hospital Colorado; Nadia Shive, University of Colorado; Lindsey Tarasenko, Children's Hospital Colorado
10:45 AM Fairness and Bias in COVID-19 Social Distancing Complaints: Evidence from Large-Scale Mobility Data
Constantine E. Kontokosta, New York University; Boyeong Hong, New York University; Bartosz Bonczak, New York University
10:50 AM The Impact of Lockdown Timing on COVID-19 Transmission across US Counties
Xiaolin Huang, University of Victoria; Xiaojian Shao, National Research Council Canada; Xing Li, University of Saskatchewan; Yushan Hu, University of Victoria; Don D. Sin, St Paul’s Hospital and University of British Columbia; Xuekui Zhang, University of Victoria
11:00 AM Estimating the Association Between COVID-19 Vaccine Coverage and Deaths in Connecticut
Samantha G Dean, Department of Biostatistics, Yale School of Public Health; Olga Morozova, Program in Public Health, Stony Brook University; Matthew Cartter, Connecticut Department of Health; Amanda Durante, Connecticut Department of Health; Forrest W. Crawford, Yale University
11:05 AM A New Statistical Modeling Approach for Survival Analysis of Cancer Patients: Multiple Myeloma Cancer
Lohuwa Mamudu, University of South Florida; Chris P. Tsokos, University of South Florida
11:10 AM Fair Regression for Multiple Undercompensated Groups
Samson Mataraso, Stanford University; Anna Zink, Harvard University; Sherri Rose, Stanford University
11:15 AM Propensity Score Matched Cohort for Benchmarking Hospital Mortality Using Indirect Method of Standardization
Brenda Vincent, North Dakota State University ; Bong-Jin Choi, North Dakota State University
11:20 AM Assessing Classification Accuracy of Hospitals' Performance on Discrete Outcome
Zhou Lan, Yale School of Medicine; Zhenqiu Lin, Yale New Haven Hospital; Haiqun Lin, Rutgers University; Shu-Xia Li, Yale New Haven Hospital; Chengan Du, Yale School of Medicine
11:25 AM Statistical Modeling of Longitudinal Medical Cost Trajectory
Shikun Wang, The University of Texas MD Anderson Cancer Center; Jing Ning, The University of Texas MD Anderson Cancer Center; Ya-Chen Tina Shih, The University of Texas MD Anderson Cancer Center; Yu Shen, The University of Texas MD Anderson Cancer Center; Liang Li, The University of Texas MD Anderson Cancer Center; Ying Xu, The University of Texas MD Anderson Cancer Center
11:30 AM Sequential Pattern Mining of Electronic Health Record for Early Diagnosis of Amyotrophic Lateral Sclerosis
Ying Liu, Princeton Pharmatech; Cindy Liang, Texas Academy of Mathematics and Science; Lily Sun, Stanford OHS; Jeffery Zhang, Princeton Pharmatech
11:35 AM Historical Aspects of Statistical Approach to Dynamic of Neural Networks
Michael Fundator, Affl; National Academy of Sciences
11:40 AM Does Health Value and Practice Make Difference on Racial and Ethnic Disparities in Health Outcomes
Shuying Sha, Scho
11:45 AM Analysis of the Yearly Transition Function in Measles Disease Modeling
Carlo Stefan Davila-Payan, Centers for Disease Control and Prevention; Andrew Hill, Centers for Disease Control and Prevention; Xi Li, Centers for Disease Control and Prevention; Michael Lynch, Centers for Disease Control and Prevention; Sarah W. Pallas, Centers for Disease Control and Prevention
 
 

92
Mon, 8/9/2021, 10:00 AM - 11:50 AM Virtual
Time Series and Finance — Contributed Speed
Business and Economic Statistics Section, Text Analysis Interest Group
Chair(s): David S Matteson, Cornell University
10:05 AM Short-Term Forecasting with a Computationally Efficient Nonparametric Transfer Function Model
Jun M. Liu, Georgia Southern University
10:10 AM Community Network Auto-Regression for High-Dimensional Time Series
Elynn Y. Chen, University of California, Berkeley; Jianqing Fan , Princeton University; Xuening Zhu, Fudan University
10:15 AM The Hyperbolic Conditional Autoregressive Range (HYCARR) Model
Isuru Ratnayake, Kansas University Medical Center; V A Samaranayake, Missouri University of Science and Technology
10:20 AM Why Are Lumber Prices So High?
Matthew Arvanitis, USDA Forest Products Laboratory; Delton Alderman, USDA Forest Products Laboratory
10:25 AM Mutual Information, Granger Causality, and Point Processes
Victor Solo, UNSW, Sydney; Ahmed Pasha, Air University
10:30 AM Directional Accuracy of MMS Survey of Inflation-Output Forecasts: A ROC Analysis
yasemin ulu, SVSU
10:35 AM Uncovering Dynamic Relationships of FAANG+M Stock Prices
Yang Xue, North Carolina A&T State University; Seong-Tae Kim, North Carolina A&T State University
10:40 AM An Asymmetric Hyperbolic Generalized Autoregressive Conditional Heteroscedastic Model
K.C.M.R. Anjana Yatawara, Missouri University of Science and Technology; V A Samaranayake, Missouri University of Science and Technology
10:45 AM Information Content of Time Durations in the Limit Order Book
Zheting Zhu, University of Manitoba; Julieta Frank, University of Manitoba
10:50 AM The Specific Indirect Effect of Correspondence Audits: Moving from Research to Operational Application
Leigh Nicholl, The MITRE Corporation; Lucia Lykke, The MITRE Corporation; Max McGill, The MITRE Corporation; Alan Plumley, Internal Revenue Services
11:00 AM On a Quantile Autoregressive Conditional Duration Model Applied to High-Frequency Financial Data
Helton Saulo, University of Brasilia; Narayanaswamy Balakrishnan, McMaster University; Roberto Vila, University of Brasilia
11:05 AM Estimating Inequality Process Parameters from Corporate Market Capitalizations Presentation
John Angle, The Inequality Process Institute, LLC
11:10 AM COVID-19 and Auto Loan Origination Trends
Jose J Canals-Cerda, Federal Reserve
11:15 AM The 2020 Global Stock Market Crash: Endogenous or Exogenous?
Min Shu, University of Wisconsin Stout; Ruiqiang Song, Michigan Technological University; Wei Zhu, Stony Brook University
11:20 AM Improving Hedging Portfolios Using Machine Learning via Gaussian Process Hyperparameter Tuning
Zihao Chen, Iowa State University; Cindy Yu, Iowa State University
11:25 AM Mode Prediction and Hedging Portfolio Construction Based on Quantile Regression Through Machine Learning Methods
Guoliang Ma, Iowa State University; Cindy Yu, Iowa State University
11:30 AM Deciphering Federal Reserve Communication via Text Analysis of Alternative FOMC Statements
Taeyoung Doh, Federal Reserve Bank of Kansas City
11:35 AM Estimating Factors in Dynamic Equicorrelation Model
Raja Velu, Syracuse University; Zhaoque Zhou , Syracuse University
11:40 AM Option Pricing with Higher-order Stochastic Volatility Models
Md. Nazmul Ahsan, Concordia University; Jean-Marie Dufour, McGill University
11:45 AM Granger Causality Test in Predictive Conditional Modal Regression
Tae-Hwy Lee, University of California, Riverside; Yaojue Xu, University of California, Riverside
 
 

132
Mon, 8/9/2021, 1:30 PM - 3:20 PM Virtual
SLDS CSpeed 1 — Contributed Speed
Section on Statistical Learning and Data Science, Text Analysis Interest Group
Chair(s): Jaime Lynn Speiser, Wake Forest School of Medicine
1:35 PM Applications of Machine Learning Methods to Identify Pediatric Patients with De Novo Acute Myeloid Leukemia from a Real-World Data Set
Yimei Li, University of Pennsylvania
1:40 PM Estimation of a Distribution Function with Increasing Failure Rate Average
Ganesh B. Malla, University of Cincinnati-Clermont College
1:45 PM A Semiparametric Complementary Log-Log Model with Applications in Rare Event Mining
Cheng Peng, West Chester University of Pennsylvania; Kai Peng, Ningbo University of Technology
1:50 PM A Deep Convolutional Neural Network Approach for Predicting Cumulative Incidence Based on Pseudo-Observations
Pablo Gonzalez Gonzalez Ginestet, Karolinska Institutet; Philippe Weitz, Karolinska Institutet; Erin Gabriel, Karolinska Institutet; Mattias Rantalainen, Karolinska Institutet
1:55 PM Subgroup Identification of Elderly Bladder Cancer Patients Based on Mental and Physical Scores: A Clustering Approach
Mojgan Golzy, University of Missouri-Columbia; Katie Murray, University of Missouri-Columbia
2:00 PM Impact of Tweets Pre-Processing Techniques on a Dictionary for Environment
Camilla Salvatore, University of Bergamo; Daniele Toninelli, University of Bergamo; Michela Cameletti, University of Bergamo; Stephan Schlosser, Georg-August-Universität Göttingen
2:05 PM Can Machine Learning Improve Correspondence Audit Case Selection? Considerations for Algorithm Selection, Validation, and Experimentation
Lucia Lykke, The MITRE Corporation; Ben Howard, The MITRE Corporation; David Pinski, The MITRE Corporation; Alan Plumley, Internal Revenue Services
2:10 PM Applying Machine Learning Methods for Insight into Textile Recycling Behavior Presentation
Brandon King, North Carolina State University; Lori Rothenberg, North Carolina State University; Jeffrey Joines, North Carolina State University
2:15 PM Empirical Comparison of Multiplicative and Tree-Based Interaction Predictive Models
Chinedu Jude Nzekwe, North Carolina A&T State University; Sayed Mostafa, North Carolina A&T State University; Seong-Tae Kim, North Carolina A&T State University
2:20 PM On Kernel-Target Alignment and Relevant Dimensions in Kernel Feature Spaces Ensuing from the Decision and Regression Tree Ensembles
Dai Feng, AbbVie Inc.; Richard Baumgartner, Merck Research Laboratories
2:30 PM Efficient Semi-Supervised Deep Learning and Machine Learning NLP System to Extract Clinical Measurements from Polysomnogram Laboratory Reports
Ioannis Malagaris, University of Texas Medical Branch; David En Shuo Hsu, University of Texas Medical Branch; Yong-fang Kuo, University of Texas Medical Branch
2:35 PM A Novel Regularized Neuro-Fuzzy Model for Chronic Diseases Outcome Prediction in Longitudinal Dietary Studies
Venkata Sukumar Gurugubelli, University of Massachusetts Dartmouth; Hua Fang, University of Massachusetts Dartmouth
2:40 PM Predicting Nursing Graduates Using Machine Learning Models
Xiaoyue Cheng, University of Nebraska at Omaha; Li Hannaford, Creighton University; Mary Kunes-Connell, Creighton University
2:45 PM Classification of Distinct Trajectories in Longitudinal Data with Irregularly Spaced Intervals: A Large Data Application of Post-Hoc Mixture Modeling of BLUPs from Mixed Models
Md Jobayer Hossain, Nemours Children's Health System; Benjamin E Leiby, Thomas Jefferson University
2:50 PM Worldwide Statistics Without Borders and Client to Consultant Bridge Collaboration: Statistical Storytelling in the Time of COVID
Michal Czapski, Statistics Without Borders; Joshua Derenski, Statistics Without Borders; Stephen Godfrey, Statistics Without Borders; Michelle Vanchu-Orosco, Greater Victoria Coalition to End Homelessness; SWB
2:55 PM Multiomics-Based Tensor Decomposition for Breast Cancer Subtyping
Qian Liu, University of Manitoba; Bowen Cheng, University of Toronto; Pingzhao Hu, University of Manitoba
3:00 PM A Prediction Model Method for Optimizing Appointment Overbooking in Health Care Clinics Using Electronic Health Care Record Data
Nathaniel O'Connell, Wake Forest School of Medicine; Joseph Skelton, Wake Forest School of Medicine
3:05 PM What Can Public Data Tell Us About the Quality of Life in Rural Small Towns?
Denise Bradford, University of Nebraska- Lincoln; Susan VanderPlas, University of Nebraska - Lincoln
3:10 PM What Makes You Unique?
Ben Seiler, Stanford University; Art Owen, Stanford University; Masayoshi Mase, Hitachi, Ltd
 
 

164
Tue, 8/10/2021, 10:00 AM - 11:50 AM Virtual
Social Statistics Speed Session — Contributed Speed
Social Statistics Section, Text Analysis Interest Group
Chair(s): Robin Mejia, Carnegie Mellon University
10:05 AM Retrospective Causal Inference via Matrix Completion, with an Evaluation of the Effect of European Integration on Cross-Border Employment
Jason Poulos, Duke University and SAMSI; Andrea Albanese, Luxembourg Institute of Socio-Economic Research (LISER); Andrea Mercatanti, Bank of Italy; Fan Li, Duke University
10:10 AM Multivariate Small Area Estimation for Continuous and Binary Outcomes: Mapping Collective Efficacy and Crime Prevalence in London
Angelo Moretti, Manchester Metropolitan University; David Buil-Gil, University of Manchester
10:15 AM A Spatio-Temporal Analysis of College Crime in the USA
Fatih Gezer, University of Leeds; Xiaoke Zhang, George Washington University
10:20 AM Embedding Regression: Models for Context-Specific Description and Inference in Political Science
Arthur Spirling, New York University; Pedro Rodriguez, Vanderbilt University; Brandon Stewart, Princeton University
10:25 AM On the Causal Effect of Flipping the Party Order of Candidates in North Carolina’s Nonpartisan Elections
Alessandro Arlotto, Duke University; Alexandre Belloni, Duke University; Fei Fang, Duke University; saša peke?, Duke University
10:30 AM Modeling and Understanding Polarization in News Media Using Dynamic Topic Models Presentation
Shane Dakota Bookhultz, Virginia Tech; Scotland Leman, Virginia Tech; Shyam Ranganathan, Virginia Tech; James Hawdon, Virginia Tech
10:35 AM Polling Errors and Election Forecasting: A Not-So-Hidden Markov Model Presentation
Graham Tierney, Duke Univeristy; Alexander Volfovsky, Duke University
10:40 AM Investigating Similarities and Differences Across States in DVC Scores for Evaluating Redistricting Plans
Thomas R. Belin, UCLA FSPH; Cameron T. Rider, C-FIT Redistricting (Coalition for Fairness, Integrity, and Transparency in Redistricting)
10:45 AM Robustness of DVC Scores to Alternative Density-Variation and Compactness Measures in Evaluating Redistricting Plans
Cameron T. Rider, C-FIT Redistricting (Coalition for Fairness, Integrity, and Transparency in Redistricting); Thomas R. Belin, UCLA FSPH
10:50 AM Modeling Popular Music Genre Preferences Over Time
Aimée M. Petitbon, University of South Carolina; David B. Hitchcock, University of South Carolina
11:00 AM Is Automated Driving Safer? An Application of Survival Analysis in Automated Vehicle Safety Evaluation
Soheil Sohrabi, Texas A&M University; Bahar Dadashova, Texas A&M Transportation Institute; Dominique Lord, Texas A&M University
11:05 AM A Hybrid Approach for Traffic Crash Identification Using Deep Learning and Xgboost
Liang Shi, Virginia Tech; Chen Qian, Virginia Tech; Yanran Wei, Virginia Tech; Feng Guo, Virginia Tech
11:10 AM Evaluating Risk of Eye Glance Patterns by Embedding Based Kernel Two Sample Test Presentation
Chen Qian, Virginia Tech; Jingbin Xu, Virginia Tech; Feng Guo, Virginia Tech
11:15 AM Disparities in Ride-Hailing Usage in the US
Wenjian Jia, University of Virginia; Donna Chen, University of Virginia
11:20 AM Bayesian Criterion-Based Assessments of Recurrent Event Models with Applications to Commercial Truck Driver Behavior Studies
Yiming Zhang, University of Connecticut; Ming-Hui Chen, UCONN; Feng Guo, Virginia Tech
11:25 AM Unified Latent Class Modeling of Score and Rank Data Applied to Grant Panel Review
Michael Pearce, University of Washington; Elena Erosheva, University of Washington; Steven Gallo, American Institute of Biological Sciences
11:30 AM Causal Effect Modification: A Case Study with BPS Data
Bryan Keller, Teachers College, Columbia University
11:35 AM Exact Privacy Guarantees for Sampling Algorithms Implementing the Exponential Mechanism
Jeremy Seeman, Penn State University; Matthew Reimherr, Penn State University; Aleksandra Slavkovic, Penn State University
11:40 AM Unbiased Statistical Estimation and Valid Confidence Intervals Under Differential Privacy
Christian Covington, University of Waterloo; Xi He, University of Waterloo; Gautam Kamath, University of Waterloo; James Honaker, Facebook and Harvard University
11:45 AM Bayesian Estimation of Program-Specific Impacts in the HPOG Program
Stas Kolenikov, Abt Associates; David Ross Judkins, Abt Associates
 
 

167
Tue, 8/10/2021, 10:00 AM - 11:50 AM Virtual
Data Mining and Econometrics — Contributed Speed
Business and Economic Statistics Section, Text Analysis Interest Group
Chair(s): Jonathan R Bradley, Florida State University
10:05 AM Locally Stationary Quantile Regression for Inflation and Interest Rates
Seonjin Kim, Miami University; Zhuying Xu, Indeed Inc; Zhibiao Zhao, Penn State University
10:10 AM Comparison of US Median Family Income with Canada Through Simple Tables and Age-Period-Cohort Models with Interesting Stories Behind
Wenjiang Fu, University of Houston; Li Gan, Texas A&M University; Jiming Jiang, University of California, Davis
10:15 AM Identification of Linear Rational Expectations Models with Exogenous Variables
Peter Zadrozny, Bureau of Labor Statistics
10:20 AM Income Distribution Determinants: A Compositional Data Approach
Rafiq Hijazi, Zayed University
10:25 AM Statistical Inference for Noisy Matrix Completion Incorporating Auxiliary Information
Shujie Ma, University of California, Riverside; Yinchu Zhu, Economics, Brandeis University; PoYao Niu, Department of Statistics, University of California, Riverside
10:30 AM An Application of the LASSO Regression to Assess Poverty on ECOWAS Countries
Brian William Sloboda, Uop, Depart of Labor; Dennis Pearson , Austin Peay State University
10:35 AM Ranking Interestingness Scores for Overdispersed and Heteroskedastic Data at Scale
Serge Sverdlov, Microsoft Corporation
10:40 AM Detecting and Measuring Product Innovation in News Articles Using Google’s BERT
Neil Kattampallil, Biocomplexity Institute, University of Virginia; Gizem Korkmaz, University of Virginia; Gary Anderson, National Center for Science & Engineering Statistics, National Science Foundation
10:45 AM Structural Breaks in Seemingly Unrelated Regression Models
Shahnaz Parsaeian, University of Kansas
10:50 AM Identification and Estimation of Demand in Large Concentrated Markets
Saman Banafti, UC Riverside; Tae-Hwy Lee, University of California, Riverside
11:00 AM Likelihood Specification in Simultaneous Equation Models for Discrete Data
Angela Vossmeyer, Claremont McKenna College; Ivan Jeliazkov, University of California, Irvine
11:05 AM Identifying Number of Factors in Dynamic Factor Models Contributing to GDP Nowcasting: Bayesian Approach with Horse Shoe Shrinkage
Jiayi Luo, Department of Statistics, Iowa State University; Cindy Yu, Iowa State University
11:10 AM Generating Differentially Private Synthetic Heavy-Tailed Data
Tran Tran, Pennsylvania State University; Matthew Reimherr, Penn State University; Aleksandra Slavkovic, Penn State University
11:15 AM Using Real Estate Data to Improve Business Student Interpretation in Regression
Mitra Lal Devkota, University of North Georgia ; Eric B Howington, Valdosta State University
11:20 AM Improving Weight Representivity of Fixed Quantity Consumer Price Index Products
Joshua Klick, Bureau of Labor Statistics
11:25 AM Shapley-Value-Based Feature Attribution for Risk-Utility Tradeoff in Data Privacy
Francis Bilson Darku, University of Notre Dame; Xinxue Qu, University of Notre Dame; Hong Guo, University of Notre Dame
11:30 AM Are Respondents Changing the Way They Self-Select Their Industry Due to the COVID-19 Pandemic?
Sania Khan, US Bureau of Labor Statistics; Emily Thomas, US Bureau of Labor Statistics
11:35 AM Email Solicitation for the Multiple Worksite Report (MWR) During the Pandemic
Kelly Quinn, U.S. Bureau of Labor Statistics; Emily Thomas, US Bureau of Labor Statistics
11:40 AM Latent Class Modeling of Passenger Airfares in the US Airline Industry
Neela D Manage, Florida Atlantic University
 
 

246
Wed, 8/11/2021, 10:00 AM - 11:50 AM Virtual
Data Science — Contributed Speed
Section on Statistical Computing, Text Analysis Interest Group
Chair(s): Catherine Durso, University of Denver
10:05 AM Extensions to the Syrjala Test with Eye-Tracking Data Analysis Applications Presentation
Eric D. McKinney, Utah State University; Jürgen Symanzik, Utah State University
10:10 AM Smooth Ridge Model for Computer Experiments
Asma Farid, QUEEN MARY UNIVERSITY OF LONDON
10:15 AM A Sequential Discrimination Procedure for Two Almost Identically Shaped Real Distributions
Silvey Shamsi, Ball State University; Mian Arif Shams Adnan, Bowling Green State University
10:20 AM Missing Value Estimation for High-Dimensional Data
Mian Arif Shams Adnan, Bowling Green State University; Silvia Irin Sharna, Bowling Green State University
10:25 AM From Research to Deployment and Back: A Computational Framework for Reproducibility and Replicability from Industry Presentation
Sergiy O Nesterko, Fidelity Investments
10:30 AM Creating a Data-Driven Taxonomy
Randall Powers, Bureau of Labor Statistics; Wendy Martinez, Bureau of Labor Statistics; Terrance Savitsky, Bureau of Labor Statistics
10:35 AM Statistical Analysis on Factors Influencing Life Expectancy
Meichen Huang, The University of Texas at Dallas; Akash Roy, Duke University
10:40 AM The Delta-Spherical Dirichlet Distribution and Applications
JOSE H GUARDIOLA, Texas A&M University Corpus Christi
10:45 AM Optimized Data Discretization
Rita Chattopadhyay, Intel Corp
10:50 AM Discriminant Analysis Using Quantile Classifier for Corrupted Label Data
Masaaki Okabe, Doshisha University; Hiroshi Yadohisa, Doshisha University
11:00 AM Finite Mixture of Birnbaum-Saunders Distributions Using the K-Bumps Algorithm
Luis Benites, Pontificia Universidad Católica del Perú; Rocío Maehara, Universidad del Pacífico; Filidor Vilca, Universidade Estadual de Campinas; Fernando Marmolejo-Ramos, University of South Australia, Adelaide, Australia
11:05 AM Fast Model Order Identification for Big Time Series Data Presentation
Brian Guangshi Wu, Oakland University; Dorin Drignei, Oakland University
11:10 AM Novel Modeling of High-Frequency Stock Trading Data
Yuying Huang, University of Victoria; Ke Xu, University of Victoria; Xuekui Zhang, University of Victoria; Li Xing, University of Saskatchewan
11:15 AM Fused Mean Structure Learning in Data Integration with Dependence
Emily C Hector, North Carolina State University
11:20 AM Backfitting for Large-Scale Binary Regressions with Crossed Random Effects
Swarnadip Ghosh, STANFORD UNIVERSITY; Trevor JOHN Hastie, STANFORD UNIVERSITY; Art Owen, Stanford University
11:25 AM On the Final Solution of the Jeffreys-Lindley Paradox Presentation
Miodrag Lovric, Radford University
11:30 AM Sample size determination for stepped-wedge cluster randomized trials with imbalance enrollment using a Shiny app
Zhuopei Hu, University or Arkansas for Medical Sciences; Ruofei Du, University or Arkansas for Medical Sciences; Songthip T Ounpraseuth, University or Arkansas for Medical Sciences
11:35 AM Network Visualization and Analysis on T and B Cell Receptors from SARS-CoV-2 Patients
Hai Yang, UCSF; Li Zhang, UCSF; Zenghua Fan, UCSF; Tao He, San Francisco State University; Lawrence Fong, UCSF
 
 

264 !
Wed, 8/11/2021, 1:30 PM - 3:20 PM Virtual
Frontiers of High-Dimensional Statistics — Invited Papers
IMS, Text Analysis Interest Group
Organizer(s): Pragya Sur, Harvard University
Chair(s): Zhou Fan, Yale University
1:35 PM New Estimates of the Wasserstein Distance Between Document-Generating Distributions in Topic Models
Florentina Bunea, Cornell University
2:00 PM Risk Estimation Under High-Dimensional Asymptotics
Arian Maleki, Columbia University; Kamiar Rahnamad rad, City University of New York; Wenda Zhou, Flat iron institute
2:25 PM Second-Order Stein: SURE for SURE and Other Applications
Cun-Hui Zhang, Rutgers University
2:50 PM Discussant: Pragya Sur, Harvard University
3:15 PM Floor Discussion
 
 

280 *
Wed, 8/11/2021, 1:30 PM - 3:20 PM Virtual
Application of Machine Learning in Clinical Development — Topic-Contributed Papers
Biopharmaceutical Section, Section on Statistical Learning and Data Science, International Chinese Statistical Association, Text Analysis Interest Group
Organizer(s): Dooti Roy, Boehringer Ingelheim Pharmaceuticals, Inc.; Nan Shao, Boehringer Ingelheim Pharmaceuticals, Inc.
Chair(s): Zheng Zhu, Boehringer Ingelheim Pharmaceuticals, Inc.
1:35 PM Application of Digital Medicine in Drug Development
Sandeep M Menon, Pfizer; Tim McCarthy, Pfizer; F. Isik Karahanoglu, Pfizer Global Research and Development
1:55 PM Predicting Patient Adherence in a Changing World
Dooti Roy, Boehringer Ingelheim Pharmaceuticals, Inc.
2:15 PM Automatic Disease Screening of Borderline Personality Disorder Using Electronic Health Records (EHR)
Nan Shao, Boehringer Ingelheim Pharmaceuticals, Inc.; Marianne Goodman, Icahn School of Medicine at Mount Sinai; James J Peters VA Medical Center; Chengxi Zang, Weill Cornell Medicine, Cornell University; Zheng Zhu, Boehringer Ingelheim Pharmaceuticals, Inc.; Zsuzsanna Tamas, Boehringer Ingelheim; Rachel Ovens, Boehringer Ingelheim; Agnes Koczon-Jaremko , Boehringer Ingelheim; Vikas_Mohan Sharma, Boehringer Ingelheim
2:35 PM Application of Natural Language Processing in Drug Development
Hua Xu, The University of Texas Health Science Center at Houston
2:55 PM Floor Discussion
 
 

283 * !
Wed, 8/11/2021, 1:30 PM - 3:20 PM Virtual
Statistical Analysis on Social Media Misinformation Campaigns — Topic-Contributed Papers
Social Statistics Section, Scientific and Public Affairs Advisory Committee, Text Analysis Interest Group
Organizer(s): Edward Kao, Lincoln Laboratory at Massachusetts Institute of Technology
Chair(s): Edward Kao, Lincoln Laboratory at Massachusetts Institute of Technology
1:35 PM Understanding and Reducing the Spread of Misinformation Online: Evidence from Lab and Field Experiments
Gordon Pennycook, University of Regina; Ziv Epstein, Massachusetts Institute of Technology; Mohsen Mosleh, University of Exeter Business School; Antonio A. Arechar, Massachusetts Institute of Technology; Dean Eckles, Massachusetts Institute of Technology; David G. Rand, Massachusetts Institute of Technology
1:55 PM Assessing the Russian Internet Research Agency’s Impact on the Political Attitudes and Behaviors of American Twitter Users in Late 2017
Brian Guay, Duke University; Alexander Volfovsky, Duke University; Christopher Bail, Duke University; Sunshine Hillygus, Duke University; Emily Maloney, Duke University; Friedolin Merhout, University of Copenhagen; Aidan Combs, Duke University; Deen Freelon, University of North Carolina, Chapel Hill
2:15 PM The Low Hanging Fruit of the Twitter Following Graph
Alex Hayes, University of Wisconsin-Madison; Karl Rohe, University of Wisconsin-Madison
2:35 PM Impact Estimation of Disinformation Narratives Using Network Causal Inference
Steven Thomas Smith, MIT Lincoln Laboratory; Edward Kao, Lincoln Laboratory at Massachusetts Institute of Technology; Erika Mackin, MIT Lincoln Laboratory; Danelle Shah, MIT Lincoln Laboratory; Olga Simek, MIT Lincoln Laboratory; Donald B. Rubin, Temple University
2:55 PM Floor Discussion
 
 

287
Wed, 8/11/2021, 1:30 PM - 3:20 PM Virtual
Classroom Teaching and Pedagogy — Contributed Speed
Section on Statistics and Data Science Education, Section on Teaching of Statistics in the Health Sciences, Text Analysis Interest Group
Chair(s): Jamis Perrett, Bayer U.S.
1:35 PM Statistical Literacy Approved for General Education at the University of New Mexico
Milo Schield, University of New Mexico
1:40 PM R Is for Rhyme? Statistics Class 'Stanza Part' with Poetry!
Lawrence Mark Lesser, The University of Texas at El Paso
1:45 PM Alpha Seminar: A Course for New Graduate Students in Statistics
Christopher R Bilder, University of Nebraska-Lincoln
1:50 PM 'Quick Quizzes': A Simple Yet Powerful Learning Tool
Elizabeth Jennings McGuffey, Rice University
1:55 PM Encouraging and Enhancing the Power of Growth Mindset in Statistics Education
Ramadha Piyadi Gamage, Western Washington University - Mathematics
2:00 PM Incorporating Your Own Research in an Introductory Statistics Courses: A Case Study with Podcasts, Web Scraping, and Natural Language Processing
Benjamin Williams, University of Denver; Alyssa Williams, University of Denver
2:05 PM Resources for Teaching Causality in Introductory Statistics Courses
Kevin Cummiskey, West Point; Krista Watts, West Point; Robert Lasater, West Point
2:10 PM WITHDRAWN A Comparison of Online Versus In-Person Examination During the COVID-19 Pandemic
Jeffrey Landgren, University of North Georgia
2:15 PM Education and Pedagogy of Domain-Specific Learning Materials Using Learning Personas
Daniel Chen, Virginia Tech; Anne Brown, Virginia Tech
2:20 PM From Click to Code: Insights from Instructors and Students on Moving from SPSS to R in a Graduate-Level Applied Statistics Course
Anthony Schmidt, University of Tennessee, Knoxville; Louis M Rocconi, University of Tennessee, Knoxville; Austin Boyd, University of Tennessee, Knoxville
2:30 PM Ideas Toward Inclusion: Course Policies, Daily Questions, Biographies
Matthew A Hawks, US Naval Academy
2:35 PM Using Randomized Experiments and Common Midterm Questions to Elucidate Student Learning Trajectories from Simulation-Based Inference Curricula
Beth Chance, Cal Poly - San Luis Obispo; Soma Roy, Cal Poly - San Luis Obispo; Karen McGaughey, Cal Poly - San Luis Obispo; Nathan Tintle, Dordt University; Todd Swanson, Hope College; Jill VanderStoep, Hope College
2:40 PM A Second Course in Data Science
Rosanna Overholser, Oregon Institute of Technology; Cristina Negoita, Oregon Institute of Technology
2:45 PM How to Implement and Assess Open-Ended Projects in an Introductory Statistics Classroom Presentation
Allison Davidson, Muhlenberg College
2:50 PM Strategies for Teaching Methods for Analyzing Data and the Necessary Computer-Based Programming to MPH Students
Amanda Rae Ellis, University of Kentucky
2:55 PM Building and Using Interactive R Tutorials: Experiences from a Categorical Data Analysis Course for Public Health Students
Adam Ciarleglio, George Washington University
3:00 PM Asking Great Questions: Part of a Theory of Communication in Interdisciplinary Collaborations
Eric Alan Vance, LISA-University of Colorado Boulder; Heather Smith, Cal Poly, San Luis Obispo
3:10 PM Incorporating Statistical Consulting in the Undergraduate Curriculum
Wanchunzi Yu, Bridgewater State University; Kevin Rion, Bridgewater State University; Irina Seceleanu, Bridgewater State University
3:15 PM Can Students Learn Statistics While Playing Games? A Regression Example
Ginger Holmes Rowell, Middle Tennessee State University; Shonda Kuiper, Grinnell College; Rodney Sturdivant, Baylor; Andrew Zieffler, University of Minnesota
 
 

319
Wed, 8/11/2021, 3:30 PM - 5:20 PM Virtual
SLDS CSpeed 6 — Contributed Speed
Section on Statistical Learning and Data Science, Text Analysis Interest Group
Chair(s): Weijing Tang, University of Michigan
3:35 PM Estimation of the Mean Function of Functional Data via Deep Neural Networks
GUANQUN CAO, Auburn university; Shuoyang Wang, Auburn university; Zuofeng Shang, New Jersey Institute of Technology
3:40 PM Nonlinear Functional Modeling Using Neural Networks
Aniruddha Rajendra Rao, Pennsylvania State University; Matthew Reimherr, Penn State University
3:45 PM Hyperparameter Optimization of Deep Neural Networks with Applications to Medical Device Manufacturing
Gautham Sunder, Carlson School of Management; Christopher Nachtsheim, Carlson School of Management; Thomas Albrecht, Boston Scientific
3:50 PM Deep Upper Confidence Bound Algorithm for Contextual Bandit Ranking of Information Selection
Michael Rawson, Department of Mathematics, University of Maryland at College Park; Jade Freeman, CCDC Army Research Laboratory
3:55 PM Exploring Neural Networks' Ability to Generate Music
NOAH Daniel SOLOMON, Bridgewater State University; Wanchunzi Yu, Bridgewater State University
4:00 PM Efficient Path Following Algorithms and Its Applications to Case Influence Assessment
Qiuyu Gu, The Ohio State University; Renxiong Liu, Ohio State University; Yunzhang Zhu, Ohio State University
4:05 PM TensorFlow Versus H2O, Round 2: Predicting Currency Prices
Kenneth Davis, Statistical Significance
4:10 PM Low-Rank Matrix/Tensor Estimation via Riemannian Gauss-Newton: Statistical Optimality and Second-order Convergence
Wen Huang, Xiamen University; Xudong Li, Fudan University; Anru Zhang, University of Wisconsin-Madison; Yuetian Luo, University of Wisconsin-Madison
4:15 PM Using Machine Learning Techniques to Model Factors That Influence the Intent of a Person to Take a Coronavirus Test
Sheila Rutto, The University of Texas Rio Grande Valley
4:20 PM On the Algorithmic Stability of Adversarial Training
Yue Xing, Purdue University; Qifan Song, Purdue University; Guang Cheng, Purdue University
4:30 PM A Dynamical View on Optimization Algorithms of Overparameterized Neural Networks Presentation
Zhiqi Bu, University of Pennsylvania; Shiyun Xu, University of Pennsylvania; Kan Chen, University of Pennsylvania
4:35 PM WITHDRAWN: Fair Influence Maximization on Social Networks with Community Structure
Octavio César Mesner, University of Michigan; Ji Zhu, University of Michigan; Liza Levina, University of Michigan
4:40 PM Controlled Group Variable Selection Using Variational Autoencoder-Generated Knockoffs and Reproducibility Evaluation
Xinran Qi, Medical College of Wisconsin
4:45 PM Stability of Text Analytics and Topic Analysis: A Deeper Look at Popular Methods Presentation
Mary Milam Whiteside, The University of Texas at Arlington; Mark E Eakin, The University of Texas at Arlington
4:50 PM Testing Hypotheses in Agent-Based Models
Georgiy Bobashev, RTI International; Hang Xiong, Huazhong Agricultural University, China
4:55 PM WITHDRAWN: Stacked Models and the Explainability Tradeoff in Recommender Systems
shaudi mahdavi hosseini, m.i.t.
5:00 PM An Eigenmodel for Dynamic Multilayer Networks
Joshua Daniel Loyal, University of Illinois at Urbana-Champaign; Yuguo Chen, University of Illinois at Urbana-Champaign
 
 

320
Wed, 8/11/2021, 3:30 PM - 5:20 PM Virtual
Electronic Health Records, Causal Inference and Miscellaneous — Contributed Speed
Section on Statistics in Epidemiology, Text Analysis Interest Group
Chair(s): Charles Hall, Albert Einstein College of Medicine
3:35 PM Modified Cox Regression for Modeling Severe Liver Disease Outcomes and History of FIB4
Mulugeta Gebregziabher, MUSC; Jingwen Zhang, MUSC; Patrick Mauldin, MUSC; Andrew Schreiner, MUSC
3:40 PM Application of the Knockoff Filter to Select Models for Automated Surveillance of Postoperative Infections
Yaxu Zhuang, University of Colorado Anschutz Medical Campus; Robert Meguid, University of Colorado Anschutz Medical Campus; William Henderson, University of Colorado Anschutz Medical Campus; Adam Dyas, University of Colorado Anschutz Medical Campus; Kathryn L Colborn, University of Colorado Anschutz Medical Campus
3:45 PM Efficient and Robust Semi-Supervised Learning: Estimating ATE with Partially Annotated Treatment and Response
Jue Hou, Harvard T.H. Chan School of Public Health; Tianxi Cai, Harvard T.H. Chan School of Public Health; Rajarshi Mukherjee, Harvard T.H. Chan School of Public Health
3:50 PM Detecting Data Entry Errors in Electronic Medical Records Using Basis Splines
Daren Kuwaye, University of Iowa
3:55 PM Cumulative Viral Load and Risk of Diabetes and Hypertension in People with HIV: An Analysis of Electronic Health Records
Adovich Rivera, Northwestern University; Lauren Beach, Northwestern University; Juned Siddique, Northwestern University; Donald Lloyd-Jones, Northwestern University; Matthew Feinstein, Northwestern University
4:00 PM Identifying COVID-19 Diagnoses Using Unstructured Electronic Health Records
Benjamin Ackerman, Flatiron Health; James Roose, Flatiron Health; Shrujal Baxi, Flatiron Health; Patrick Gonzales, Flatiron Health; Sandra D. Griffith, Flatiron Health
4:05 PM A Novel Semiparametric Approach to Analyzing Enriched Electronic Health Record Data
Jill Schnall, University of Pennsylvania; Yizheng Wei, University of South Carolina; Yanyuan Ma, Penn State University; Ravi Parikh, University of Pennsylvania; Jinbo Chen, University of Pennsylvania
4:10 PM Avoidance of Care: How Socioeconomic Inequities Impact COVID-19 Severity and Outcome
Chinyere J Okpara, NYU Langone Health Long Island; Jasmin Divers, NYU Long Island School of Medicine; Megan D Winner, NYU Langone Health Long Island; Meredith Akerman, NYU Long Island School of Medicine; Shahidul Islam, NYU Long Island School of Medicine
4:15 PM Double Sampling for Data Missing Not at Random: Designs and Efficient Estimation Strategies
Alexander Levis, Harvard T.H. Chan School of Public Health; Sebastien Haneuse, Harvard TH Chan School of Public Health
4:20 PM Sensitivity Analysis in the Generalization of Experimental Results
Melody Huang, UCLA
4:30 PM Estimating Causal Measures Using a Generalized Difference-in-Difference Approach
Marcelo Magalhaes Taddeo, Universidade Federal da Bahia (UFBA); Leila Denise A. F. Amorim, Universidade Federal da Bahia (UFBA); Rosana Aquino, Universidade Federal da Bahia (UFBA)
4:35 PM Estimating Heterogeneous Treatment Effects for Patients Hospitalized with COVID-19
Laine Thomas, Duke University, Department of Biostatistics and Bioinformatics
4:40 PM Profile Matching for the Generalization and Personalization of Causal Inferences Presentation
Eric Cohn, Harvard University; Jose Zubizarreta, Harvard University
4:45 PM Trials of Targets
Margret Erlendsdottir, Yale School of Public Health; Forrest W. Crawford, Yale University
4:50 PM Power and Sample Size Calculation for Microbiome Epidemiology
Meghan I. Short, Broad Institute; Emma Schwager, Harvard TH Chan School of Public Health; Siyuan Ma, Harvard T. H. Chan School of Public Health; Lauren McIver, Harvard TH Chan School of Public Health; Jeremy E. Wilkinson, Harvard T. H. Chan School of Public Health; Eric Franzosa, Harvard TH Chan School of Public Health; Curtis Huttenhower, Harvard T.H. Chan School of Public Health
4:55 PM Random Change-Point Non-linear Mixed Effects Model for left-censored longitudinal data: An application to HIV surveillance
Binod Manandhar, City University of New York; Hongbin Zhang, City University of New York
5:00 PM Design and Estimation for the Population Prevalence of Infectious Diseases with Limited Testing Resources
Eric Oh, Biocomplexity Institute, University of Virginia; Alyssa Mikytuck, Biocomplexity Institute, University of Virginia; Vicki Lancaster, Biocomplexity Institute, University of Virginia; Joshua Randall Goldstein, Biocomplexity Institute, University of Virginia; Sallie Keller, Biocomplexity Institute, University of Virginia
5:05 PM Racial and Ethnic Disparities in Years of Potential Life Lost Attributable to COVID-19 in the United States: An Analysis of 45 States and the District of Columbia
Jay J Xu, University of California, Los Angeles; Thomas R. Belin, UCLA FSPH
5:10 PM Exploration of Covariate-Constrained Randomization for Cluster-Randomized Trials in Which Many Clusters Are Available
Amy M. Crisp, University of Florida; Natalie Dean, University of Florida
 
 

345 * !
Thu, 8/12/2021, 10:00 AM - 11:50 AM Virtual
Advances in Macroeconomic Nowcasting and Forecasting: Role of Traditional and Nontraditional Indicators and Big Data — Topic-Contributed Papers
Business and Economic Statistics Section, Government Statistics Section, International Statistical Institute, Text Analysis Interest Group
Organizer(s): Baoline Chen, U.S. Bureau of Economic Analysis
Chair(s): Peter Zadrozny, Bureau of Labor Statistics
10:05 AM Back to the Present: Learning About the Euro Area Through a Now-Casting Model
Danilo Cascaldi-Garcia, Federal Reserve Board; Thiago R.T. Ferreira, Federal Reserve Board; Domenico Giannone, Amazon.com; Michele Modugno, Federal Reserve Board
10:25 AM Nowcasting of Advanced Estimates of Quarterly US Private Consumption of Services with Traditional Indicators and Credit Card Payments Data
Baoline Chen, U.S. Bureau of Economic Analysis; Kyle Hood, Bureau of Economic Analysis
10:45 AM Using Cross-Temporal Hierarchies to Improve Forecasts from Large Data Sets
Tommaso Di Fonzo, University of Padua; Daniele Girolimetto, University of Padua
11:05 AM Are conflict and uncertainty measures useful for macroeconomic nowcasting? An application for Latin America Presentation
Javier J. Perez, Bank of Spain; Hannes Mueller, Barcelona GSE; Marina Diakonova, Bank of Spain; Luis Molina, Bank of Spain; Christopher Rauh, University of Cambridge
11:25 AM Nowcasting GDP in Real Time with a Tone-Adjusted, Time-Varying Layered Topic Model
Jasper de Winter, De Nederlansche Bank
11:45 AM Floor Discussion
 
 

384
Thu, 8/12/2021, 12:00 PM - 1:50 PM Virtual
Next-Generation Sequencing and High-Dimensional Data — Contributed Speed
Biometrics Section, Text Analysis Interest Group
Chair(s): Arielle Kimberly Marks-Anglin, University of Pennsylvania
12:05 PM Efficient Two-Stage Analysis Approaches for Complex Trait Association with Arbitrary Depth Sequencing Data
Zheng Xu, Wright State University; Song Yan, University of North Carolina at Chapel Hill; Shuai Yuan, Marinus Pharmaceuticals; Zifang Guo, Merck & Co.; Yun Li, UNC-Chapel Hill
12:10 PM Statistical Inference from Stem Cell Barcoding Data Using Adaptive Approximate Bayesian Computation
Siyi Chen, Rice University; Marek Kimmel, Rice University; Katherine King, Baylor College of Medicine
12:15 PM A Reduced Rank Regression Model for Microbiome Data Integration
Ying Dai, Oregon State University; Duo Jiang, Oregon State University
12:20 PM SUITOR: Selecting the Number of Mutational Signatures Through Cross-Validation
DongHyuk Lee, National Cancer Institute; Difei Wang, National Cancer Institute; Xiaohong R. Yang, National Cancer Institute; Jianxin Shi, National Cancer Institute/National Institutes of Health; Maria T. Landi, National Cancer Institute; Bin Zhu, National Cancer Institute
12:25 PM Copula Models for Temporally Conserved Microbial Interactions
Rebecca A Deek, University of Pennsylvania; Hongzhe Li, University of Pennsylvania
12:30 PM An Approach to Summarize Biological Networks
Thao Vu, University of Colorado, Anschutz Medical Campus
12:35 PM Detecting Differential Expressed/Spliced Transcripts That Are Associated with Continuous Clinical Covariates, Including Survival Time
Huining Kang, Univeristy of New Mexico; Xichen Li, University of New Mexico
12:40 PM Time-Varying Graphical Models for Microbiome Data
Sarah Robinson, Rice University; Christine B. Peterson, The University of Texas MD Anderson Cancer Center
12:45 PM Enhancing Familial Relationship Inference in Admixed Populations
Daniel Yorgov, Purdue Fort Wayne
12:50 PM Backward Selection for RNA-Seq Differential Expression Analysis Using Pseudo-variables
Yet Nguyen, Old Dominion University; Dan Nettleton, Iowa State University
1:00 PM Tumor classification and survival estimation using metabolic activity scores from RNA sequencing data
Jack Goodman, Frank H. Netter SOM; Marcus Alexander, Yale University
1:05 PM Analyzing Dose Response Effects on Single Nuclei Gene Expression Data
Satabdi Saha, Michigan State University; Samiran Sinha, Texas A&M University; Taps Maiti, Michigan State University; Rance Nault, Michigan State University; Sudin Bhattacharya, Michigan State University; Timothy Zacharewski, Michigan State University
1:10 PM Statistical Methods for Mass Spectrometry Data
SO YOUNG RYU, University of Nevada Reno; Sijia Qiu, University of Nevada Reno
1:15 PM Logistic Tree Gaussian Processes (LoTGaP) for the Microbiome
Morris Greenberg, Duke University; Li Ma, Duke University; Zhuoqun Wang, Duke University; Pulong Ma, Duke University / SAMSI; Anthony Sung, Duke University Hospital
1:20 PM High-Sensitivity Pattern Discovery in Large, Paired Multi-Omic Data Sets
Andrew Ghazi, Broad Institute; Kathleen Sucipto, Harvard TH Chan School of Public Health; Gholamali Rahnavard, Harvard TH Chan School of Public Health; Eric Franzosa, Harvard TH Chan School of Public Health; Lauren McIver, Harvard TH Chan School of Public Health; Jason Lloyd-Price, Harvard TH Chan School of Public Health; Emma Schwager, Harvard TH Chan School of Public Health; George Weingart, Harvard TH Chan School of Public Health; Yo Sup Moon, Harvard TH Chan School of Public Health; Xochitl Morgan, University of Otago; Levi Waldron, CUNY Graduate school Public Health and Health Policy; Curtis Huttenhower, Harvard T.H. Chan School of Public Health
1:25 PM Efficient Estimation of the Maximal Association Between Multiple Predictors and a Survival Outcome
Tzu-Jung Huang, University of Washington; Alex Luedtke, University of Washington; Ian McKeague, Columbia University
1:30 PM Extracting Actigraphy-Based Walking Features with Structured Functional Principal Components
Verena Werkmann, Indiana University; Jaroslaw Harezlak, Indiana University; Nancy W. Glynn, University of Pittsburgh
1:35 PM A Nonparametric Empirical Bayes Approach to Covariance Matrix Estimation Presentation
Huiqin Xin, Department of Statistics, University of Illinois at Urbana-Champaign; Sihai Dave Zhao, Department of Statistics, University of Illinois at Urbana-Champaign
1:40 PM Variable Selection in Functional Logistic Regression Using Group LASSO
James Cameron, George Mason University; Pramita Bagchi, George Mason University
1:45 PM Graph-Based Trajectory Visualization for Text Mining of COVID-19 Biomedical Literature
Yeseul Jeon, Yonsei University of Applied Statistics; Dongjun Chung, Ohio State University; Jina Park, Yonsei University of Applied Statistics; Ickhoon Jin, Yonsei university
 
 

389 * !
Thu, 8/12/2021, 2:00 PM - 3:50 PM Virtual
Words and Insights via Text Analysis — Invited Papers
Text Analysis Interest Group, Section on Statistical Learning and Data Science, Section on Statistical Computing
Organizer(s): Kelly H. Zou, Viatris
Chair(s): Kelly H. Zou, Viatris
2:05 PM Topic-Adjusted Visibility Metric for Scientific Articles
Tian Zheng, Columbia University; Linda S. L. Tan, National University of Singapore
2:30 PM Measuring the Impact of Behavior Change Interventions Using Free-Text
Michael Baiocchi, Stanford University; Jordan Rodu, University of Virginia
2:55 PM Transfer Learning for Latent Dirichlet Allocation Presentation
Tommy W Jones, In-Q-Tel
3:20 PM Text Analysis for Topic Discovery with Applications for Cross-Reference, Summarization, and Optimized Recommended Reading List
Michael Henderson; Jesse Behrens, SAS
3:45 PM Floor Discussion
 
 

433 *
Thu, 8/12/2021, 4:00 PM - 5:50 PM Virtual
Statistical Approaches in Text Analysis — Topic-Contributed Papers
Text Analysis Interest Group
Organizer(s): Jordan Rodu, University of Virginia
Chair(s): Jared Murray, University of Texas
4:05 PM Minimizing Conflicts of Interest: Optimizing the JSM Program
David Banks, Duke University & SAMSI; Qiuyi Wu, University of Rochester; Luca Frigau, University of Cagliari
4:25 PM Grains and Brains: An Introduction to Text Regression Using Beer and Television Reviews
Michael Crotty, JMP; Clay Barker, JMP
4:45 PM NukeLM: Pre-Trained and Fine-Tuned Language Models for the Nuclear and Energy Domains
Daniel Fortin, Pacific Northwest National Laboratory; Lee Burke, Pacific Northwest National Laboratory; Karl Pazdernik, Pacific Northwest National Laboratory; Benjamin Wilson, Pacific Northwest National Laboratory; Rustam Goychayev, Pacific Northwest National Laboratory; John Mattingly, North Carolina State University
5:05 PM On the Need for More Statistics in Text Analysis, with Recent Advances
Jordan Rodu, University of Virginia; Michael Baiocchi, Stanford University
5:25 PM Discussant: Richard Ross, University of Virginia
5:45 PM Floor Discussion
 
 

439
Thu, 8/12/2021, 4:00 PM - 5:50 PM Virtual
Topics in Marketing — Contributed Speed
Section on Statistics in Marketing, Text Analysis Interest Group
Chair(s): Liu Liu, University of Colorado Boulder - Leeds School of Business
4:05 PM Capturing Heterogeneity Among Consumers with Multi-Taste Preferences
Liu Liu, University of Colorado Boulder - Leeds School of Business; Daria Dzyabura, New Economics School
4:10 PM CATA Data: Are There Differences in Perception? Presentation
Fabien Llobell, Addinsoft, XLSTAT; Davide Giacalone, University of Southern Denmark; Sara R. Jaeger, The New Zealand Institute for Plant & Food Research Limited; El Mostafa Qannari, StatSC, ONIRIS, INRAE, Nantes, France.
4:15 PM Data Visualization for Exploratory Factor Analysis
Nivedita Bhaktha, GESIS
4:20 PM Data Storytelling for Empowering Your Customers and Members
John VanderPloeg, Chicago Chapter of ASA; John Lee, Chicago Chapter of ASA
4:25 PM Relaxing Functional Form in Choice Models Through Gaussian Processes
Samuel I Levy, Carnegie Mellon University; Richard Mirman, Dynamic Loyalty Systems Inc.; Alan Montgomery, Carnegie Mellon University
4:30 PM Bayesian Modeling of Marketing Attribution
Ritwik Sinha, Adobe Research; David Arbour, Adobe Research; Aahlad Manas Puli, New York University
4:35 PM Modeling Time-to-Event Marketing Data: Accelerated Failure Time Model with Cure Fraction
Han Wu, Amazon; Vanja Dukic, Amazon
4:40 PM How Has COVID-19 Impacted Customer Relationship Dynamics at Food Delivery Businesses?
Daniel Minh McCarthy, Emory University; Elliot Shin Oblander, Columbia University
4:45 PM Testing New Versions of the Wage Phillips Curve in the MeMo-It Model Used by Istat
Deborah Scaccabarozzi, University of Bergamo; Daniele Toninelli, University of Bergamo; Davide Zurlo, Istat; Fabio Bacchini, Istat; Roberto Iannaccone, Istat
4:50 PM A Representative Sampling Method for Causal Inference in Social Network Experiments
Yanyan Li, University of Southern California; Qing Liu, University of Wisconsin-Madison; Sha Yang, University of Southern California
5:00 PM Shocks, Bubbles, and Crashes in Cryptocurrency Market
Min Shu, University of Wisconsin Stout; Ruiqiang Song, Michigan Technological University
5:05 PM Causal Inference and Machine Learning Techniques for Outbound Calls Strategy
Victor S.Y. Lo, Fidelity Investments; Zhuang Li, Fidelity Investments
5:10 PM Modeling Corporate Culture
Willi Cipolli, Colgate University
5:15 PM The Stories They Tell: My Interactions with the Business and How Storytelling Creates Leaders
Ben Barnard, Wells Fargo
5:20 PM Diagnosis of Bubbles and Crashes in Cryptocurrency Market Based on the Generalized Metcalfe’s Law and the Log-Periodic Power Law Singularity Model
Min Shu, University of Wisconsin Stout; Ruiqiang Song, Michigan Technological University
5:25 PM Modeling Tourists' Length of Overnight Stay in Pokhara, Nepal: Zero-Truncated Negative Binomial and Ordinary Least Square Regression Approach Presentation
Nirajan Bam, University of Northern Colorado, Greeley, USA
5:30 PM Model Selection to Forecast the Trend of COVID-19 for the Counties Near Houston, Texas Presentation
YOONSUNG JUNG, Prairie View A&M University
5:35 PM Causal Mediation Analysis with Three Mediators Where Causally Independent and Dependent Mediators Coexist
Youngho Bae, Sungkyunkwan University; Chanmin Kim, Sungkyunkwan University
5:40 PM AN UNBIASED REGRESSION TYPE ESTIMATOR OF PROPORTION IN RANDOMIZED RESPONSE SAMPLING BY USING ANOVA MECHANISM
Daryan Naatjes, Texas A&M University-Kingsville; Stephen A. Sedory, Texas A&M University-Kingsville; Sarjinder Singh, Texas A&M University -Kingsville
5:45 PM Floor Discussion