Online Program
Return to main conference page
Back to search menu
Key:
Applications
Computational Statistics
Computing Science
Data Science
Data Visualization
Machine Learning
Thursday, May 17
Registration
SDSS Hours
Thu, May 17, 7:30 AM - 5:30 PM
Registration
Exhibits Open
SDSS Hours
Thu, May 17, 7:30 AM - 7:15 PM
Regency Ballroom Foyer
GS01 -
Welcome and Keynote Address
General Session
Thu, May 17, 8:30 AM - 10:00 AM
Grand Ballroom D
Chair(s): Yasmin H. Said, George Mason University
8:30 AM
SDSS Welcome
Yasmin H. Said, George Mason University; Jim Harner, West Virginia University; Ronald L. Wasserstein, American Statistical Association
8:45 AM
Uncovering the Mechanisms of General Anesthesia: Where Neuroscience Meets Statistics
Presentation
Emery N. Brown, MIT, Harvard Medical School, and Massachusetts General Hospital
9:40 AM
Edward J. Wegman Award Ceremony
PS02 -
Public Health/Disease
E-Poster
Thu, May 17, 10:00 AM - 10:45 AM
Regency Ballroom B
1
Daily Smokers’ Attributes Associated with Purchasing Cigarettes on Indian Reservations
Richard A Pack, Burnett School of Biomedical Sciences, College of Medicine, University of Central Florida
2
120/5000 Estimation of Life Years Potentially Lost Due to Traffic Accidents Involving a Motorcycle in Costa Rica
Presentation
Agustín Gómez Meléndez, University of Costa Rica
3
Mapping Rates of Inpatient Hospitalizations Related to Mental Disorders in the State of Missouri: A Conditional Autoregressive Model With Zip Code-Level Data
Presentation
Daphne Lew, Saint Louis University
4
Improved Predictive Models for Readmission of Patients with Diabetes
Presentation
Chathurangi Heshani Karunapala Pathiravasan, Southern Illinois University
5
Development of Prognostic Model for Breast Cancer in Shanghai Breast Cancer Survival Study (SBCSS)
Run Fan, Vanderbilt University Medical Center, Department of Biostatistics
6
A Machine Learning Approach to Improve Fall Risk Prediction in Home Health Care
Yancy Lo, Institute for Biomedical Informatics, The Perelman School of Medicine, University of Pennsylvania
7
Treating Leukemia in Youths
Presentation
Zachary R Smith, University of Michigan - Dearborn
8
Survival of Young Leukemia Patients
Theren Williams, University of Michigan- Dearborn
9
Hospital Readmission Risk Prediction after Joint Replacement Surgery
Presentation
Selah F. Lynch, Institute for Biomedical Informatics, The Perelman School of Medicine, University of Pennsylvania
CS01 -
Automated Model Building
Invited
Thu, May 17, 10:30 AM - 12:00 PM
Grand Ballroom D
Organizer(s): William S. Cleveland, Purdue
Chair(s): Ryan Hafen, Hafen Consulting LLC
10:30 AM
D3M Automated Model Building and Diagnostics
Curtis Lisle, KnowledgeVis LLC
11:00 AM
Candela: An Interactive Visualization Component Library for Data Science
Presentation
Jeffrey Baumes, Kitware, Inc.
11:30 AM
Average-Transform-Smooth (ATS) Diagnostic Methods for Non-Gaussian Models
Presentation
William S. Cleveland, Purdue
CS02 -
Statistics Inference for High-Dimensional Regression
Invited
Thu, May 17, 10:30 AM - 12:00 PM
Grand Ballroom E
Organizer(s): Larry Wasserman, Carnegie Mellon University
Chair(s): Todd A Kuffner, Washington University in St. Louis
10:30 AM
Testing for Global Network Structure Using Small Subgraph Statistics
Chao Gao, University of Chicago
11:00 AM
Inferential Goals, Targets, and Principles in High-Dimensional Regression
Todd A Kuffner, Washington University in St. Louis
11:30 AM
Selective Inference in Linear Regression
Jonathan Taylor, Stanford University
CS03 -
Interactive Statistical Graphics: Where Are We Now?
Invited
Thu, May 17, 10:30 AM - 12:00 PM
Grand Ballroom F
Organizer(s): Adalbert Wilhelm, Jacobs University
Chair(s): Adalbert Wilhelm, Jacobs University
10:30 AM
Exploratory Visualization via Extendible Interactive Graphics
Presentation
Wayne Oldford, University of Waterloo
11:00 AM
Model Exploration via Conditional Visualisation
Presentation
Catherine Hurley, Maynooth University
11:30 AM
Interactive (Web-)Graphics (using R)
Heike Hofmann, Iowa State University
CS04 -
Best Practices in Data Science Education
Invited
Thu, May 17, 10:30 AM - 12:00 PM
Grand Ballroom G
Organizer(s): Ben Baumer, Smith College
Chair(s): Ben Baumer, Smith College
10:30 AM
Start with Data Science as an Introduction to Statistical Thinking
Presentation
Mine Cetinkaya-Rundel, Duke University & RStudio
11:00 AM
Data Science for Everybody: Building and Characterizing Student-Driven Pathways in Introductory Statistics Courses
Rebecca Nugent, Carnegie Mellon Statistics & Data Science
11:30 AM
Data-Driven Curriculum Development
David Robinson, DataCamp
CS05 -
Statistical Machine Learning with Business Applications
Invited
Thu, May 17, 10:30 AM - 12:00 PM
Regency Ballroom A
Organizer(s): Brad Price, West Virginia University
Chair(s): Brad Price, West Virginia University
10:30 AM
A Cluster Elastic Net for Multivariate Regression
Ben Sherwood, University of Kansas
11:00 AM
Selection and Its Inference Using the Whole Solution Paths
Peng Wang, University of Cincinnati
11:30 AM
Shrinking Characteristics of Precision Matrix Estimators
Aaron J. Molstad, Fred Hutchinson Cancer Research Center
CS06 -
Analytics for Fitness Tracker Data
Invited
Thu, May 17, 10:30 AM - 12:00 PM
Lake Fairfax A
Organizer(s): David Marchette, Naval Surface Warfare Center
Chair(s): Shelby Macy, Naval Surface Warfare Center
10:30 AM
Correlating Sleep and Temperature Patterns in Navy Warfighters With Current and Future Health Status
Laura Maple, NSWCDD
11:00 AM
An Artificial Intelligence System for Real-Time Individualized Core Temperature Estimation
Jaques Reifman, US Army MRMC/BHSAI
11:30 AM
Statistical Methods for Micro- and Macro-Level Accelerometry Data
Jiawei Bai, Johns Hopkins University
CS07 -
Optimization
Contributed
Thu, May 17, 10:30 AM - 12:00 PM
Lake Fairfax B
Chair(s): Jingyi Zhu, The Johns Hopkins University
10:30 AM
Topological Mixture Estimation
Presentation
Steve Huntsman, BAE Systems
10:45 AM
Plotting Two-Dimensional Confidence Regions
Presentation
Christopher Weld, College of William & Mary
11:00 AM
Tracking Capability of Stochastic Approximation Algorithms with Constant Gain
Jingyi Zhu, The Johns Hopkins University
11:15 AM
Variable Selection for Consistent Clustering
Ronald Joseph Yurko, Carnegie Mellon University
11:30 AM
BRISC: Bootstrap for Rapid Inference on Spatial Covariances
Arkajyoti Saha, Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health
11:45 AM
Reduced Complexity of Second-Order Simultaneous Perturbation Stochastic Approximation Algorithms
Jingyi Zhu, The Johns Hopkins University
CS08 -
Reasoning with Data
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Grand Ballroom D
Organizer(s): William Szewczyk, Mathematics Research Group, National Security Agency
Chair(s): William Szewczyk, Mathematics Research Group, National Security Agency
1:30 PM
Capturing Subject Matter Expertise for Automated Assisted Analysis
Presentation
William Szewczyk, Mathematics Research Group, National Security Agency
2:00 PM
Task-Centric Document Curation based on Node Embeddings from a Graphical Representation of Workflows
Paul Jones, Laboratory for Analytic Sciences
2:30 PM
Experiences with AI, Expert Knowledge and Data Analysis
Presentation
Octavian Udrea, IBM T.J. Watson Research Center
CS09 -
Advanced Mathematics for Data Analysis
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Grand Ballroom E
Organizer(s): David Marchette, Naval Surface Warfare Center
Chair(s): David Marchette, Naval Surface Warfare Center
1:30 PM
Persistence Images and Applications
Tegan Emerson, Naval Research Laboratory
2:00 PM
A Geometric Formulation of Neural Network Training
Presentation
David A. Johannsen, Naval Surface Warfare Center - Dahlgren
2:30 PM
Information Tests on Statistical Submanifolds
Michael Trosset, Indiana University
CS10 -
Visualization Using Open-Source Tools
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Grand Ballroom F
Organizer(s): Wendy Martinez, U.S. Bureau of Labor Statistics
Chair(s): Wendy Martinez, U.S. Bureau of Labor Statistics
1:30 PM
Visualizing BLS Data in Google Public Data Explorer
Presentation
Christopher Morris, U.S. Bureau of Labor Statistics
2:00 PM
Visualization Using Open-Source Tools: some FDA perspectives
Presentation
Paul Schuette, US Food and Drug Administration
2:30 PM
Small Business Database
Richard Schwinn, Small Business Administration
CS11 -
Big Data Analytics Using R and Spark
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Grand Ballroom G
Organizer(s): Brad Price, West Virginia University
Chair(s): Brad Price, West Virginia University
1:30 PM
Data Science Workflows
Jim Harner, West Virginia University
2:00 PM
Data Science at Scale With R and Sparklyr: Architecture, Ecosystem, and Current Developments
Kevin Kuo, Rstudio
2:30 PM
Interacting with Distributed Data from R using SparkR
Presentation
Hossein Falaki, Databricks
CS12 -
Model Selection in High-Dimensions with Complexities
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Regency Ballroom A
Organizer(s): Hamparsum Bozdogan, University of Tennessee
Chair(s): Hamparsum Bozdogan, University of Tennessee
1:30 PM
A New Approach to Dimension Reduction For Multivariate Time Series
Chung Eun Lee, University of Tennessee, Knoxville
2:00 PM
Coordinate-Independent Sparse Estimation in Semiparametric Models
Haileab Hilafu, University of Tennessee
2:30 PM
Expected Volume Confidence Region Complexity (EVCR_COMP) Criterion in High Dimensions with Applications
Hamparsum Bozdogan, University of Tennessee
CS13 -
Social Network Analysis
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Lake Fairfax A
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): William F. Wieczorek, SUNY Buffalo State
1:30 PM
Social Networks and Simplicial Complexes
Presentation
Daniele Struppa, Chapman University
2:00 PM
Reflections on Computational Social Science, in Honor of Ed Wegman
Claudio Cioffi-Revilla, George Mason University
2:30 PM
The Big Picture: Big Data, Big Theory, and Big Challenges
Presentation
William G. Kennedy, George Mason University
CS14 -
Monitoring Financial Stability with Data Science
Invited
Thu, May 17, 1:30 PM - 3:00 PM
Lake Fairfax B
Organizer(s): Shawn Mankad, Cornell University
Chair(s): Shawn Mankad, Cornell University
1:30 PM
Modeling and Prediction of Financial Trading Networks: An Application to the NYMEX Natural Gas Futures Market
Abel Rodriguez, University of California, Santa Cruz
2:00 PM
Elicitability and Backtesting: Perspectives for Banking Regulation
Natalia Nolde, University of British Columbia
2:30 PM
Systemic Risk from Asset Concentration and Common Holdings among Banks
Celso Brunetti, Federal Reserve Board
PS03 -
Bayesian Modeling
E-Poster
Thu, May 17, 3:00 PM - 3:45 PM
Regency Ballroom B
1
Bayesian Modeling of Non-Stationary, Univariate, Spatial Data
Margaret Goldman, U.S. Geological Survey
2
Choosing Among a Class of Zellner’s g-Priors in Bayesian Regression Models and Subset Selection of Variables Using the Genetic Algorithm and Information Complexity
Yaojin Sun, The University of Tennessee
3
Lagged Exact Bayesian Online Changepoint Detection
Michael Byrd, Southern Methodist University
4
Constrained Bayesian Inference through Posterior Projections
Sayan Patra, Duke University
5
On the Quantification and Efficient Propagation of Imprecise Probabilities Using Monte Carlo Methods
Jiaxin Zhang, Johns Hopkins University
6
Bayesian Optimization of Personalized Models for Real-Time Patient Monitoring
Glen Wright Colopy, Oxford University
CS15 -
Best of Computational and Graphical Statistics
Invited
Thu, May 17, 3:30 PM - 5:00 PM
Grand Ballroom E
Organizer(s): Di Cook, Monash University
Chair(s): Catherine Hurley, Maynooth University
3:30 PM
Clusters Beat Trend!? Testing Feature Hierarchy in Statistical Graphics
Susan Ruth VanderPlas, Nebraska Public Power District
4:00 PM
Fused Lasso Additive Model
Presentation
Ashley Petersen, Division of Biostatistics, University of Minnesota
4:30 PM
Programming With Models: Writing Statistical Algorithms for General Model Structures With NIMBLE
Daniel Turek, Williams College
CS16 -
Text Data Analytics and Visualization
Invited
Thu, May 17, 3:30 PM - 5:00 PM
Grand Ballroom E
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): Kelly S Marczynski, SUNY Buffalo State
3:30 PM
Algorithmic and Visualization Frameworks to Facilitate the Revelation of Interesting Structure in Document Collections
Jeffrey L. Solka, Naval Surface Warfare Center
4:00 PM
Fast k Nearest Neighbor Graph Construction Experiments on a Large Scientific Publication Corpus
Avory Bryant, Naval Surface Warfare Center
4:30 PM
Leveraging Automated Storytelling With b-Privy Analytics: Creating Plausible Explanations of Emerging Technologies to Mitigate Surprise
John T. Rigsby, Naval Surface Warfare Center
CS17 -
Data Science at the National Institute of Statistical Sciences
Invited
Thu, May 17, 3:30 PM - 5:00 PM
Grand Ballroom G
Organizer(s): Jim Rosenberger, NISS and Pennsylvania State University
Chair(s): Jim Rosenberger, NISS and Pennsylvania State University
3:30 PM
Using Administrative Data to Produce Official Statistics: An Application to End-Of-Season Acreage Estimation
Presentation
Andreea L Erciulescu, National Institute of Statistical Sciences and USDA National Agricultural Statistics Service
4:00 PM
Future of Integer Calibration Weighting Methods
Presentation
Luca Sartore, National Institute of Statistical Sciences
4:30 PM
The NCES/NISS Partnership: Data Collection Efforts/Structures/New Initiatives
Nell Sedransk, National Institute of Statistical Sciences
CS18 -
Nonlinear Dimension Reduction
Invited
Thu, May 17, 3:30 PM - 5:00 PM
Regency Ballroom A
Organizer(s): Michael Trosset, Indiana University
Chair(s): Michael Trosset, Indiana University
3:30 PM
Optimality of the Johnson-Lindenstrauss Lemma
Jelani Nelson, Harvard University
4:00 PM
Matrix Sketching for Alternating Direction Method of Moments Optimization
Presentation
Daniel McDonald, Indiana University
4:30 PM
Optimal Dimensionality Reduction for Non-Linear Clustering Via Nystrom Approximation
Presentation
Alex Gittens, Rensselaer Polytechnic Institute
CS19 -
CyberLanguage: Applications of Natural Language Processing to CyberSecurity
Invited
Thu, May 17, 3:30 PM - 5:00 PM
Lake Fairfax A
Organizer(s): Joseph Marr, DZYNE Technologies
Chair(s): Joseph Marr, DZYNE Technologies
3:30 PM
Network Traffic Anomaly Detection Using Recurrent Neural Networks
Benjamin Radford, KeyW
4:00 PM
Modeling Machine-to-Machine Cyber Data as Discrete Sequences of Activity
Bartley Richardson, KeyW
4:30 PM
Time Series Pattern Mining and Visualization Using Statistical Language Processing Techniques
Jessica Lin, George Mason University
CS20 -
Differential and Bitcoin Privacy
Invited
Thu, May 17, 3:30 PM - 5:00 PM
Lake Fairfax B
Organizer(s): Roy E. Welsch, MIT
Chair(s): Roy E. Welsch, MIT
3:30 PM
Differentially Private Model Selection with Penalized and Constrained Likelihood
Presentation
Jing Lei, Carnegie Mellon University
4:00 PM
Blockchain Technology: A New Approach to Digital Privacy?
Christian Catalini, MIT
4:30 PM
Differentially Private Parametric Inference
Marco Avella Medina, MIT
CS21 -
Computational Text Processing
Invited
Thu, May 17, 5:15 PM - 6:15 PM
Grand Ballroom E
Organizer(s): Mark Hansen, Columbia
Chair(s): Mark Hansen, Columbia
5:15 PM
Modeling and Understanding Language with Neural Networks Using Spark and R
Ali Zaidi, Microsoft AI and Research
5:45 PM
Computational Propaganda
Mark Hansen, Columbia
CS22 -
Distinguished Colleagues of Edward Wegman: Mathematical Physics
Invited
Thu, May 17, 5:15 PM - 6:15 PM
Grand Ballroom F
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): David Marchette, Naval Surface Warfare Center
5:15 PM
Laws of the Universe, Information and Mind in the Quantum Universe
Menas C. Kafatos, Chapman University
5:45 PM
Exploring and Exploiting Interestingness in Data Science
Presentation
Kirk Borne, Booz Allen Hamilton
CS23 -
Data Science Platforms I
Invited
Thu, May 17, 5:15 PM - 6:15 PM
Grand Ballroom G
Organizer(s): Jim Harner, West Virginia University
Chair(s): Jim Harner, West Virginia University
5:15 PM
Automating Data Science Processes with H2O Driverless AI
Presentation
Patrick Hall, H2O.ai
5:45 PM
Building Data Science Platforms Using Docker
Jim Harner, West Virginia University
CS24 -
TensorFlow
Invited
Thu, May 17, 5:15 PM - 6:15 PM
Regency Ballroom A
Organizer(s): Tim Hesterberg, Google
Chair(s): Tim Hesterberg, Google
5:15 PM
TensorFlow Autograph: Source Code Transformation for Easier TensorFlow
Alex Wiltschko, Google
5:45 PM
Machine Learning with TensorFlow and R
J.J. Allaire, Rstudio
CS25 -
Time Series Modeling
Invited
Thu, May 17, 5:15 PM - 6:15 PM
Lake Fairfax A
Organizer(s): Jim Harner, West Virginia University
Chair(s): Rida Moustafa, Walmart
5:15 PM
The Divergence Between Observed and Modeled Temperature Trends in the Tropical Troposphere 1958-2017
Ross McKitrick, University of Guelph
5:45 PM
Forecasting with Many Predictors
Presentation
Kyle Caudle, SD School of Mines and Technology
CS26 -
Combining Federal and Regional Data Sources: Challenges and Solutions
Invited
Thu, May 17, 5:15 PM - 6:15 PM
Lake Fairfax B
Organizer(s): Lingzhou Xue, Pennsylvania State University
Chair(s): Nell Sedransk, National Institute of Statistical Sciences
5:15 PM
Six Classes of Methodological Research Questions in the Integration of Multiple Data Sources for Granular Estimation
John Eltinge, U.S. Census Bureau
5:35 PM
Use of the Quarterly Census of Employment and Wages and Third-Party Sources for EIA Surveys
Nanda Srinivasan, Energy Information Administration
5:55 PM
Discussant
Jim Rosenberger, NISS and Pennsylvania State University
PS04 -
Machine Learning Applications
E-Poster
Thu, May 17, 6:15 PM - 7:15 PM
Regency Ballroom B
1
Penalized Regression Within the Game Cribbage
Presentation
Christopher Silberstein, The Ohio State Univerisity
2
Diagnosing and Predicting the Eyewall Replacement Cycle: Learning from Hurricane Irma
Martha Lisbeth Christino, T.C. Williams High School
3
Random Forest Prediction Intervals
Haozhe Zhang, Iowa State University
4
Machine Learning for Acute Kidney Injury with IDEAs: Intraoperative Data Embedded Analytics
Presentation
Lasith Adhikari, University of Florida
5
Predicting Human Alteration of River and Stream Salinity Using Random Forest Models
Presentation
Franco Alexis Sanchez, California State University, Monterey Bay, Department of Mathematics and Statistics
6
Performance of Cross-Validation of Binary Longitudinal Finite Mixture Models: A Simulation and Application.
Presentation
Thom J Taylor, Nicklaus Childrens Research Institute
7
The Sliding Window Fourier Transform
Presentation
Lee F Richardson, Carnegie Mellon university
8
Machine Learning Improved Classification of Psychoses using Clinical and Biological stratification: Update from the Bipolar-Schizophrenia Network for Intermediate Phenotypes (B-SNIP)
Suraj Sarvode Mothi, Department of Psychiatry, Massachusetts General Hospital
9
Inter- and Intra-Institutional Efforts to Build Capacity for Data Science Education
Presentation
Douglas Landsittel, University of Pittsburgh
GS02 -
Symposium on Data Science & Statistics Banquet
General Session
Thu, May 17, 7:15 PM - 8:30 PM
Grand Ballroom D
I Never Met a Datum I Didn’t Like
Barry D. Nussbaum, 2017 President, American Statistical Association
↑