Online Program
Return to main conference page
Back to search menu
Key:
Applications
Computational Statistics
Computing Science
Data Science
Data Visualization
Machine Learning
Friday, May 18
Exhibits Open
SDSS Hours
Fri, May 18, 7:30 AM - 4:00 PM
Regency Ballroom Foyer
Registration
SDSS Hours
Fri, May 18, 7:30 AM - 5:30 PM
Registration
GS03 -
Plenary Session: Contributions to Computational Statistics
General Session
Fri, May 18, 8:30 AM - 10:00 AM
Grand Ballroom D
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): Yasmin H. Said, George Mason University
8:35 AM
Ed Wegman's Influence on the Profession: His Work in Computational Statistics and Density Estimation in Particular
David Scott, Rice University
9:00 AM
Statistical Graphics in Data Science
Presentation
Adalbert Wilhelm, Jacobs University
9:25 AM
Omnibus Regression: Predicting Probability Distributions with Imperfect Data
Jerome H. Friedman, Stanford University
9:50 AM
Floor Discussion
PS05 -
Bioinformatics/Biomedical
E-Poster
Fri, May 18, 10:00 AM - 10:45 AM
Regency Ballroom B
1
Effect of Non-Parametric Mapping Over Parametric Mapping for fMRI
Siddharth Nayak, Institute of Statistical Science, Academia Sinica
2
Identifying Bioethical Issues in Biostatistical Consulting: Findings From a US National Pilot Survey of Biostatisticians
Min Qi Wang, University of Maryland
3
Diagnostic Prediction of Autism in Resting-State Functional Mri Using Conditional Random Forest
Afrooz Jahedi, San Diego State University
4
Data-Driven Statistical Methods for Detecting Gait Instability Using Physiological Signal Metrics
Kristin Morgan, University of Connecticut
5
A Comparison of Selected Parametric and Non-Parametric Statistical Approaches for Candidate Genes Selection in Transcriptome Data
Presentation
Dawit Gezahegn Tadesse, Cincinnati Children's Hospital Medical Center
6
Wavelet-based Classification Applied to fMRI
Presentation
Pedro Alberto Morettin, University of São Paulo
7
Visualizations to Guide Dimension Reduction for Sparse High-Dimensional Data
Snehalata Huzurbazar, West Virginia University
CS27 -
Distinguished Colleagues of Edward Wegman: Applications to Data Science
Invited
Fri, May 18, 10:30 AM - 12:00 PM
Grand Ballroom D
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): Yasmin H. Said, George Mason University
10:30 AM
Automatic Visualization
Leland Wilkinson, H2O
11:00 AM
Cherry-Picking for Complex Datasets
Presentation
David Banks, SAMSI and Duke University
11:30 AM
Bayesian Penalty Mixing with the The Spike and Slab Lasso
Presentation
Edward George, University of Pennsylvania
CS28 -
Bayesian Computations and Applications
Invited
Fri, May 18, 10:30 AM - 12:00 PM
Grand Ballroom E
Organizer(s): Ehsanolah Soofi, University of Wisconsin at Milwaukee
Chair(s): Ehsanolah Soofi, University of Wisconsin at Milwaukee
10:30 AM
Analysis of Crimean-Congo Hemorrhagic Fever Incidents with Dynamically Weighted Particle Filter
Presentation
Duchwan Ryu, Northern Illinois University
11:00 AM
Non-Negative Matrix Factorization for The Exponential Family Based on Generalized Dual Divergence and Intrinsic Information
Karthik Devarajan, Fox Chase Cancer Center, Temple University Health System
11:30 AM
Masking Data Using an Entropy Approach
Kurt Pflughoeft, University of Wisconsin Milwaukee
CS29 -
Big Data Visualization
Invited
Fri, May 18, 10:30 AM - 12:00 PM
Grand Ballroom F
Organizer(s): Rida Moustafa, Walmart
Chair(s): Rida Moustafa, Walmart
10:30 AM
Developing Inferential Visual Analytics Systems for Scientific Applications
Chad A. Steed, Oak Ridge National Laboratory
11:00 AM
Data Visualization in Statistical Consulting Applications
Heather Watson, Exponent, Inc.
11:30 AM
Quantization and Enveloping Methods for Scaling Visualization Techniques to Big Data
Rida Moustafa, Walmart
CS30 -
Data Science Programs
Invited
Fri, May 18, 10:30 AM - 12:00 PM
Grand Ballroom G
Organizer(s): Tim Hesterberg, Google
Chair(s): Tim Hesterberg, Google
10:30 AM
NYU Master of Science in Data Science
Presentation
Arthur Spirling, New York University
10:55 AM
Columbia University Master of Science in Data Science
Presentation
Tian Zheng, Columbia University
11:20 AM
WVU Master of Science in Business Data Analytics: Challenges and Experiences with Online Data Science Programs
Presentation
Brad Price, West Virginia University
11:45 AM
Floor Discussion
CS31 -
Recent Advances in Statistical Machine Learning
Invited
Fri, May 18, 10:30 AM - 12:00 PM
Regency Ballroom A
Organizer(s): Eric Chi, North Carolina State University; David Scott, Rice University
Chair(s): David Scott, Rice University
10:30 AM
On the Regularizations for Enforcing Equi-Sparsity
Yiyuan She, Florida State Univresity
11:00 AM
An Alternating Directions Method for Large-scale Multivariate Convex Regression
Jason Xu, University of California Los Angeles
11:30 AM
Tensor Canonical Correlation Analysis
Eric Chi, North Carolina State University
CS32 -
Data Science Partnerships
Invited
Fri, May 18, 10:30 AM - 12:00 PM
Lake Fairfax A
Organizer(s): Sallie Keller, Biocomplexity Institute of Virginia Tech
Chair(s): Sallie Keller, Biocomplexity Institute of Virginia Tech
10:30 AM
Using Multiple Big Data Sources to Manage a Supply Chain
Dave Higdon, SDAL, Virginia Tech
11:00 AM
Partnering for Data Science: The Laboratory for Analytic Sciences
Presentation
Alyson Wilson, North Carolina State University
11:30 AM
University, Government, NGO Partnership Around Statistical Solutions to Urban Challenges
Katherine Bennett Ensor, Rice University
CS33 -
Survey Science
Contributed
Fri, May 18, 10:30 AM - 12:00 PM
Lake Fairfax B
Chair(s): MoonJung Cho, U.S. Bureau of Labor Statistics
10:30 AM
Survey Estimation with Elastic Net Regression: Combining Data Sources to Improve Estimator Efficiency
Kelly Sue McConville, Swarthmore College
10:45 AM
Pseduolikelihood Inference for Quantiles From Complex Surveys
Jing Wang, The University of Texas at Arlington
11:00 AM
Can a Statistician Thrive Using Only Free Software?
Amang Sukasih, RTI International
11:15 AM
Systematic Sampling Design with Application to Data Splitting
Redouane Betrouni, George Mason University
11:30 AM
Incorporating Design Concepts and Methods into the Integration of Multiple Data Sources
John Eltinge, U.S. Census Bureau
11:45 AM
Classification Trees for Privacy in Sample Surveys
Presentation
Rolando Andres Rodriguez, U.S. Census Bureau
CS34 -
Distinguished Students of Edward Wegman
Invited
Fri, May 18, 1:30 PM - 3:00 PM
Grand Ballroom D
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): Edward George, University of Pennsylvania
1:30 PM
On Spectral Graph Clustering
Carey E. Priebe, Johns Hopkins University
2:00 PM
Modeling Topics in Survey Interviewer Notes
Presentation
Wendy Martinez, U.S. Bureau of Labor Statistics
2:30 PM
Eigen-Privy: Adjacency Spectral Embedding for Document Analysis
Presentation
David Marchette, Naval Surface Warfare Center
CS35 -
Advances in Bayesian Analytics
Invited
Fri, May 18, 1:30 PM - 3:00 PM
Grand Ballroom E
Organizer(s): Refik Soyer, George Washington University
Chair(s): Refik Soyer, George Washington University
1:30 PM
Deep Learning: A Bayesian Perspective
Vadim Sokolov, George Mason University
2:00 PM
Bayesian Analysis of Multivariate Non-Gaussian Time Series
Refik Soyer, George Washington University
2:30 PM
Likelihood, Confirmational Tenacity, and Mood Transitions in Bayesian Inference
Nozer D. Singpurwalla, City University of Hong Kong
CS36 -
Data Visualization Platforms
Invited
Fri, May 18, 1:30 PM - 3:00 PM
Grand Ballroom F
Organizer(s): Jim Harner, West Virginia University
Chair(s): Jim Harner, West Virginia University
1:30 PM
Using Shiny to interact with data
Winston Chang, Rstudio
2:00 PM
The Interactive Solution Path in JMP Pro: A Powerful Tool for Visualizing and Exploring Model Diagnostics
Chris Gotwalt, JMP
2:30 PM
RCloud - Collaborative Platform for Visualization and Data Analysis
Simon Urbanek, ATT Research
CS37 -
Statistical Analytics for Data Science
Invited
Fri, May 18, 1:30 PM - 3:00 PM
Grand Ballroom G
Organizer(s): Lynne Billard, University of Georgia
Chair(s): Seyed Yaser Samadi, Southern Illinois University Carbondale
1:30 PM
Time Series Analysis for Symbolic Interval-valued Data
Seyed Yaser Samadi, Southern Illinois University Carbondale
2:00 PM
Privacy Analytics via Aggregate Data: Trade-off between Statistical Efficiency and Privacy
Anand N. Vidyashankar, George Mason University
2:30 PM
Clustering Histogram-valued Data
Lynne Billard, University of Georgia
CS38 -
Statistical Challenges in Large-Scale Data Mining
Invited
Fri, May 18, 1:30 PM - 3:00 PM
Regency Ballroom A
Organizer(s): Tian Zheng, Columbia University
Chair(s): Tian Zheng, Columbia University
1:30 PM
A Scalable Algorithm for Change-Points Computation in Large Graphical Models
Yves Atchade, University of Michigan
2:00 PM
Embedding Approaches for Mining Heterogeneous Information Networks
Presentation
Yizhou Sun, UCLA
2:30 PM
Approximate Data Analytics
Christopher Jermaine, Rice University
CS39 -
Applications of Divide and Recombine to Big Data
Invited
Fri, May 18, 1:30 PM - 3:00 PM
Lake Fairfax A
Organizer(s): William S. Cleveland, Purdue
Chair(s): Soren Harner, MuleSoft
1:30 PM
Divide & Recombine (D&R) with DeltaRho for Big Data Analysis
Presentation
William S. Cleveland, Purdue
2:00 PM
DeltaRho for Deep Analysis of Precipitation and Cloud Observations to Advance the Understanding of Earth's Water Cycle
Wen-wen Tung, Earth, Atmospheric, and Planetary Sciences, Purdue
2:30 PM
Applications of Large-Scale Visualization Using Trelliscope
Presentation
Ryan Hafen, Hafen Consulting LLC
CS40 -
Data Science Foundations
Contributed
Fri, May 18, 1:30 PM - 3:00 PM
Lake Fairfax B
Chair(s): Snehalata Huzurbazar, West Virginia University
1:30 PM
A Grammar for Reproducible and Painless Extract-Transform-Load Operations on Medium Data
Ben Baumer, Smith College
1:45 PM
Perspectives on Deep Learning and Deep Reasoning
Presentation
Rich Haney, Big Data2 Consulting
2:00 PM
Defining the AIM: An Abstraction for Improving Machine Learning Prediction
VICTORIA STODDEN, University of Illinois Urbana-Champaign
2:15 PM
Sensemaking and Five Problems with Big Data Science
Presentation
Michael Latta, Coastal Carolina University - YTMBA Research & Consulting
2:30 PM
Painless Computing Models for Ambitious Data Science
Presentation
Hatef Monajemi, Stanford University
2:45 PM
A Paradigm for Research in Data Science
Presentation
Vardan Papyan, Stanford
PS06 -
Survey Data
E-Poster
Fri, May 18, 3:00 PM - 3:45 PM
Regency Ballroom B
1
Constrained Optimization for Survey Weights
Presentation
Matthew R Williams, Substance Abuse and Mental Health Services Administration
2
Performance Evaluation of Machine Learning Algorithms by K-Fold and Leave-One-Out Cross Validation for Classification of Survey Write-in Responses
Presentation
Andrea Roberson, U.S. Census Bureau
3
Looking Inward: Quality Audits for Demographic Programs at the U.S. Census Bureau
Richard Levy, US Census Bureau
4
Some Dimension Reduction Strategies for the Analysis of Survey Data
Jiaying Weng, University of Kentucky
5
Suggestion of the Confidence Interval of the Cronbach Alpha in Application to Complex Survey Data
Jihnhee Yu, University at Buffalo
6
Secure Distributed Computational Processing for Industry Statistical Data
Cavan Paul Capps, U.S. Census Bureau
CS41 -
Big Data and Data Science in Government, Public Policy, and the Health Sciences
Invited
Fri, May 18, 3:30 PM - 5:00 PM
Grand Ballroom D
Organizer(s): Nozer D. Singpurwalla, City University of Hong Kong; Inez Zwetsloot, City University of Hong Kong
Chair(s): Inez Zwetsloot, City University of Hong Kong
3:30 PM
Building Resilient Communities: Harnessing the Power of Data
Presentation
Sallie Keller, Biocomplexity Institute of Virginia Tech
4:00 PM
Data Foundation for Defense Acquisition: How the Department of Defense Manages and Uses Data to Support Management and Decision-making on the High-value Major Defense Acquisition Programs
Nancy Spruill, OUSD(AT&L)/ARA
4:30 PM
On the the Role of Higher Order Topological Properties in Functionality of Complex Networks
Yulia Gel, UT Dallas
CS42 -
Invitation to Statistical Analysis and Data Mining
Invited
Fri, May 18, 3:30 PM - 5:00 PM
Grand Ballroom E
Organizer(s): Jia Li, Pennsylvania State University
Chair(s): Lynne Billard, University of Georgia
3:30 PM
Fitting High-Dimensional Function-on-Scalar Regression Models via a Functional Augmented ADMM
Matthew Reimherr, Penn State University
4:00 PM
Flexible Supervised Learning Techniques for Block-missing Data
Yufeng Liu, University of North Carolina at Chapel Hill
4:30 PM
Phyloclustering: A Model-Based Approach for Identifying Microbial Populations
Wei-Chen Chen, pbdR Core Team
CS43 -
Dynamic Structural Proteomics: Simulation, Visualization, and Nonparametric Estimation
Invited
Fri, May 18, 3:30 PM - 5:00 PM
Grand Ballroom F
Organizer(s): Juergen Symanzik, Utah State University
Chair(s): Daniel B. Carr, George Mason University
3:30 PM
Biomolecules in Motion: Sample-based Models of Dynamics Elucidating Function and Mechanisms in the Healthy and Diseased Cell
Amarda Shehu, George Mason University
4:00 PM
Local PCA and Extraction of Filamentary Structures
Wanli Qiao, George Mason University
4:30 PM
An Approach to Visualizing Simulated Protein Folding Energy Landscapes as a Function of Four to Six Principal Components
Daniel B. Carr, George Mason University
CS44 -
Data Science Platforms II
Invited
Fri, May 18, 3:30 PM - 5:00 PM
Grand Ballroom G
Organizer(s): Jim Harner, West Virginia University
Chair(s): Jim Harner, West Virginia University
3:30 PM
The Unified Analytics Platform: Unifying Big Data Workloads in Apache Spark
Presentation
Hossein Falaki, Databricks
4:00 PM
Using Microsoft ML Server and Spark for Distributed Computation of Massive Computational Experiments in Data Science and Statistical Inference
Ali Zaidi, Microsoft AI and Research
4:30 PM
The SAS® Platform: Where Point and Click Users and Coders of All Languages Collaborate Seamlessly
Carlos Pinheiro, SAS & Data Science Tech Institute, France
CS45 -
Statistical Machine Learning Applications in Surveys
Invited
Fri, May 18, 3:30 PM - 5:00 PM
Regency Ballroom A
Organizer(s): Wendy Martinez, U.S. Bureau of Labor Statistics
Chair(s): Wendy Martinez, U.S. Bureau of Labor Statistics
3:30 PM
Classification and Regression Trees and Forests for Imputing Data from Sample Surveys
Presentation
MoonJung Cho, U.S. Bureau of Labor Statistics
4:00 PM
Model-Assisted Survey Estimation With Modern Prediction Techniques
Jean Opsomer, Colorado State University
4:30 PM
Calling All Stakeholders: Developing a Demographic Statistical Redesign Agenda
Richard Levy, US Census Bureau
CS46 -
Data Sciences Applications for Critical Health Issues I
Invited
Fri, May 18, 3:30 PM - 5:00 PM
Lake Fairfax A
Organizer(s): William F. Wieczorek, SUNY Buffalo State
Chair(s): Jonathan Lindner, Center for Health and Social Research at SUNY Buffalo State
3:30 PM
Alcohol Abstainers versus Drinkers: Changes in Health Outcomes after 20 Years
Presentation
Kelly S Marczynski, SUNY Buffalo State
4:00 PM
Making Data Speak to User Needs: The Anchor Institution Dashboard
Alban Morina, Center for Health and Social Research at SUNY Buffalo State
4:30 PM
Conceptualization Issues in Analyzing and Communicating Collective Impact Data
Karl Wende, Center for Health & Social Research at Buffalo State
CS47 -
Time-to-Event Models
Contributed
Fri, May 18, 3:30 PM - 5:00 PM
Lake Fairfax B
Chair(s): Rida Moustafa, Walmart
3:30 PM
A Moving 2D Time Series Models
Presentation
Silvey Shamsi, Ball State University
3:45 PM
A Tool to Facilitate Creation of Multiple Time-Based Intervals per Subject
Presentation
Cynthia Sue Crowson, Mayo Clinic
4:00 PM
An Efficient Generalized Least Squares Algorithm for Periodic Regression With Autoregressive Errors
Jaechoul Lee, Boise State University
4:15 PM
Comparison of Emotional States by Time Series Connectivity Analysis of Brain Activity Data
Presentation
Rui Liu, Louisiana Tech University
4:30 PM
Floor Discussion
CS48 -
Distinguished Colleagues of Edward Wegman: Modeling and Data Science
Invited
Fri, May 18, 5:15 PM - 6:15 PM
Grand Ballroom D
Organizer(s): Yasmin H. Said, George Mason University
Chair(s): Yasmin H. Said, George Mason University
5:15 PM
The Revival of Statistical Ranking Methods in The High Technology and Big Data Era: Some Recent Developments
Michael G. Schimek, Medical University of Graz
5:45 PM
Communicating with Data Using Transparent Models
Roy E. Welsch, MIT
CS49 -
Data Analytics Supporting Homeland Security
Invited
Fri, May 18, 5:15 PM - 6:15 PM
Grand Ballroom E
Organizer(s): Eddie Fuller, West Virginia University and Homeland Security
Chair(s): Eddie Fuller, West Virginia University and Homeland Security
5:15 PM
Vast & Varied - Big Data at DHS
Aaron Mannes, Homeland Security
5:45 PM
Using Data Analytics to Support Disaster Response During Harvey and Irma: Social Media, Weather and Other Data Sources
Eddie Fuller, West Virginia University and Homeland Security
CS50 -
Data Science Platforms III
Invited
Fri, May 18, 5:15 PM - 6:15 PM
Grand Ballroom G
Organizer(s): Jim Harner, West Virginia University
Chair(s): Soren Harner, MuleSoft
5:15 PM
Intelligent Application Networks with MuleSoft and TensorFlow
Presentation
Soren Harner, MuleSoft
5:45 PM
An Introduction to the Watson Data Platform
Bernie Beekman, IBM
CS51 -
Predictive Big Data Analytics
Invited
Fri, May 18, 5:15 PM - 6:15 PM
Regency Ballroom A
Organizer(s): Jim Harner, West Virginia University
Chair(s): Jim Harner, West Virginia University
5:15 PM
Interpretable Machine Learning
Presentation
Patrick Hall, H2O.ai
5:45 PM
Big Data with R
Presentation
Edgar Ruiz, Rstudio
CS52 -
Outcomes from the SAMSI Climate Program
Invited
Fri, May 18, 5:15 PM - 6:15 PM
Lake Fairfax A
Organizer(s): David Banks, SAMSI and Duke University
Chair(s): David Banks, SAMSI and Duke University
5:15 PM
Modeling Large Spatial Data: an Application in Air Quality Modeling
Yawen Guan, SAMSI
5:45 PM
Inference on the Future State of the Climate Through Combining Multiple Interdependent Climate Model Outputs With Observations Using Bayesian Hierarchical Models
Huang Huang, SAMSI
CS53 -
Sports and Game Analytics
Contributed
Fri, May 18, 5:15 PM - 6:15 PM
Lake Fairfax B
Chair(s): Rida Moustafa, Walmart
5:15 PM
Predict Video Game Wheel Design Game Strategy
Mason Chen, Stanford OHS
5:30 PM
Apply Multivariate Data Mining on Playing Strategic Video Game
Patrick Giuliano, MorrillLearning Center
5:45 PM
Baseball Pitching and Swing Contact Modeling
Andrew Chen, University of San Francisco
6:00 PM
Predict Basketball Team Winning Record
Mason Chen, Stanford OHS
CS54 -
Dynamic Data Visualization
Contributed
Fri, May 18, 5:15 PM - 6:15 PM
Grand Ballroom F
Chair(s): Chris Gotwalt, JMP
5:15 PM
Dynamic Data Visualization: Bringing Data to Life
Neil W Polhemus, Statgraphics Technologies, Inc.
5:30 PM
Effective Story Telling with Dynamic Data Visualizations
Ruth M Hummel, JMP/SAS
5:45 PM
Exploratory Data Analysis for Predictive Analytics
Mia Stephens, JMP/SAS
6:00 PM
Discussant
Chris Gotwalt, JMP
↑