Back to search menu
Saturday, February 22
Sat, Feb 22
7:30 AM - 2:30 PM
Ballroom Foyer
Registration
Registration
Sat, Feb 22
7:30 AM - 1:00 PM
Regency EF
Exhibits Open
Exhibits
Sat, Feb 22
8:00 AM - 9:15 AM
Regency EF
PS3 - Poster Session 3 and Continental Breakfast
Poster Session
Chair(s): Sudeshna Paul, Emory University
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Sat, Feb 22
9:15 AM - 10:45 AM
Regency A
CS17 - Essential Collaboration Skills
Concurrent Session
Chair(s): Paul Berg, Eli Lilly & Company
Sat, Feb 22
9:15 AM - 10:45 AM
Regency B
CS18 - Statistical Methods in Health Care
Concurrent Session
Chair(s): Mohammed Rahim Uddin Chowdhury, Kennesaw State University
Sat, Feb 22
9:15 AM - 10:45 AM
Regency C
CS19 - Taxonomy Stories: Human-Centered Classification
Concurrent Session
Chair(s): Joshua Lambert, University of Cincinnati
Sat, Feb 22
9:15 AM - 10:45 AM
Regency D
CS20 - Ethics Panel: Ethical Practices at the Intersection of Statistics and Public Service
Concurrent Session
Chair(s): David J. Corliss, Peace-Work
The focus for the Ethics Panel this year is the intersection of statistics and public service. This area leads to ethics questions on privacy of data in the public sphere, advising on ethical best practices for public sector agencies, financial support from industry for statistical testing e.g., pharma approvals, and working to do the best science in an increasingly politicized and polarized world.
Panelists:
Daniel Elchert, ASA Policy Fellow
Wendy Martinez, Bureau of Labor Statistics
Darius Singpurwalla, NSF/ National Center for Science and Engineering Statistics
Sat, Feb 22
10:45 AM - 11:00 AM
Regency EF
Refreshment Break
Other
Sat, Feb 22
11:00 AM - 12:30 PM
Regency A
CS21 - Going Public
Concurrent Session
Chair(s): Jay Mandrekar, Division of Biomedical Statistics and Informatics, Mayo Clinic
Sat, Feb 22
11:00 AM - 12:30 PM
Regency B
CS22 - Markov Models
Concurrent Session
Chair(s): Steven B. Cohen, RTI International
Sat, Feb 22
11:00 AM - 12:30 PM
Regency D
CS23 - Real-World Applications
Concurrent Session
Chair(s): Michelle Sarah Livings, University of Southern California
Sat, Feb 22
11:00 AM - 12:30 PM
Regency C
CS24 - Policy and Support for Practicing Statisticians
Concurrent Session
Chair(s): Ron Gangnon, University of Wisconsin-Madison
Sat, Feb 22
12:30 PM - 2:00 PM
Lunch (on own)
Other
Sat, Feb 22
2:00 PM - 4:00 PM
Regency B
PCD1 - Meta-Analysis Using Stata
Practical Computing Demo
Instructor(s): Houssein Assaad, StataCorp LLC
Organizer(s): Brooke Erchinger, StataCorp LLC
This workshop will cover the use of Stata to perform meta-analysis (MA), a statistical technique for combining the results from several similar studies. The course will provide a brief introduction to MA and will demonstrate how to perform MA in Stata 16. Stata’s new meta command offers full support for MA—from computing various effect sizes and producing basic meta-analytic summary and forest plots to accounting for between-study heterogeneity and potential publication bias. A number of case studies demonstrating how to conduct an MA within Stata will be provided. These examples will focus on the interpretation of MA under various models, meta-regression and its postestimation features, subgroup analysis, small-study effect and publication bias, and various types of forest, funnel, and other plots. No prior knowledge of Stata is required, but basic familiarity with MA will prove useful.
Outline & Objectives
Outline
This workshop is geared toward researchers wanting to perform MA and those who already
know about MA and wish to learn how to do it using Stata.
1. Brief overview of MA
2. Data setup and effect sizes using meta set and meta esize
• Effect sizes for binary data
• Effect sizes for continuous data
• Generic (precomputed) effect sizes
3. MA models
• Random-effects model (seven estimation methods)
• Fixed-effects model (Mantel–Haenszel and inverse-variance methods)
• Common-effect model (Mantel–Haenszel and inverse-variance methods)
4. Graphical and numerical MA summary using meta summarize and meta forestplot
• Standard MA
• Subgroup MA with one or many grouping variables
• Cumulative MA with and without stratification
1
5. Meta-regression
• Continuous and categorical moderators
• Fixed-effects and random-effects regression
• Multiplicative and additive residual heterogeneity
• Knapp–Hartung standard-error adjustment
• Postestimation features: prediction, bubble plots, etc.
6. Small-study effects and publication bias
• Standard and contour-enhanced funnel plots
• Traditional and random-effects versions of tests for funnel-plot asymmetry or
small-study effects
• Nonparametric trim-and-fill method
Performance objectives
Participants of this workshop will walk away with the following knowledge:
• A brief overview of MA as a statistical procedure
• How to declare and compute effect sizes using meta set and meta esize
• How to summarize the meta-analytic results via meta sumamrize and meta forestplot
• How to interpret the results under different MA models
• How to address the problem of heterogeneity
• How to perform meta-regression using meta regress
• How to assess the validity of the MA against the threat of publication bias
• How to test for funnel-plot asymmetry using meta bias
• How to conduct a trim-and-fill analysis using meta trimfill
• How to differentiate between various reasons behind funnel-plot asymmetry
This presentation will provide methods and formulas and demonstrate how to perform MA
with real data. Participants who bring their own laptop will be able to interactively follow
along provided they have Stata 16 installed and a working Internet connection for down-
loading datasets from http://www.stata-press.com. However, interactive participation is
not required. The notes will provide sufficient information to reproduce all analyses at the
attendees’ convenience.
About the Instructor
Houssein Assaad is a Senior Statistician and Software Developer at StataCorp LLC and
the primary developer of Stata’s MA suite. Houssein has a PhD in statistics from the
University of Texas at Dallas. He is a former research assistant professor at Texas A&M
University, where his research focused on longitudinal and functional data analysis.
Relevance to Conference Goals
This demonstration will provide researchers with the tools to use MA in real-world applica-
tions. Participants will learn about MA as a statistical procedure and how to perform the
steps of MA in Stata.
Sat, Feb 22
2:00 PM - 4:00 PM
Carmel AB
PCD2 - Introducing the SAS BGLIMM Procedure for Bayesian Generalized Linear Mixed Models
Practical Computing Demo
Instructor(s): Amy Shi, SAS Institute, Inc.
Organizer(s): Fang Chen, SAS Institute, Inc.
SAS/STAT® 15.1 includes PROC BGLIMM, a new, high-performance, sampling-based procedure that provides full Bayesian inference for generalized linear mixed models (GLMMs). PROC BGLIMM models data from the exponential family distributions that have correlations or nonconstant variability; uses syntax similar to that of the MIXED and GLIMMIX procedures (the CLASS, MODEL, RANDOM, REPEATED, and ESTIMATE statements); deploys optimal sampling algorithms that are parallelized for performance; handles multilevel nested and non-nested random-effects models; and fits models to multivariate or longitudinal data with repeated measurements. PROC BGLIMM provides convenient access, with improved performance, to Bayesian analysis of complex mixed models that you could previously perform with the MCMC procedure. This workshop starts with a general discussion of Bayesian GLMM, then presents the important features of PROC BGLIMM, showing you how to use it for estimation, inference, and prediction.
Outline & Objectives
OUTLINE
1. Overview of Bayesian GLMM
2. Syntax and options of PROC BGLIMM
3. Demonstration of PROC BGLIMM through examples
3.1 Simple normal regression
3.2 Logistic regression with random intercepts
3.3 Normal regression with repeated measurements
3.4 Non-nested logistic random-effects model with prediction
3.5 Poisson regression with random effects
3.6 Repeated growth measurements with internal difference
Target Audience
This presentation is intended for a broad audience of statisticians who are interested in Bayesian inference for generalized linear mixed models. It would be helpful for attendees to have a basic understanding of normal regression analysis, generalized linear mixed models, and Bayesian methods, but it is not required.
LEARNING OUTCOMES
(a) Performance objectives
By attending this presentation, participants will improve their knowledge of generalized linear mixed models and Bayesian methods, and they will be able to use the BGLIMM procedure in SAS/STAT software to conduct Bayesian analyses.
(b) Content and instructional methods
The presentation will alternate between the use of slides and software demonstrations. Handouts given to attendees will cover both.
About the Instructor
Amy Shi is a senior research statistician developer in the Advanced Analytics Division at SAS Institute Inc. She received a Ph.D. in biostatistics from the University of North Carolina at Chapel Hill. She joined SAS in 2010, and her work involves implementation of Bayesian methods in software. Amy’s main responsibility is developing and enhancing SAS’ Bayesian capability, with a focus on generalized linear mixed models, discrete choice models, and multilevel hierarchical settings.
Relevance to Conference Goals
Sat, Feb 22
2:00 PM - 4:00 PM
Big Sur AB
PCD3 - AutoStat: A Single Application for Visualization, Data Querying, and Analytics Encompassing AI, Machine Learning, and Statistics
Practical Computing Demo
Organizer(s): Clair Alston-Knox, Predictive Analytics Group
Data is abundant in modern society, and a raft of statistical and machine learning algorithms have been developed to assist researchers, managers and lay-people to understand what inferences can be made from their data, and what decisions would best progress their goal. And yet, the current \p-value crisis in science" is evidence that even in the scientific community, access to these sophisticated algorithms is not owing through to many researchers, particularly those who do not have dedicated statistical or data science support.
The AutoStat Institute was founded by a group of academics and consultants who believe that this issue is, in a large part, due to the need to code in programs like R or Python to gain access to these algorithms. While the operability of these and similar platforms continues to improve, there are many potential users of data that will never have the skill set, time, interest or level of exposure required to become au fait with these packages. As a result, many users are excluded from realising the potential of the Big Data World by virtue of a coding barrier. AutoStat solves this problem by offering its users a modern feel GUI environment for sophisticated statistical analysis that aims to provide academics, students, business and interested people access to scalable modern algorithms and visualizations in a code free environment.
Outline & Objectives
This 2-hour workshop will focus on the user experience and provide practical demonstrations of both businessand research projects from the practical implementations of
Data management: Making new variables with the calculator tool, various methods for easy
data splitting test / train, merging datasets and much more.
Visualisations: Easy exploratory plots through to sophisticated layering approaches for publication and presentation quality output.
Model Building: A range of machine learning and statistical models (both frequentist and
Bayesian approaches)
Results and Inference: Standard outputs and tools to create users own inference metrics
Team work: Project sharing and collaboration from early stage data management, modeling
and report writing,
Tutorials and other help facilities to enable the user to get full benefit from their data analysis.
We will then illustrate how the software can enhance the research or business output using real case studies and implementing the following tools:
Pipeline construction for ease of updating results as new data becomes available via easy point,
click and record;
Dashboard building for effective deployment to end users and broadening the reach of your
research;
Document builders that are available in AutoStat with a range of templates that can be
customized by the user.
About the Instructor
Dr Clair Alston-Knox is a Senior Statistician with Predictive Analytics Group (Melbourne Australia). She had been an research and academic statistician since 1992, with a number of biometric and statistical consulting positions in government and universities. She joined Predictive Analytics and the AutoStat Institute in 2018 because her teaching, consulting, advising and ethics committee roles were frequently frustrated by researchers who were very capable of understanding the objective and benefits of statistical or machine learning approaches, but did not have the resources to learn the required platform to enable next level analysis.
Dr Theo Gazos is the Managing Director of Predictive Analytics Group. Theo has over 25 years of experience building economic and econometric models that isolate and quantify the impact of changing market dynamics (domestic and international), competition effects and government policy on private and government sector organisations. Theo is passionate about bringing the power of statistics and machine learning to all levels within organisations, and has used his years of experience to develop an interface and user ow within AutoStat R that makes this objective achievable.
Relevance to Conference Goals
Communication, collaboration and career development
AutoStat is an ideal environment for sophisticated statistical analysis, such as Bayesian models with stochastic search variable selection. The report building, collaboration and visualization feature all assist users in communicating outcomes.
Data Modeling and Analysis, Data Science and Big Data
AutoStat will help different users of big data in many different ways. For example, the point and click nature of AutoStat will allow data analysts to perform sophisticated machine learning and produce the standard results by default without needing to implement code, decide on the most appropriate libraries or construct their own visualisations. The Bayesian models provided in AutoStat R are highly optimised and scalable to big data. Default settings have been based on the latest research in the area of each model, are well documented and are prominently displayed so that users are aware of their settings (and can easily change them).
Software, Programming and Data Visualization
AutoStat provides modern graphics using drop and drag, with many customisable styles and the option of layering within charts. Users can produce high quality graphics without the need to code.
Sat, Feb 22
2:00 PM - 4:00 PM
Regency A
T1 - Applied Use of R, GitHub, and Markdown for Reproducible Workflows for Small Data Teams
Tutorial
Many organizations have limited personnel and resources available to building efficient data workflows. As organizations grow, having solid documentation of processes, reproducible analyses and systemic collaboration tools are essential for maintaining efficient workflows.
This tutorial will walk through setting up documentation and reproducibility using R, Github and Markdown for emerging data scientists and small data teams. Participants will learn best practices for documentation and collaboration, and essential elements for reproducibility via hand-on training in RStudio.
Following this session, participants will have the tools to return to their organizations ready to build reproducible, documented data workflows.
Outline & Objectives
Students will obtain the following hands-on skills:
1. Foundational understanding of why documentation and reproducibility are important.
2. Setup and installation of required software to build workflows in RStudio, Github and documentation in R-Markdown.
3. Understand the necessary components of reproducibility, including:
a. Identified data sources
b. Clear workflows and timelines
c. Version control and code
4. Understand the necessary components of documentation, including:
a. Metadata
b. Building organizational best-practices
c. The fundamentals of useful commenting
d. Combining narrative, code and documentation
e. Organizational transparency
Following this session, participants will have the tools to return to their organizations ready to build reproducible, documented data workflows.
About the Instructor
Dr. Karin Neff is the Data and Assessment specialist for Bozeman Public Schools where she works in a data-team of one to build data stories to aid in student growth and achievement. Dr. Neff relies heavily on open source tools to maintain analytic integrity and reproducibility in the public sector. Dr. Neff received her doctorate in Ecology and Environmental Sciences from Montana State University where she helped develop laboratory best practices, contributed to documentation strategies and mentored emerging scientists.
Relevance to Conference Goals
This course will provide an opportunity for emerging analysts to establish best practices in reproducibility and documentation that will serve them for their entire careers. It will also provide tools and information for organizations with small data teams to build workflows that will scale as their organizations and analytic needs grow.
Sat, Feb 22
2:00 PM - 4:00 PM
Regency C
T3 - Project Management Principles for Statisticians
Tutorial
Project Management Institute (PMI) indicated that: - 58% of organizations fully understand the value of project management - 93% of organizations report using standardized project management practices - 68% of organizations in PMI’s annual survey said that they used outsourced or contract project managers in 2018 - 23% of organizations use standardized project management practices across the entire organization - 33% use standardized practices, but not across all departments - 7% of organizations don't use any standard practices at all
Outline & Objectives
Scope: The goal of this workshop is to demonstrate how to apply the basic principles of the Project Management Institute's Body of Knowledge (PMBOK) the workplace.
Objectives:
? Learn the basic PMBOK templates, such as charter, project plan, budget, risk management, and presentation;
? Understand how to use the basic PMBOK templates using Google drive;
? Draft a charter, project plan, budget, risk management, and presentation on Google.
Benefits:
? Understand the principles of project management based on the PMBOK
? Learn how to apply basic project management tools such as project charter, project management plan, and risk management plan; and
? Draft a presentation for managers.
Level: Basic
Software: Google drive
About the Instructor
Ana Valentín serves as an Enterprise Service Program Manager for the Enterprise Service Branch in the Service Delivery Division under National Oceanic and Atmospheric Administration (NOAA) Office of Chief Information Officer. In this capacity, Ana leads various teams of technology projects strengthen NOAA’s Mission. Ana promotes diversity and inclusion through the Latinos@NOAA Employees Resource Group (ERG) an organization that she co-founded on 2014 and recipient of the 2018 NOAA’s Administrator Award. Ana taught undergraduate statistics and math courses and a graduate clinical research course for six years. Ana also had published research articles and has been presenting at the League United Latin American Citizens Federal Training Institute national conferences professional development workshops. Ana has a BA and MPH from the University of Puerto Rico, a MS from University of Fairfax, and graduate certificates from: George Washington University, University of Maryland University College, and the United States ARMY War College. In her spare time, Ana collaborates with various non-profit, while pursuing a D.Sc. on Cyber-security from Marymount University in Virginia.
Relevance to Conference Goals
Relevance to Conference Goals:
? Better communicate and collaborate with their clients and customers
? Have a positive effect on their organization or enhance their professional
Sat, Feb 22
2:00 PM - 4:00 PM
Regency D
T4 - Introduction to Bayesian Data Analysis
Tutorial
Instructor(s): An-Ting Jhuang, UnitedHealth Group R&D; Christina Phan Knudson, University of St. Thomas
Download Handouts
This short course introduces Bayesian statistics at a level appropriate for all practitioners in both academia and industry. This two-hour course introduces fundamental Bayesian concepts, model creation, diagnostics, and interpretation of results.
Examples and sample code will develop participants’ intuition and practical abilities. Learners will understand the differences between frequentist statistics and Bayesian statistics; explain the importance and use of priors, posteriors and likelihoods; understand the use and function of Markov chain Monte Carlo (MCMC) methods; write R code to create Bayesian models; examine convergence of posterior samples; and integrate results into decision-making.
Participants will implement these skills with several examples using practical models (linear regression and logistic regression) with real-world data sets. This workshop will broaden participants’ skill-sets for solving real-world problems.
Outline & Objectives
1. Intro to Bayesian concepts
2. Examples: coin flip, linear regression, logistic regression
3. Interpreting results
4. Conjugate priors
5. MCMC samplers: Why do we need them? How do they work?
6. MCMC convergence: definition, intuition, diagnostics, R code, packages
7. Larger exampler with survival data
--Basic goal: create a logistic regression to model the log odds of survival based on various predictors (e.g. gender, fare class, adult vs child)
--Intermediate: prediction
--Advanced: evaluate impact of the prior distribution, the Monte Carlo sample size, the inclusion/exclusion of variables
8. Review
Goal: introduce participants to the Bayesian statistical framework. Participants will understand and gain hands-on experience with priors, likelihoods, and posteriors; Markov chain Monte Carlo (MCMC) samplers; MCMC convergence; and the basic Bayesian workflow.
About the Instructor
An-Ting Jhuang holds a PhD in statistics from North Carolina State University. She has developed new Bayesian methods to tackle problems in epidemiology and material science. Her research focuses on sparse signal detection in spatial and spatiotemporal statistics, and exposure assessment. She is a Principal Data Scientist at UnitedHealth Group Research & Development in Minnesota. On a day-to-day basis, she identifies research directions and applies statistical methods to solve scientific and business questions in the health-care field.
Christina Knudson holds a PhD in statistics from the University of Minnesota. She is an assistant professor at the University of St. Thomas in Minnesota. She is the author and maintainer of the R package glmm, which is downloaded from CRAN over 1000 times per month. Her most recent contribution is “Revisiting the Gelman-Rubin Diagnostic” (Vats and Knudson), which stabilizes the Gelman-Rubin (GR) statistic, proposes a principled GR threshold for terminating samplers, and connects effective sample size to the GR statistic. Additionally, she is the organizer of the Twin Cities chapter of R Ladies.
Relevance to Conference Goals
Our goal of jump-starting participants’ Bayesian statistics abilities directly aligns with the conference goal of providing participants with the opportunity to learn new statistical methodologies and best practices in statistical analysis. We have designed this short course to broaden applied statisticians skill sets so that they can better consult with and aid customers and organizations solve real-world problems. Our short course will teach statistical techniques that participants can apply to their jobs as applied statisticians; participants will leave the workshop having practiced with several examples using practical models (linear regression and logistic regression) with real data sets.
Sat, Feb 22
4:00 PM - 4:15 PM
Regency EF
Refreshment Break
Other
Sat, Feb 22
4:15 PM - 5:30 PM
Regency A
GS2 - Closing Session
General Session
Chair(s): David J. Corliss, Peace-Work
The Closing Session is an opportunity for you to interact with the CSP Steering Committee in an open discussion about how the conference went and how it could be improved in future years. CSPSC 2021 vice chair, David J. Corliss, will lead a panel of committee members as they summarize their conference experience. The audience will then be invited to ask questions and provide feedback. The committee highly values suggestions for improvements gathered during this time. Each attendee will have an opportunity to win a door prize.