All Times EDT

Abstract Details

Activity Number: 182 - Modern Applications of Statistical Methods in Marketing
Type: Contributed
Date/Time: Tuesday, August 4, 2020 : 10:00 AM to 2:00 PM
Sponsor: Section on Statistics in Marketing
Abstract #313023
Title: A "Novel" Application of Representation Learning to Understand How People Consume Books
Author(s): Sarah Cox*
Companies: Penguin Random House
Keywords: representation learning; glove; natural language processing; embeddings
Abstract:

To sell products, it is critical to understand what you're selling and how each product relates to others. There are many ways of building taxonomies, often manually, but these tend to be difficult and time-consuming. We can use clustering methods to achieve this goal, but this tends to be easier when we have dense features, as opposed to the sparse features often found in e-commerce data sets. At Penguin Random House, we might be interested in automatically identifying which of the books we publish might be considered part of a series (e.g. George R. R. Martin's "A Song of Ice and Fire" novels). How might we automatically identify those if the best data we have are records of which consumers purchased which book, and when? Can we use the order in which readers made purchases to tell us something more about our own books? We will discuss the use of algorithms typically used for word representation (e.g. GloVe) to create product embeddings from the Penguin Random House library based on a "language" of product IDs instead of words. We will visualize these representations and discuss ways to use them to understand the way that consumers purchase and read the books that we publish.
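
The idea of treating each reader's ordered purchase history as a "sentence" of product IDs can be sketched roughly as follows. This is only an illustration under stated assumptions, not the pipeline used in the talk: the product IDs and purchase histories are made up, the function names (`cooccurrence`, `train_glove`, `cosine`) are hypothetical, and the fit uses plain stochastic gradient descent on the GloVe weighted least-squares objective rather than the AdaGrad optimizer of the reference GloVe implementation.

```python
import math
import random
from collections import defaultdict


def cooccurrence(sequences, window=2):
    """Count symmetric co-occurrences of product IDs within a purchase window.

    Pairs that are d purchases apart contribute 1/d, as in GloVe.
    """
    counts = defaultdict(float)
    for seq in sequences:
        for i, center in enumerate(seq):
            for j in range(max(0, i - window), i):
                weight = 1.0 / (i - j)
                counts[(center, seq[j])] += weight
                counts[(seq[j], center)] += weight
    return counts


def train_glove(counts, dim=8, epochs=200, lr=0.05, x_max=10.0, alpha=0.75, seed=0):
    """Fit product vectors by SGD on the GloVe objective (assumed settings)."""
    rng = random.Random(seed)
    ids = sorted({p for pair in counts for p in pair})

    def init():
        return [rng.uniform(-0.5, 0.5) / dim for _ in range(dim)]

    w = {p: init() for p in ids}   # "word" (product) vectors
    c = {p: init() for p in ids}   # context vectors
    bw = {p: 0.0 for p in ids}     # product biases
    bc = {p: 0.0 for p in ids}     # context biases
    pairs = sorted(counts.items())
    losses = []
    for _ in range(epochs):
        rng.shuffle(pairs)
        total = 0.0
        for (i, j), x in pairs:
            f = min(1.0, (x / x_max) ** alpha)  # GloVe weighting function
            dot = sum(a * b for a, b in zip(w[i], c[j]))
            diff = dot + bw[i] + bc[j] - math.log(x)
            total += f * diff * diff
            g = lr * f * diff  # constant factor 2 folded into lr
            for k in range(dim):
                wi, cj = w[i][k], c[j][k]
                w[i][k] -= g * cj
                c[j][k] -= g * wi
            bw[i] -= g
            bc[j] -= g
        losses.append(total)
    # Sum product and context vectors, as the GloVe authors suggest.
    embeddings = {p: [a + b for a, b in zip(w[p], c[p])] for p in ids}
    return embeddings, losses


def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


# Hypothetical purchase histories: each list is one reader, in purchase order.
histories = [
    ["a1", "a2", "a3"],
    ["a1", "a2", "a3", "b1"],
    ["b1", "b2", "b3"],
    ["b1", "b2", "b3", "a1"],
    ["a1", "a2", "b2"],
]

embeddings, losses = train_glove(cooccurrence(histories))
print("similarity a1 vs a2:", round(cosine(embeddings["a1"], embeddings["a2"]), 3))
print("similarity a1 vs b3:", round(cosine(embeddings["a1"], embeddings["b3"]), 3))
```

The resulting dense vectors could then be clustered or projected to two dimensions for visualization. On real data, the window size, embedding dimension, and `x_max` would need tuning, and a toy corpus this small says nothing about how well series are actually recovered.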


Authors who are presenting talks have a * after their name.
