This is the program for the 2010 Joint Statistical Meetings in Vancouver, British Columbia.

Abstract Details

Activity Number: 248
Type: Contributed
Date/Time: Monday, August 2, 2010 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #308956
Title: LNRE Methods in Estimating Impressions in Search Engine Results
Author(s): Carrie Grimes and Meeyoung Park*+
Companies: Google and Google
Address: 1600 Amphitheatre Parkway, Mountain View, CA, 94043,
Keywords: LNRE ; MLE ; Good-Turing estimator ; sparse data ; impressions
Abstract:

Mining historical data to predict future appearances or "impressions" of a web page in search engine results is important for improving search engine efficiency. However, because the population of documents is extremely large, historical data about impressions may be very sparse. Here we use sampled query traffic to show that even for very large samples, the number of single-frequency pages continues to increase as the data collection time window expands. We introduce a novel approach to impression frequency estimation by using LNRE (Large Number of Rare Events) methods that have traditionally been used in modeling the occurrence of unique words in a text. Using the Good-Turing method, we obtain more accurate predictions than with the Maximum Likelihood Estimator. We show that this improved accuracy translates to substantive benefits in designing a multi-tiered web search index.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2010 program




2010 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.