eventscribe

The eventScribe Educational Program Planner system gives you access to information on sessions, special events, and the conference venue. Take a look at hotel maps to familiarize yourself with the venue, read biographies of our plenary speakers, and download handouts and resources for your sessions.

close this panel
support

Technical Support


Phone: (410) 638-9239

Fax: (410) 638-6108

GoToMeeting: Meet Now!

Web: www.CadmiumCD.com

close this panel
←Back

249 – Estimation and Inference Methods with Complex Survey Data

Insight Discovery for Decision Tree Models

Sponsor: Section on Statistical Learning and Data Mining
Keywords: Data Mining, Decision Tree, Tree Diagram, Data Insight

Jing Shyr

IBM

Jane Chu

IBM

Weicai Zhong

IBM

The decision tree model is a popular data mining tool in predictive analytics. The goal of building it in most applications is for prediction only. The question of identifying which leaf nodes have different target distributions from the root node remains unanswered. Such a missing part should provide further insights and understanding into the predictive structure of the data while it is often overlooked by a tree model user. It might be possible to find some of such leaf nodes in checking the tree diagram when the number of leaf nodes is small or the target distributions between leaf nodes and the root node are very different. However, it becomes more challenging or even impossible when there exist hundreds of leaf nodes or the target distributions between leaf nodes and the root node are not that different. In this paper we propose a systematic and efficient system to identify these leaf nodes based on several tests and present the results in an intuitive way with graphs and texts so it is easy for a tree model user to discover insights. In addition, all tests are based on already computed statistics in the leaf nodes, therefore there is little extra computational cost.

"eventScribe", the eventScribe logo, "CadmiumCD", and the CadmiumCD logo are trademarks of CadmiumCD LLC, and may not be copied, imitated or used, in whole or in part, without prior written permission from CadmiumCD. The appearance of these proceedings, customized graphics that are unique to these proceedings, and customized scripts are the service mark, trademark and/or trade dress of CadmiumCD and may not be copied, imitated or used, in whole or in part, without prior written notification. All other trademarks, slogans, company names or logos are the property of their respective owners. Reference to any products, services, processes or other information, by trade name, trademark, manufacturer, owner, or otherwise does not constitute or imply endorsement, sponsorship, or recommendation thereof by CadmiumCD.

As a user you may provide CadmiumCD with feedback. Any ideas or suggestions you provide through any feedback mechanisms on these proceedings may be used by CadmiumCD, at our sole discretion, including future modifications to the eventScribe product. You hereby grant to CadmiumCD and our assigns a perpetual, worldwide, fully transferable, sublicensable, irrevocable, royalty free license to use, reproduce, modify, create derivative works from, distribute, and display the feedback in any manner and for any purpose.

© 2013 CadmiumCD