Abstract:
|
This one-day seminar will be a practical introduction to and an overview of data mining. Many of the standard techniques of data mining, including modern regression methods (lasso, etc.), regression trees, neural networks, principal component regression, random forests, and boosting method will be covered. The course will be problem solving-based, using real case studies from industry to illustrate which methods work well, when, and why. We will emphasize problem formulation, the challenges of the data, and the communication back to decision makers to effect maximum impact in the organization. No prerequisites other that a knowledge of the basics of regression are assumed. The applications will come from a variety of industries and include some applications from my personal experiences as a consultant for companies that deal with such topics as financial services, chemical processing, pharmaceuticals, and insurance.
|