Data Mining Concepts and Tools

Jump to a chapter

Introduction (00:08)
What Does Data Mining Do? (00:46)
Process of Data Mining (01:15)
Mining Architecture: SSAS and Data Mining Tools (02:29)
Tools for Data Mining (04:54)
Demo Introduction: Tour of Data Mining Tools (08:10)
Demo: Excel and Data Mining Add-ins (08:41)
Demo: Data Mining with SSDT (10:11)
Mining Cases (17:10)
Mining Structures (20:06)
Mining Models (21:50)
Mining Algorithms (22:50)
Describing Cases Columns and Case Keys (26:35)
Column Data Types (28:09)
Column Content Types (29:52)
Demo: Data and Content Types (32:38)
Column Usage, Input and Predict (34:38)
Demo: Input and Predict Columns in a Mining Model (36:58)
Discretization of Data (38:10)
Column Distributions (40:10)
Nested Cases (41:58)
Demo: Nested Cases with Decision Trees (43:30)
Next Steps: Building a Model (48:19)
Summary (49:03)

This 50-minute video introduces the fundamental concepts of Data Mining, a powerful analytical technology. We start by introducing to you the process of data mining and the SQL Server Analysis Services (SSAS) Data Mining architecture, using the Multidimensional and Data Mining Mode mode of SSAS. We introduce data mining tools, starting with Excel with the free Data Mining Add-Ins for Oﬃce, and focusing on SQL Server Data Tools (SSDT), which are well suited to longer-duration analytical mining projects.

The most fundamental concept in data mining is that of Cases, which represent the entities you wish to analyse, such as customers, products, or events. The simplest form of a case is just a flat, denormalized row of data. We briefly explain other formats of cases, too: Customer Signatures, which contains as-of validity dates, and Nested Cases, on which we focus towards the end of this tutorial, when you can also see a demo comparing the use of Decision Trees with, and without, nesting.

You will also learn about the concepts of Mining Structures, used to describe Cases, Mining Models, and Mining Algorithms. We briefly introduce 9 of the Microsoft data mining algorithms: Naïve Bayes, Clustering, Decision Trees, Association Rules, Sequence Clustering, Neural Networks, Logistics Regression, Linear Regression, and Time Series. They will be explained in more detail in other modules of this series.

The remainder of this module discuses Column Data Types (especially Text, Long, and Double), and Column Content Types, focussing on the diﬀerences between Continuous, Discrete, and Discretized data. You will hear about diﬀerent approaches to automatic Discretization, including Equal Areas, Clusters, and Thresholds technique, and about assisting the algorithms by hinting the statistical distribution of data in a column, such as Normal, LogNormal, or Uniform.

To help you learn, there are 5 demos in this module, which you can follow using your own datasets, Adventure Works from GitHub, or by downloading our educational dataset, HappyCars, available when you purchase access to this course.

Log in or purchase access to play the video.

Data Mining with SQL Server SSAS

Introduction to Data Mining with Microsoft SQL Server 24-min Watch with Free Subscription
Data Mining Concepts and Tools 50-min
Data Mining Model Building, Testing and Predicting with Microsoft SQL Server and Excel 1-hour 20-min
What Are Decision Trees? 10-min Free—Watch Now
Decision Trees in Depth 1-hour 54-min
Why Cluster and Segment Data? 9-min Watch with Free Subscription
Clustering in Depth 1-hour 50-min
What is Market Basket Analysis? 10-min Watch with Free Subscription
Association Rules in Depth 1-hour 35-min
HappyCars Sample Data Set for Learning Data Mining
Additional Code and Data Samples (R, ML Services, SSAS) Get with Free Subscription

Purchase a Full Access Subscription

Individual Subscription

$480/year

Access all content on this site for 1 year.
Purchase

Group Purchase

from $480/year

For small business & enterprise.
Group Purchase

You can also redeem a prepaid code.
Payments are instant and you will receive a tax invoice straight away.
We oﬀer sales quotes/pro-forma invoices, and we accept purchase orders and bank transfers.
Your satisfaction is paramount: we oﬀer a no-quibble refund guarantee.
See pricing FAQ for more detail.

Data Mining Concepts and Tools Purchase the entire course

Fundamentals of SQL Server Data Mining

Jump to a chapter

Data Mining with SQL Server SSAS

Purchase a Full Access Subscription

Individual Subscription

Group Purchase

In collaboration with

Company

Courses

Resources

Help

Search form

Data Mining Concepts and Tools Purchase the entire course

Fundamentals of SQL Server Data Mining

Jump to a chapter

Data Mining with SQL Server SSAS

Purchase a Full Access Subscription

Individual Subscription

Group Purchase

Get the Newsletter

In collaboration with