You can call us on +44 (0) 1 225 840 490

Name*
email*
Company*
Telephone*
Information*
  Please enter code (case sensitive)*:

Check

 
 

Data mining and modelling tools

The important processes that have to be clearly delineated for Data Mining and Data Modelling are:

  • Data model: what data will be available and how will it flow?
  • Data gathering: how will data be gathered both in physical and technological terms?
  • Data gathered: what data will be gathered?
  • Data types: what types of data will be gathered?
  • Data formatting: how will data be held?
  • Data warehousing: where will data be held?
  • Data mining: how will we retrieve data from the warehouse?
  • Information modelling: how will we create models and what of?
  • Information access: how will we access the data models and reports?
  • Presentation & reporting: on what will we report?

Most companies want to know essential information about customers at every point of contact, for example: 

  • Lifetime value
  • X sell and upgrade potential
  • Acquisition cost
  • Channel preferences
  • Loyalty/retention
  • Purchase behaviour patterns

Much of the data that they have will have different frequencies of change, refreshment or occurrence. It will be kept for different periods. In some cases, aggregated data may be kept rather than source data. All of these factors effect the data modelling exercise and the eventual modelling software requirements.

Turning the data into useful information requires: 

  • Identifying the issue(s)
  • Assembling the data set(s)
  • Building models
  • Verify models
  • Interpretation of the results
  • Automation of the delivery

Thereafter, modelling tools and techniques have to be used. These can be divided into two groups: theory driven and data driven.

Theory driven modelling (hypothesis testing) attempts to substantiate or disprove preconceived ideas. Theory driven modelling tools require the user to specify most of the model based on prior knowledge and then tests to see if the model is valid. 

Data driven modelling tools automatically create the model based on patterns they find in the data. This also needs to be tested before it can be accepted as valid. Modelling is an iterative process with the final model usually being a combination of prior knowledge and newly discovered information. The engine(s) tools and techniques include:

Statistical tests

  • Correlation
  • Analysis of Variance
  • Linear regression
  • Logistic regression
  • Discriminant analysis

Data driven tools

  • Cluster analysis
  • CHAID (Chi-square Automatic Interaction Detector) decision trees
  • Factor analysis
  • Neural networks
  • Visualisation tool

Written by Richard Hill



ECRM on FacebookECRM on Facebook
ECRM on Google+
ECRM on LinkedInECRM on LinkedIn
ECRM on TwitterECRM on Twitter
ECRM on Twylah
Richard on Google+Richard on Google Plus
Richard on LinkedInRichard on LinkedIn
Richard tweeting as cassyputRichard on Twitter
Richard on Twylah
Local navigationNewsletters

Daily news

2012

April: Website design & development brief

March: Web analytics and webmaster tools

February: Choosing a web design company

January: Mobile web & search strategy

2011      See 2011 list      Hide 2011 list

2010      See 2010 list      Hide 2010 list

2009      See 2009 list      Hide 2009 list

2008      See 2008 list      Hide 2008 list

2007      See 2007 list      Hide 2007 list

2006      See 2006 list      Hide 2006 list

2005      See 2005 list      Hide 2005 list

Essays    See 2005 list      Hide 2005 list

Take the Test
Take the Test Take the test and see how hard your website is working
How hard is your website working for your business? Take the test and see how hard your website is working
Case Studies