From Data to Decision with Big Data and Predictive Analytics

COURSE OVERVIEW

If you are trying to make sense of the data you have access to, or want to analyse unstructured data available on the net (such as Twitter or LinkedIn), this course is for you.

It is mostly aimed at decision makers and people who need to choose what data is worth collecting and what is worth analyzing.

It is not aimed at people who configure the solution, although they will benefit from the big picture.

Quick Overview

Data Sources
Mining Data
Recommender systems
Target Marketing

Data Types

Structured vs unstructured
Static vs streamed
Attitudinal, behavioural and demographic data
Data-driven vs user-driven analytics
Data validity
Volume, velocity and variety of data

Models

Building models
Statistical Models
Machine learning

Data Classification

Clustering
kGroups, k-means, nearest neighbours
Ant colonies, bird flocking
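
As an illustration of the clustering topics above, here is a minimal k-means sketch in plain Python. It is not course material: the function name k_means, the sample points, and the choice to start from the first k points (real implementations usually pick starting centroids at random) are all assumptions made for this example.

```python
def k_means(points, k, iterations=20):
    """Cluster 2-D points into k groups (Lloyd's algorithm, simplified)."""
    # Assumption: start from the first k points instead of a random choice,
    # so the example is deterministic.
    centroids = list(points[:k])
    clusters = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(
                range(k),
                key=lambda i: (p[0] - centroids[i][0]) ** 2
                + (p[1] - centroids[i][1]) ** 2,
            )
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        for i, cluster in enumerate(clusters):
            if cluster:  # an empty cluster keeps its previous centroid
                centroids[i] = (
                    sum(x for x, _ in cluster) / len(cluster),
                    sum(y for _, y in cluster) / len(cluster),
                )
    return centroids, clusters


# Hypothetical data: two visibly separate groups of customers.
points = [(1.0, 1.0), (1.2, 0.8), (0.8, 1.1), (8.0, 8.0), (8.2, 7.9), (7.9, 8.3)]
centroids, clusters = k_means(points, k=2)
for centroid, cluster in zip(centroids, clusters):
    print(centroid, cluster)
```

On data this cleanly separated the two groups are recovered after a few iterations; on real data the result depends on the starting centroids and on the choice of k.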

Predictive Models

Decision trees
Support vector machines
Naive Bayes classification
Neural networks
Markov models
Regression
Ensemble methods

ROI

Benefit/Cost ratio
Cost of software
Cost of development
Potential benefits
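
The ROI items above can be made concrete with a small calculation. The sketch below assumes that total cost is simply software cost plus development cost, and all figures are invented for illustration; a real business case would also account for running costs, time horizon and discounting.

```python
def benefit_cost_ratio(benefits, software_cost, development_cost):
    """Return potential benefits divided by total cost."""
    total_cost = software_cost + development_cost
    if total_cost <= 0:
        raise ValueError("total cost must be positive")
    return benefits / total_cost


def return_on_investment(benefits, software_cost, development_cost):
    """Return net gain (benefits minus cost) as a fraction of total cost."""
    total_cost = software_cost + development_cost
    if total_cost <= 0:
        raise ValueError("total cost must be positive")
    return (benefits - total_cost) / total_cost


# Hypothetical figures, in any single currency.
ratio = benefit_cost_ratio(300_000, software_cost=50_000, development_cost=100_000)
roi = return_on_investment(300_000, software_cost=50_000, development_cost=100_000)
print(f"Benefit/Cost ratio: {ratio:.2f}")  # 300000 / 150000
print(f"ROI: {roi:.0%}")                   # (300000 - 150000) / 150000
```

A ratio above 1 means the estimated benefits exceed the estimated costs; the ROI expresses the same comparison as a net gain relative to what was spent.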

Building Models

Data Preparation (MapReduce)
Data cleansing
Choosing methods
Developing model
Testing Model
Model evaluation
Model deployment and integration
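
The model-building steps above can be sketched end to end on a toy data set. This is only an illustration under stated assumptions: the records are made up, cleansing is reduced to dropping rows with missing values, the method chosen is a one-variable least-squares line, and the train/test split is by position rather than random. MapReduce-scale preparation and deployment are outside the scope of the sketch.

```python
def cleanse(rows):
    """Data cleansing: drop records that have a missing value."""
    return [(x, y) for x, y in rows if x is not None and y is not None]


def fit_line(rows):
    """Developing the model: least-squares fit of y = slope * x + intercept."""
    n = len(rows)
    mean_x = sum(x for x, _ in rows) / n
    mean_y = sum(y for _, y in rows) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in rows)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in rows)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mean_absolute_error(model, rows):
    """Model evaluation: average absolute prediction error."""
    slope, intercept = model
    return sum(abs(y - (slope * x + intercept)) for x, y in rows) / len(rows)


# Hypothetical raw records (x, y); None marks a missing value.
raw = [(1, 2.1), (2, 3.9), (3, None), (4, 8.2), (5, 9.8),
       (None, 4.0), (6, 12.1), (7, 13.9), (8, 16.0)]

clean = cleanse(raw)
train, test = clean[:5], clean[5:]        # hold back the last rows for testing
model = fit_line(train)                   # fit on training rows only
error = mean_absolute_error(model, test)  # evaluate on unseen rows
print(f"slope={model[0]:.3f} intercept={model[1]:.3f} test MAE={error:.3f}")
```

Keeping the test rows out of the fitting step is the point of the exercise: the error is measured on data the model has not seen.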

Overview of Open Source and commercial software

Selected R project packages
Python libraries
Hadoop and Mahout
Selected Apache projects related to Big Data and Analytics
Selected commercial solutions
Integration with existing software and data sources

Requirements

Understanding of traditional data management and analysis methods such as SQL, data warehouses, business intelligence and OLAP
Understanding of basic statistics and probability (mean, variance, probability, conditional probability, etc.)

Duration

21 hours (usually 3 days including breaks)

COURSE COMPLETION

This course helps decision makers, and others who need to choose, decide what data is worth collecting and what is worth analyzing.

CREDIT BEARING

This course is NOT credit bearing

COURSE LICENCE

This course is available under Attribution-ShareAlike 2.0 South Africa
