Synthesising Multiple Linked Data Sets and Sequences in R
In my last post I looked at generating synthetic data sets with the ‘synthpop’ package, some of the challenges and […]
In my last post I looked at generating synthetic data sets with the ‘synthpop’ package, some of the challenges and […]
Synthpop – A great music genre and an aptly named R package for synthesising population data. I recently came across […]
In my last post I coded Liar’s Dice in R and some brainless bots to play against. I build on […]
I have been playing Red Dead Redemption 2, immersing myself in the Old West as I did with the first […]
Bayesian Networks are probabilistic graphical models and they have some neat features which make them very useful for many problems. […]
I recently purchased a new laptop with an Intel i7-8750 6 core CPU with multi-threading meaning I have 12 logical […]
Cribbage is one of my favourite card games and have been playing it ever since I could count. It is […]
The problem Consider a case where we have a bag of marbles of size . The bag consists of black […]
There are a few ways to reduce the dimensions of large data sets to ensure computational efficiency such as backwards […]
Sensitive data has it’s restrictions for good reason. Personal data such as names and other identifiable information should be protected. […]
I like to describe Official statistics as the All Bran of statistics, it’s bland and a bit boring but it […]
Neural networks evolved in the computer science domain are often the first thing people think of when they hear machine […]
A need for unsupervised learning or clustering procedures crop up regularly for problems such as customer behavior segmentation, clustering of […]
Recently I was asked about the design matrix (or model matrix) for a regression model and why it is important. […]
Outlier detection and treatment is an important part of data analysis and one of the first things you should do […]