Tutorial: Data Analytics using R

An Introduction to Data Mining with Rattle and R

Dr Graham Williams,
Senior Director, Data Mining,
Australian Taxation Office

Wednesday, 13 November 2013, 15:30 - 17:30


R has become the statistical programming language of choice for practising data miners. It is the most feature-rich statistical package, outshining the commercial data mining tools, and it is free and open source. There are over 2000 packages available to support any kind of analysis required.

This hands-on tutorial will be conducted in a computer laboratory setting. We will introduce the use of R for data mining, and focus on the common process for delivering models using R. A number of tools for parallel processing and handling larger datasets will be introduced. No previous experience with R is assumed.

Speaker Bio:

Dr Graham Williams is Director of Data Mining at the Australian Taxation Office, and previously Principal Computer Scientist for Data Mining with CSIRO Australia. He is a Senior International Expert and Visiting Professor of the Chinese Academy of Sciences at the Shenzhen Institutes of Advanced Technologies. He is also Adjunct Professor, Data Mining, Fraud Prevention, Security, University of Canberra, and Australian National University.

Graham has been involved in data mining since the 1990s as a researcher and practitioner. He has led projects with clients including the Health Insurance Commission, the Australian Taxation Office, the Commonwealth Bank, NRMA Insurance Limited, the Commonwealth Department of Health and Ageing, Queensland Health, and the Australian Customs Service. He has developed software and hardware environments for data mining, and implemented web services for the delivery of data mining. His research has included the development of Multiple (or Ensemble) Decision Tree Induction (1989), HotSpots for identifying target areas in very large data collections (1992), WebDM for the delivery of data mining services over the web using XML (1995), and Rattle (2005), an easy-to-use graphical user interface designed to make data mining accessible to data analysts. His popular textbook, Data Mining with Rattle and R, was published by Springer in 2011. His OnePageR website is an increasingly popular resource for data miners using R.


Computer Science and Information Technology (CSIT) computer lab, room N115/N116, ground floor of building 108. From Manning Clark Centre, proceed toward the Union Building, then turn right to cross Sullivans Creek, then right again to pass the Sports Centre and the Purple Pickle on your right. The CSIT building is on the left. Enter from the front, go past the lift, turn right, and N115/N116 is in front of you.


For further information contact AusDM13@ausdm.org

Visit us on LinkedIn - AusDM