Now launch the virtual machine and log onto the SAS University Edition.ĭownload the train.csv and test.csv file from the Kaggle website and store them within the shared folder you setup when installed SAS unviersity edition, usually this will be : C:\SASUniversityEdition\myfolders\ . If you haven’t done this yet that you can follow the tutorial here. Outputting a dataset to a CSV file in preparation for submitting it to Kaggleīefore you get started you will need to install a virtual machine and then the SAS university edition files.Using the KEEP statement to only keep variables that you want in your dataset.Using IF…THEN.ELSE logic to modify a dataset.Calculating the proportion of men and women who survived using PROC FREQ.Reading a CSV file into SAS using PROC IMPORT.
It explains how to use the SAS University Edition to do the following: It should be useful both for people who want to learn SAS, but also for those who want to use SAS to enter the Kaggle competition. The tutorial is designed to be roughly equivalent to the first excel lesson available on the Kaggle website. This is the first of our tutorials on using SAS university edition to explore the data from the Kaggle Titanic: Machine Learning from Disaster edition.