There are two go to marketing strategies that COIL can use. same zip code have the same sociodemographic attributes. Safety Also a Leiden Institute of Advanced Computer Science Technical Report 2000-09. The data dictionary ([Web Link]) describes the variables used and their values. It is further divided into a training set (5822 observations) and a test set (4000 observations). Muthu Kumaar Thangavelu (G1101765E) There was a problem preparing your codespace, please try again. This repository is part of the Caravan project/dataset. Compute time series of spatially-averaged meteorological forcings on Google Earth Engine. Fig 3: Derived Variables 3.8 Balancing the training data It has been noticed that the training dataset is not highly representative of positive cases i.e.CARAVAN=1. CUST_SUB_LIFESTYLE_REFLECTION: The corresponding data visualizations can be observed in the uploaded jupyter notebook. The sociodemographic data is derived from zip codes. sign in Registered Office: Pegasus House, Bakewell Road, Orton Southgate, Peterborough, PE2 6YS. Having said that, I have developed analysis that compares overall costs for all eighteen models for classification cutoff values ranging from 0 to 1. Caravan insurance data mining statistical analysis, Product Planning Manager, Oncology & Hospital Specialty Care Marketing at MSD. to use Codespaces. Are you sure you want to create this branch? Here is how you do it. The data consists of 86 variables and includes product usage data and socio-demographic data, Original Owner and Donor: Peter van der Putten Sentient Machine Research Baarsjesweg 224 1058 AA Amsterdam The Netherlands +31 20 6186927 pvdputten '@' hotmail.com, putten '@' liacs.nl TIC Benchmark Homepage: http://www.liacs.nl/~putten/library/cc2000/. How To Reimage Your Computer Windows 10 - How to check the Windows 10 Creators Update is installed - How to reimage a mac computer. Anyone, with as little as streamflow records and catchment boundaries of one (or more) basins, can contribute to extending the Caravan dataset to new regions. You can load the Caravandata set in R by issuing the following command at the console data("Caravan"). When your caravan is being towed, your car insurance policy often only extends to third party cover, so any damage to the caravan itself would be covered under your caravan insurance. Each record consists of 86 attributes, containing sociodemographic data (attribute 1-43) and product ownership (attributes 44-86).The sociodemographic data is derived from zip codes. Lines open Mon-Fri 9am-5.30pm. Additionally, every data that is contributed contains a separate license/info file, attributing your contribution to this project and explaining the source of license specification of this addition. If nothing happens, download GitHub Desktop and try again. KDD. The dataset used is from the CoIL Challenge 2000 datamining competition. Rented house, in the zipcode area of the customer. The training data has 5893 observations, whereas, the test data consists of the remaining 3929 observations. James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013) Since, this dataset was used for the purposes of a challenge, I obtained the data in the form of training data and test data, which is why, there was no need to split the data for my analysis. All Rights Reserved,