Embed this resource in your web site
Model that will predict the quality of risk of a loan application. This dataset was initially created by dr. Hans Hofmann at the Institut fur Statistik und Okonometrie Universitat Hamburg.
Model predicting churn for a telecom company, based on 3,333 instances of customer data.
A model predicting loan delinquency at for loans given by LendingClub.com based on about 50000 loans. Data is available at http://lendingclub.com/info
Model to predict survival on the Titanic, based on Class/Department, Age, Group, City of Embarkment, Job and Survived (as objective field). The data source is from http://www.encyclopedia-titanica.org. The Fare was indexed to current prices, based on a retail index.
This predictive model is capable of distinguishing between bad connections, called intrusions or attacks, and good normal connections. The data source used was part of the data created for The Third International Knowledge Discovery and Data Mining Tools Competition, which was held in conjunction with KDD-99 The Fifth International Conference on Knowledge Discovery and Data Mining.
Model with word counts for about 47,000 text documents, of which roughly 32,000 are novels, 7,500 are supreme court opinions, and 7,500 are webpages from universities. The features are word counts for the 3000 top words by TF*IDF, with stopwords removed.
Model that predicts which percentage of patients will rate a hospital at 9 or 10, based on a dataset of hospital patient surveys. It shows which survey items and which scores are important for a good overall result. The data is from a list of hospital ratings for the Hospital Consumer Assessment of Healthcare Providers and Systems (HCAHPS). HCAHPS is a national, standardized survey of hospital patients about their experiences during a recent inpatient hospital stay. https://data.medicare.gov/dataset/Survey-of-Patients-Hospital-Experiences-HCAHPS-/rj76-22dk
A model mapping income and education level to poverty rate for over 3000 U.S. counties.
A model that predicts the use of a sample of glass based on chemical composition. Useful in forensics when broken glass is found at a crime scene.