A dataset relating characteristics of telephony account features and usage and whether or not the customer churned.
Source
Churn Data Set from Discovering Knowledge in Data: An Introduction to Data Mining
Info about the top 10,000 Twitter users with more followers
SocialBro application
Dataset of Telecom company to predict churn. 5000 instances.
Dataset with 3,333 instances of customer behavior and churn indicator.
Info about the top 10,000 Companies with more followers in Twitter
Email data collected from my own inbox, with importance set by Gmail's Priority Inbox. I have removed content features (for privacy reasons). The remaining features are:
...
See Bootstrapping Machine Learning for more information.
1 million Github repos got from Github API