Data

We provide here public datasets that you might find useful for exploring data science.

Sample datasets:

  • Unbalanced Credit Card Dataset (144MB CSV file) originally available as creditcard.Rdata. The dataset is described in the paper Calibrating Probability with Undersampling for Unbalanced Classification, A. Dal Pozzolo, O. Caelen, R. A Johnson and G. Bontempi, IEEE Symposium Series on Computational Intelligence (SSCI), Cape Town, South Africa, 2015. It is also available from Kaggle. Provided here with permission.