Beta-boosted ensemble for big credit scoring data

Maciej Zieba & Wolfgang K. Härdle
In this work we present a novel ensemble model for the credit scoring problem. The main idea is to use a separate beta-binomial distribution for each class to generate balanced datasets, which are then used to train the base learners that make up the final ensemble. Sampling is performed on two separate ranking lists, one per class, where the ranking is based on the propensity of observing the positive class....
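The sampling idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the shape parameters `a` and `b`, the direction in which each class is ranked, and the use of a precomputed score as the estimated chance of the positive class are all assumptions made for illustration. A beta-binomial draw is simulated by first drawing a success probability from a beta distribution and then drawing a position in the ranking list from a binomial distribution.

```python
import numpy as np


def beta_binomial_positions(n_items, n_draws, a, b, rng):
    """Draw list positions in 0..n_items-1 from a beta-binomial(n_items-1, a, b)."""
    p = rng.beta(a, b, size=n_draws)      # one success probability per draw
    return rng.binomial(n_items - 1, p)   # position in the ranking list


def balanced_sample(scores, y, n_per_class, a=1.0, b=3.0, seed=0):
    """Return row indices of a class-balanced sample (drawn with replacement).

    scores: assumed estimate of the chance of the positive class per row.
    y:      binary labels (1 = positive, 0 = negative).
    a, b:   hypothetical beta shape parameters, shared by both classes here;
            the abstract describes a separate distribution per class.
    """
    rng = np.random.default_rng(seed)
    scores = np.asarray(scores, dtype=float)
    y = np.asarray(y)

    pos = np.flatnonzero(y == 1)
    neg = np.flatnonzero(y == 0)

    # One ranking list per class. Assumption: rows that look least like
    # their own class come first, so a < b favours them.
    pos_ranked = pos[np.argsort(scores[pos])]    # lowest score first
    neg_ranked = neg[np.argsort(-scores[neg])]   # highest score first

    pos_pick = pos_ranked[beta_binomial_positions(len(pos_ranked), n_per_class, a, b, rng)]
    neg_pick = neg_ranked[beta_binomial_positions(len(neg_ranked), n_per_class, a, b, rng)]
    return np.concatenate([pos_pick, neg_pick])


# Toy data: 10 positives among 100 rows, random scores standing in for a model.
toy_rng = np.random.default_rng(42)
y_toy = np.array([1] * 10 + [0] * 90)
scores_toy = toy_rng.random(100)
idx = balanced_sample(scores_toy, y_toy, n_per_class=20)
```

Each call with a different seed yields a different balanced dataset, so repeating the call and fitting one base learner per sample gives an ensemble in the spirit of the description above.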