Learning Task with Future Leaked Information

From GM-RKB

A Learning Task with Future Leaked Information is a learning task in which the modeler has visibility into information that would be unknowable at the time a prediction must be made.



References

2017

  • https://www.kaggle.com/alexisbcook/data-leakage
    • QUOTE: ... But the model will be very inaccurate when subsequently deployed in the real world, because even patients who will get pneumonia won't have received antibiotics yet when we need to make predictions about their future health.

      To prevent this type of data leakage, any variable updated (or created) after the target value is realized should be excluded. ...
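The exclusion rule in the quote above can be sketched as a timestamp filter. This is a minimal illustration, not the method of the cited source: the feature names, the dates, and the helper `leak_free_features` are all hypothetical, and the sketch assumes a single last-updated date is known for each variable.

```python
from datetime import date

# Hypothetical metadata: the date each variable was last created or updated.
FEATURE_UPDATED_ON = {
    "age": date(2017, 1, 5),
    "weight": date(2017, 1, 5),
    "took_antibiotic_medicine": date(2017, 2, 20),
}

# Hypothetical date on which the target value (pneumonia diagnosis) was realized.
TARGET_REALIZED_ON = date(2017, 2, 14)


def leak_free_features(feature_updated_on, target_realized_on):
    """Return the names of features last updated strictly before the target was realized."""
    return sorted(
        name
        for name, updated_on in feature_updated_on.items()
        if updated_on < target_realized_on
    )


kept = leak_free_features(FEATURE_UPDATED_ON, TARGET_REALIZED_ON)
print(kept)  # ['age', 'weight']
```

In real data the update time usually varies per record rather than per variable, so the same comparison would be applied row by row against each record's own target-realization time.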

2011

2010

  • Rosset, S., Perlich, C., Swirszcz, G., Liu, Y., and Prem, M. 2010. Medical data mining: lessons from winning two competitions. Data Mining and Knowledge Discovery. 20(3) 439-468.