What is Data Leakage in Machine Learning?
Data leakage occurs when information from outside the training data is unintentionally used during model training, leading to over-optimistic performance estimates: the model scores well in evaluation but performs worse once deployed. Two common causes are using features that are not available at deployment time and mistakenly including data from the future in the training set.
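As a rough illustration of the "data from the future" case, the sketch below compares a split that mixes later dates into training with a chronological split. The helper names (chronological_split, has_future_leakage), the row layout, and the toy values are all assumptions made up for this example, not part of any library.

```python
from datetime import date


def chronological_split(rows, cutoff):
    """Split rows so that every training row is dated strictly before the cutoff."""
    train = [r for r in rows if r["date"] < cutoff]
    test = [r for r in rows if r["date"] >= cutoff]
    return train, test


def has_future_leakage(train, test):
    """Return True if any training row is dated on or after the earliest test row."""
    if not train or not test:
        return False
    return max(r["date"] for r in train) >= min(r["date"] for r in test)


# Hypothetical toy data: one row per day, values invented for illustration.
rows = [{"date": date(2024, 1, d), "value": d * 10} for d in range(1, 11)]

# Leaky split: taking every other row puts later dates into the training set.
leaky_train, leaky_test = rows[::2], rows[1::2]

# Safer split: everything before the cutoff trains, the rest evaluates.
safe_train, safe_test = chronological_split(rows, cutoff=date(2024, 1, 8))

print(has_future_leakage(leaky_train, leaky_test))  # True
print(has_future_leakage(safe_train, safe_test))    # False
```

A date check like this only catches the temporal form of leakage. Features that are unavailable at deployment time usually have to be found by reviewing how and when each column is produced.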