Using data linked to rainfall, this project aims to predict the water levels of Italian water bodies. One of the main challenges in this Kaggle competition is the state of the raw data. Not only does each water body have its own predictor and target variables, but the availability of data for each variable also varies widely, which makes it difficult to automate cleaning, feature generation, and model building.
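To make the availability problem concrete, a per-variable coverage summary is one possible first step before any automated cleaning. The sketch below is illustrative only and is not the project's actual pipeline: the function name `availability_report`, the `Date` key, and the column names in the sample rows are assumptions, and missing values are assumed to appear as `None` or empty strings.

```python
from datetime import date


def availability_report(rows, date_key="Date"):
    """Summarise, per column, how many values are present and over which dates.

    rows: list of dicts, one per observation (for example from csv.DictReader).
    Missing values are assumed to be None or empty strings.
    """
    report = {}
    for row in rows:
        day = row[date_key]
        for col, value in row.items():
            if col == date_key:
                continue
            stats = report.setdefault(
                col, {"present": 0, "first": None, "last": None}
            )
            if value is None or value == "":
                continue  # missing value: counts toward the total, not toward "present"
            stats["present"] += 1
            if stats["first"] is None or day < stats["first"]:
                stats["first"] = day
            if stats["last"] is None or day > stats["last"]:
                stats["last"] = day

    total = len(rows)
    for stats in report.values():
        # Fraction of rows in which this column has a value.
        stats["coverage"] = stats["present"] / total if total else 0.0
    return report


# Hypothetical sample rows; the column names are placeholders.
rows = [
    {"Date": date(2020, 1, 1), "Rainfall_A": 1.2, "Depth_to_Groundwater": None},
    {"Date": date(2020, 1, 2), "Rainfall_A": None, "Depth_to_Groundwater": -5.1},
    {"Date": date(2020, 1, 3), "Rainfall_A": 0.0, "Depth_to_Groundwater": -5.3},
    {"Date": date(2020, 1, 4), "Rainfall_A": 3.4, "Depth_to_Groundwater": -5.2},
]
report = availability_report(rows)
```

Running the same report on every water body gives a comparable view of which variables start late, end early, or have large gaps, which can then guide per-variable cleaning rules instead of a single fixed recipe.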