Filling streamflow data gaps in Indian catchments using machine learning

Source
EGU General Assembly 2025
Date Issued
2025-04-27
Author(s)
Solanki, Hiren
Mishra, Vimal
Abstract
Complete hydrological time series are critical for effective water resource management, flood and drought forecasting, hydroelectric power optimization, irrigation planning, ecological preservation, and climate change impact assessments. However, significant data gaps in streamflow and water level observations, compounded by extreme hydroclimatic events and quality control issues, hinder accurate modeling and informed decision-making in Indian catchments. These challenges are particularly pronounced in regions with high climatic variability, where missing data spans 6 to 12 months. To address this, we combined geomorphological, meteorological, and hydrological parameters with the Random Forest method to gap-fill streamflow data at 352 stations across India, excluding the transboundary basins. To enhance model accuracy and training, we grouped stations into classes with similar behavior using a k-means clustering algorithm based on catchment characteristics. This clustering increased the amount of training data available to the machine learning models. For each class, the model was trained on 80% of the available streamflow data and validated on the remaining 20%. Our results indicate that clustering significantly improves performance, with over 100 stations showing an increase of more than 25% in Nash-Sutcliffe Efficiency (NSE). Model performance was evaluated for continuous data gaps of 1 week, 1 month, 3 months, 6 months, and 1 year, revealing a decline in accuracy with longer gaps. Despite this decline, the mean NSE exceeded 0.85 across all clusters. The gap-filled datasets provide robust hydrographs, enabling precise modeling of streamflow variability, evaluation of climate-hydrology interactions, and improved water resource management strategies.
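The evaluation described in the abstract relies on the Nash-Sutcliffe Efficiency computed over artificially withheld continuous gaps. The following is a minimal sketch of those two pieces only, not the authors' implementation: the function names `nash_sutcliffe_efficiency` and `mask_continuous_gap` are hypothetical, and the gap-filling model itself (Random Forest on clustered stations) is left out. It uses the standard definition NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2).

```python
from statistics import fmean


def nash_sutcliffe_efficiency(observed, simulated):
    """Standard NSE: 1 means a perfect fit, 0 means no better than the observed mean."""
    if len(observed) != len(simulated) or len(observed) == 0:
        raise ValueError("observed and simulated must be non-empty and of equal length")
    mean_obs = fmean(observed)
    # Sum of squared errors between observations and the gap-filled values.
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    # Sum of squared deviations of the observations from their own mean.
    variance = sum((o - mean_obs) ** 2 for o in observed)
    if variance == 0:
        raise ValueError("NSE is undefined when the observed series is constant")
    return 1.0 - residual / variance


def mask_continuous_gap(series, start, length):
    """Hypothetical helper: withhold `length` consecutive values beginning at `start`.

    Returns (masked_series, held_out_values); masked entries are set to None.
    """
    end = min(start + length, len(series))
    masked = list(series)
    held_out = masked[start:end]
    masked[start:end] = [None] * (end - start)
    return masked, held_out
```

In use, one would withhold a gap of the desired length (for daily data, 7 values for a 1-week gap, for example) with `mask_continuous_gap`, fill it with whatever model is being tested, and pass the held-out values and the filled values to `nash_sutcliffe_efficiency`.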
URI
https://meetingorganizer.copernicus.org/EGU25/EGU25-15030.html
http://repository.iitgn.ac.in/handle/IITG2025/31461