Developing Machine Learning Tools For Long-Lead Heavy Precipitation Prediction With Multi-Sensor Data
Keywords
Fast Online Streaming Feature Selection; Heavy Precipitation Predicition; Machine Learning; Nearest Sample Choosing
Abstract
A large number of extreme floods were closely related to heavy precipitation which lasted for several days or weeks. Long-lead prediction of extreme precipitation, i.e., prediction of 6-15 days ahead of time, is important for understanding the prognostic forecasting potential of many natural disasters, such as floods. Yet, long-lead flood forecasting is a challenging task due to the cascaded uncertainty with prediction errors from measurements to modeling, which makes the current physics-based numerical simulation models extremely complex and inaccurate. In this paper, we formulate the modeling work as a machine learning problem and introduce a complementary data mining framework for heavy precipitation prediction. Heavy precipitation that may lead to extreme floods is a rare event. Long-lead prediction requires the corresponding feature space to be sampled from extremely high spatio-Temporal dimensions. Such a complexity makes long-lead heavy precipitation prediction a high dimensional and imbalanced machine learning problem. In this work, we firstly define the extreme precipitation and non-extreme precipitation clusters and then design the Nearest-Sample Choosing method to handle the imbalanced data sets. We introduce streaming feature selection and subspace learning to extract the most relevant features from high dimensional data. We evaluate the machine learning tools using historical flood data collected in the State of Iowa, the United States and associated hydrometeorological variables from 1948 to 2010.
Publication Date
6-1-2015
Publication Title
ICNSC 2015 - 2015 IEEE 12th International Conference on Networking, Sensing and Control
Number of Pages
63-68
Document Type
Article; Proceedings Paper
Personal Identifier
scopus
DOI Link
https://doi.org/10.1109/ICNSC.2015.7116011
Copyright Status
Unknown
Socpus ID
84941248929 (Scopus)
Source API URL
https://api.elsevier.com/content/abstract/scopus_id/84941248929
STARS Citation
Di, Yahui; Ding, Wei; Mu, Yang; Small, David L.; and Islam, Shafiqul, "Developing Machine Learning Tools For Long-Lead Heavy Precipitation Prediction With Multi-Sensor Data" (2015). Scopus Export 2015-2019. 2031.
https://stars.library.ucf.edu/scopus2015/2031