Cause-effect graphs (CEGs) are usually applied for black-box testing of complex industrial systems. The specification process is time-consuming and can result in many errors. In this work, machine learning methods were applied for predicting the feasibility of CEG elements. All information was extracted from graphs contained in CEGSet, a dataset of CEGs. The data was converted to two different formats. The Boolean features format represents relations as separate data rows, whereas the Term-Frequency times Inverse-Document-Frequency (TF-IDF) format represents graphs as data rows. Eight machine learning models were trained on this data. The results of testing by using the 80–20 holdout method indicate that important information is lost when converting the graphs to the TF-IDF format, whereas the Boolean feature format enables 100%-accurate predictions of ensemble methods. The achieved results indicate that pre-trained models can be used as help for domain experts during the CEG specification process.
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
Tel.: +1 703 830 6300
Fax: +1 703 830 2300 firstname.lastname@example.org
(Corporate matters and books only) IOS Press c/o Accucoms US, Inc.
For North America Sales and Customer Service
West Point Commons
Lansdale PA 19446
Tel.: +1 866 855 8967
Fax: +1 215 660 5042 email@example.com