Logistic regression is widely used in decision problems to classify inputs through training from the previously known training data. In this paper, we propose an approach to detecting similar versions of software by learning with logistic regression on binary opcode information. Because the binary opcode information has detailed information for executing software on an individual machine, the learning from the binary opcode information can provide effective information in detecting similar versions of software. To evaluate the proposed approach, we experiment with two Java applications. The experimental results showed that the proposed logistic regression model can accurately detect similar versions of software after learning from training data. The proposed logistic regression model is expected to be applied in applications for comparing and detecting similar versions of software.
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
Tel.: +1 703 830 6300
Fax: +1 703 830 2300 firstname.lastname@example.org
(Corporate matters and books only) IOS Press c/o Accucoms US, Inc.
For North America Sales and Customer Service
West Point Commons
Lansdale PA 19446
Tel.: +1 866 855 8967
Fax: +1 215 660 5042 email@example.com