National Natural Science Foundation of China (No.61070176, No.61170218, No.61272461); National University Student Innovation Program (No.201311560008); Program of Research Project of Education Department of Shaanxi Province (No.2013JK1200); Natural Science Basic Research Priorities Program of Shaanxi Province (No.2012JM8034); Industrialization Project of Shaanxi Education Department (No.2011JG06)
Feature selection for software birthmarks directly affects the software recognition rate. We apply constrained clustering to analyze software features. The within-class and between-class distances of features are measured using mutual information. An information gain function and a penalty function are constructed from homogeneous and heterogeneous software features, respectively. The birthmark features with high class distinction and minimum redundancy are then selected. Analysis and comparison show that the algorithm provides an effective approach to selecting and optimizing software birthmark features.
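To make the idea of selecting features with high class distinction and minimum redundancy concrete, the following minimal sketch scores each candidate by its mutual information with the class label and subtracts its mean mutual information with the features already chosen. This is a simplified greedy stand-in, not the constrained-clustering algorithm with information gain and penalty functions described above. The feature names (`api_calls`, `api_calls_copy`, `opcode_ngrams`), the toy data, and the functions `mutual_information` and `select_features` are all hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information I(X;Y) in bits of two discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def select_features(features, labels, k):
    """Greedily pick k features: relevance to the label minus mean redundancy.

    Assumption: this relevance-minus-redundancy score only approximates the
    information gain and penalty functions of the paper.
    """
    relevance = {name: mutual_information(col, labels)
                 for name, col in features.items()}
    selected, remaining = [], list(features)
    while remaining and len(selected) < k:
        def score(name):
            if not selected:
                return relevance[name]
            redundancy = sum(mutual_information(features[name], features[s])
                             for s in selected) / len(selected)
            return relevance[name] - redundancy
        best = max(remaining, key=score)  # first maximal element wins ties
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data: eight programs belonging to four software classes.
labels = [0, 0, 1, 1, 2, 2, 3, 3]
features = {
    "api_calls":      [0, 0, 0, 0, 1, 1, 1, 1],  # separates classes {0,1} from {2,3}
    "api_calls_copy": [0, 0, 0, 0, 1, 1, 1, 1],  # fully redundant with api_calls
    "opcode_ngrams":  [0, 0, 1, 1, 0, 0, 1, 1],  # separates even from odd classes
}

selected = select_features(features, labels, k=2)
print(selected)  # ['api_calls', 'opcode_ngrams']
```

In this toy setting all three features are equally informative about the class on their own, but the duplicated feature adds nothing once `api_calls` is chosen, so the redundancy term steers the second pick to `opcode_ngrams`.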