Domain-Oriented and Tag-Aided Web Service Clustering Method
TIAN Gang1,2, HE Ke-qing1, WANG Jian1, SUN Cheng-ai2, XU Jian-jian2
1. State Key Laboratory of Software Engineering School of Computer, Wuhan University, Wuhan, Hubei 430072, China;
2. College of Information Science, Shandong University of Science and Technology, Qingdao, Shandong 266590, China
The growing number of web services puts forward higher requirements for searching desired web services and clustering Web services can greatly enhance the discovery of Web service.However,the existing clustering approaches are only for a single type of service documents,and they are lacking of considering the domain characteristic and the tags information of services.To solve these problems,the proposed approach constructs the feature vectors of Web service contents by using ontology empowered SVM and domain oriented feature dimension reduction technology.Then a tag aided service clustering model called T-LDA is proposed to construct the hidden topic representations of Web service and general topical information which has less discriminative power is normalized.Finally all methods mentioned above are combined to form the domain oriented and tag aided Web service clustering (DTWSC).Experimental results show that the proposed approach can improve the effect of clustering.Compared with the approaches of LDA and K-means,the proposed approach achieves better performance of the purity,entropy and F-measure.
田刚, 何克清, 王健, 孙承爱, 徐建建. 面向领域标签辅助的服务聚类方法[J]. 电子学报, 2015, 43(7): 1266-1274.
TIAN Gang, HE Ke-qing, WANG Jian, SUN Cheng-ai, XU Jian-jian. Domain-Oriented and Tag-Aided Web Service Clustering Method. Chinese Journal of Electronics, 2015, 43(7): 1266-1274.
[1] L-J Zhang,J Zhang,H Cai.Services Computing[M].Beijing:Tsinghua University,2007.
[2] Chen Liang,Hu Liukai,Zheng Zibin,et al.WTCluster:Utilizing tags for Web services clustering[A].Proceedings of International Conference on Service-Oriented Computing[C].Berlin:Springer,2011.204-218.
[3] Elgazzar K,Hassan A E,Martin P.Clustering WSDL documents to bootstrap the discovery of web services[A].Proceedings of International Conference on Web Services[C].USA:Piscataway,2010.147-154.
[4] Yu Q,Rege M.On service community learning:A co-clustering approach[A].Proceedings of IEEE International Conference on Web Services[C].USA:Piscataway,2010.283-290.
[5] Liu Jianxiao,He Keqing,Wang Jian,et al.A clustering method for web service discovery[A].Proceedings of International Conference on Services Computing[C].USA:Piscataway,2011.729-730.
[6] Cassar G,Barnaghi P,Moessner K.Probabilistic methods for service clustering[A].Proceedings of International Workshop on Semantic Web Service Matchmaking and Resource Retrieval[C].Shanghai:SRI,2010.4-20.
[7] Blei D M,Ng A Y,Jordan M I.Latent dirichlet allocation[J].Journal of Machine Learning Research,2003,3(2):993-1022.
[8] Rosen-Zvi M,Griths T,Steyvers M,Smyth P.The author-topic model for authors and documents[A].Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence[C].USA:UAI,2004.487-494.
[9] Wang Jian,Zhang Jia,Hung P C K,et al.Leveraging fragmental semantic data to enhance services discovery[A].Proceedings of the 13th International Conference on High Performance Computing and Communications[C].Piscataway,NJ:IEEE,2011.687-694.
[10] Chen L,Wang Y,Yu Q,et al.WT-LDA:User Tagging Augmented LDA for Web Service Clustering[M].USA:Service-Oriented Computing,2013.162-176.
[11] Canny J.GaP:a factor model for discrete data[A].In SIGIR'04:Proceedings of International Conference on Research and Development in Information Retrieval[C].New York,NY:ACM Press,2004.122-129.
[12] 李征,王健,等.一种面向主题的领域服务聚类方法[J].计算机研究与发展,2014,51(2):408-419. Li Zheng,Wang Jian,et al.A topic-oriented clustering approach for domain services[J].Journal of Computer Research and Development,2014,51(2):408-419.(in Chinese)
[13] Wang Xianzhi,Wang Zhongjie,Xu Xiaofei.Semi-empirical service composition:a clustering based approach[A].Proceedings of International Conference on Web Services[C].Piscataway,NJ:IEEE,2011.219-226.
[14] Richi N,Bryan L.Web service discovery with additional semantics and clustering[A].Proceedings of International Conference on Web Intelligence[C].Piscataway,NJ:IEEE,2007.555-558.
[15] Griffiths T L,Steyvers M.Finding scientific topics[A].Proceedings of the National Academy of Sciences of the United States of America[C].USA:NCBI,2004,Vol.101.5228-5235.
[16] Dasgupta S,Bhat S,Lee Y.Taxonomic clustering and query matching for efficient service discovery[A].Proceedings of Conference on Web Services[C].Piscataway,NJ:IEEE,2011.363-370.
[17] Platzer C,Rosenberg F,Dustdar S.Web service clustering using multidimensional angles as proximity measures[J].ACM Transactions on Internet Technology,2009,9(3):1-26.
[18] 孙萍,蒋昌俊.利用服务聚类优化面向过程模型的语义Web服务发现[J].计算机学报,2008,31(8):1340-1353. Sun Ping,Jiang Changjun.Using service clustering to facilitate process-oriented semantic web service discovery[J].Chinese Journal of Computers,2008,31(8):1340-1353.(in Chinese)
[19] Skoutas D,Sacharidis D,et al.Ranking and clustering Web services using multicriteria dominance relationships[J].IEEE Transactions on Services Computing,2010,3(3):163-177.
[20] 杜玉越,薛洁,李彦成.基于服务簇的服务组合替换与分析[J].电子学报,2014,42(11):2231-2238. DU Yu-yue,XUE Jie,LI Yan-cheng.Substitution andanalysis of service composition based on service clusters[J].Acta Electronica Sinica,2014,42(11):2231-2238.(in Cineses)
[21] 江雨燕,李平,王清.基于共享背景主题的Labeled LDA模型[J].电子学报,2013,41(9):1794-1799. JIANG Yu-yan,LI Ping,WANG Qing.Labeled LDA model based on shared background topics[J].Acta Electronica Sinica,2013,41(9):1794-1799.(in Cineses)
[22] 陈江锋,于建军.基于主题模型的结构化Web服务发现机制[J].北京航空航天大学学报,2008,34(6):734-738. Chen Jiangfeng,Yu Jianjun.Topic model based structural Web services discovery[J].Journal of Beijing University of Aeronautics and Astronautics,2008,34(6):734-738.(in Chinese)