[1] Gupta S,Kaiser G,Stolfo S.Extracting context to improve accuracy for HTML conten t extraction[A].Ellis A,Tatsuya H,eds Proc of the 14th Intl Conf.on World Wide Web—Special Interest Tracks and Posters[C].New York:ACM Press,2005.1114-1115 . [2] 王佰玲,方滨兴,云晓春.传统报文捕获平台的性能影响因素分析[J].计算机工程与应用, 2003,39(22):151-152. WANG B L,FANG B X,YUN X C.The analysis of the performance factor in traditional packet capture plat[J].Computer Engineering and Applications,2003,39(22):151-1 52.(in Chinese) [3] 王佰玲,方滨兴,云晓春.零拷贝报文捕获平台的研究与实现[J].计算机学报,2005,28(1):4 6-52. WANG B L,FANG B X,YUN X C.The study and implementation of zero-copy packet capt ure platform[J].Chinese Journal of Computers,2005,28(1):46-52.(in Chinese) [4] 王佰玲.基于良性蠕虫的网络蠕虫主动遏制技术研究[D].黑龙江哈尔滨:哈尔滨工业大学. 2006. [5] 王佰玲,田志宏,张永铮.奇异值分解算法优化[J].电子学报,2010,38(10):2234-2239. Wang B L,Tian Z H,Zhang Y Z.Optimization of singular vector decomposition algori thm[J].Acta Electronica Sinica,2010,38(10):2234-2239.(in Chinese) [6] 欧阳震诤,罗建书,胡东敏,吴泉源.一种不平衡数据流集成分类模型[J].电子学报,2010,38 (1):184-189. OUYANG Z Z,LUO J S,HU D M.An ensemble classifier framework for mining imbalanced data streams[J].Acta Electronica Sinica,2010,38(1):184-189.(in Chinese) [7] 詹英,吴春明,王宝军.一种与缓冲区紧耦合的环形循环滑动窗口的数据流抽取算法[J].电 子学报,2011,39(4):894-898. ZHAN Y,WU C,WANG B.An algorithm for data stream sampling based on ring circular sliding window tightly-coupled with buffer[J].Acta Electronica Sinica,2011,39 (4):894-898.(in Chinese) [8] MacDonald J.Versioned file archiving,compression,and distribution[OL].http://w ww.cs.berkeley.edu/~jmacd/.UC Berkeley,1999. [9] Golaxy中科天玑[OL].http://www.golaxy.cn/,2009. [10] 李连霞.基于多特征的HTML网页内容提取的研究[D].山东济南:山东大学,2008. [11] 林昌平,郑皎凌.基于DOM规范的网页分析技术研究[J].成都信息工程学院学报,2007,(S1): 113-117. [12] TSE SourceCode[OL].http://sewm.pku.edu.cn/src/TSE/,2009. [13] Gomes D,Santos AL,Silva MJ.Webstore:A manager for incremental storage of content s[R].Technical Report,DI/FCUL TR 04-15,Lisbon:University of Lisbon,2004. [14] Sekiguchi Y,Kawashima H,Okuda H,Oku M.Topic detection from Blog documents using users'interests[A].Aberer K,Hara T,eds Proc of the 7th Intl Conf on Mobile Da ta Management(MDM 2006)[C].Washington:IEEE Computer Society,2006.108-111. [15] Wang XY,Xiong FY,Ling B,Zhou A.A similarity-based algorithm for topic explorati on and distillation[J].Journal of Software,2003,14(9):1578-585. [16] McCown F.Dynamic web file format transformations with grace[A].Proc of the 5th Intl Web Archiving Workshop and Digital Preservation[C].2005.22-23. [17] Lampos C,Eirinaki M,Jevtuchova D,Vazirgiannis M.Archiving the Greek Web[A].Pro c.of the 4th Int'l Web Archiving Workshop[C].2004. |