How to assess students' cognitive structure is an important problem in the process of education and teaching.Traditionally,teachers evaluate a student based on their classroom performance and scores,which is lack of efficiency,objectivity,and it is hard to treat anyone equally.To solve this problem,DINA model,which is able to evaluate knowledge proficiency of students,has become a popular Cognitive Diagnosis model with a good interpretation.However,traditional DINA models are all based on small samples.When it comes to large-scale online learning scenario,the calculation will be significantly time-consuming.To address these issues,we first give proof of the convergence of DINA model,and then propose three acceleration methods.To be specific,the first one,called Incremental DINA(I-DINA),can partition the student data into blocks and iterate through the blocks.Then the second one,Maximum-Entropy DINA(ME-DINA),can choose and only access the most powerful students.At last,we combine the advantages of these two methods and propose the last model called Incremental Maximum Entropy DINA(IME-DINA).Extensive experiments on both a real-world dataset and simulation data demonstrate that our models can achieve dozens of acceleration without reducing the effectiveness of DINA Model.
Key words
educational data mining /
cognitive diagnosis /
DINA model /
convergence acceleration /
expectation maximization algorithm
{{custom_keyword}} /
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
References
[1] Premchaiswadi W,Porouhan P.Process modeling and decision mining in a collaborative distance learning environment[J].Decision Analytics,2015,2(1):1-34.
[2] 康叶钦.在线教育的"后MOOC时代"——SPOC解析[J].清华大学教育研究,2014,35(1):85-93.
[3] Anderson A,Huttenlocher D,Kleinberg J,et al.Engaging with massive online courses[A].Proceedings of the 23rd International Conference on World Wide Web[C].ACM,2014.687-698.
[4] Vukicevic M,Jovanovic M Z,Delibasic B,et al.Recommender System for Selection of the Right Study Program for Higher Education Students[M].RapidMiner:Data Mining Use Cases and Business Analytics Applications,2013.
[5] Baker R S,Inventado P S.Educational Data Mining and Learning Analytics[M].Learning Analytics.2014,61-75.
[6] Leighton J P,Gierl M J.Cognitive diagnostic assessment for education:Theory and applications[J].Journal of Qingdao Technical College,2007,45(4):407-411.
[7] DiBello L V,Roussos L A,Stout W.31A Review of cognitively diagnostic assessment and a summary of psychometric models[J].Handbook of Statistics,2006,26:979-1030.
[8] Barnes T.The Q-matrix method:Mining student response data for knowledge[A].Proceedings of the AAAI-2005 Workshop on Educational Data Mining[C].Pittsburgh,PA,2005.1-8.
[9] Junker B,Sijtsma K.Cognitive assessment models with few assumptions,and connections with nonparametric item response theory[J].Applied Psychological Measurement,2001,25(3):258-272.
[10] Fan X.Item response theory and classical test theory:an empirical comparison of their item/person statistics[J].Educational & Psychological Measurement,1998,58(58):357-381.
[11] An X,Yung Y F.Item Response Theory:What it is and how you can use the IRT procedure to apply it[A].Proceedings of the SAS Global Forum 2014 Conference[C].SAS Institute,2014.364-2014.
[12] De La Torre J.DINA Model and parameter estimation:A didactic[J].Journal of Educational and Behavioral Statistics,2009,34(1):115-130.
[13] 张潇,沙如雪.认知诊断DINA模型研究进展[J].中国考试,2013(1):32-37.
[14] de la Torre J.Application of the DINA Model Framework to Enhance Assessment and Learning[M].Self-Directed Learning Oriented Assessments in the Asia-Pacific.New York:Springer,2012.92-110.
[15] Torre J D L,Minchen N.Cognitively diagnostic assessments and the cognitive diagnosis model framework[J].Psicología Educativa,2014,20(2):89-97.
[16] Torre J D L,Douglas J A.Higher-order latent trait models for cognitive diagnosis[J].Psychometrika,2004,69(3):333-353.
[17] Wu R,Liu Q,Liu Y,et al.Cognitive modelling for predicting examinee performance[A].Proceedings of the 24th International Conference on Artificial Intelligence[C].AAAI Press,2015.1017-1024.
[18] Neal R M,Hinton G E.A View of the Em Algorithm that Justifies Incremental,Sparse,and other Variants[M].Learning in Graphical Models.Springer Netherlands,1998.355-368.
[19] Mu J.Handbook of modern item response theory[J].Journal of the American Statistical Association,1997,(92):245-256.
[20] 朱天宇,黄振亚,陈恩红,等.基于认知诊断的个性化试题推荐方法[J].计算机学报,2017,40(1):176-191.
[21] Templin J L,Henson R A.Measurement of psychological disorders using cognitive diagnosis models[J].Psychological Methods,2006,11(3):287-305.
[22] De La Torre J.The Generalized DINA model framework[J].Psychometrika,2011,76(2):179-199.
[23] Dempster A.Maximum likelihood from incomplete data via the EM algorithm[J].Journal of the Royal Statistical Society,1977,39(1):1-38.
[24] Wu C F J.On the convergence properties of the em algorithm[J].Annals of Statistics,1982,11(1):95-103.
[25] Booth J G,Hobert J P.Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm[J].Journal of the Royal Statistical Society,1999,61(1):265-285.
[26] Meng X L,Rubin D B.Maximum Likelihood Estimation via the ECM algorithm:a general framework[J].Biometrika,1993,80(2):267-278.
[27] Liu C,Rubin D B.The ECME algorithm:a simple extension of EM and ECM with faster monotone convergence[J].Biometrika,1994,81(4):633-648.
[28] Mclachlan G J.On Aitken's Method and other approaches for accelerating convergence of the em algorithm[A].Proceedings of the AC Aitken Centenary Conference[C].Dunedin:University of Otago Press(1995),1998.201-209.
[29] Bo T,Meek C,Heckerman D.Accelerating EM for large databases[J].Machine Learning,2001,45(3):279-299.
{{custom_fnGroup.title_en}}
Footnotes
{{custom_fn.content}}
Funding
National High-tech R&D Program of China (863 Program) (No.2015AA015409); National Natural Science Foundation of China for Distinguished Young Schoolars (No.61325010); National Natural Science Foundation of China (No.61672483, No.U1605251); Youth Innovation Promotion Association CAS (会员编号2014299)
{{custom_fund}}