1.麒麟软件有限公司2030实验室,湖南长沙 410000
2.武汉学院信息工程学院,湖北武汉 430212
3.国防科技大学计算机学院,湖南长沙 410000
吴庆波 男,1969年4月出生于浙江省宁波市。中国电子信息产业集团有限公司、麒麟软件有限公司首席科学家,中国电子信息产业集团春天研究院院长,天津市操作系统重点实验室主任、研究员。主要研究方向为国产操作系统的研制和推广应用。E-mail: wuqingbo@kylinos.cn
伍复慧 男,1987年2月出生于江西省赣州市。现为武汉学院信息工程学院副教授。主要研究方向为人工智能、操作系统。E-mail: fuhui.wu@whxy.edu.cn
刘海天 男,1994年12月出生于湖南省涟源市。现为麒麟软件高级研发工程师。主要研究方向为人工智能、操作系统。E-mail: liuhaitian@kylinos.cn
刘军 男,1986年2月出生于江西省新干县。现为麒麟软件2030实验室科研规划部总经理、CCF泛在操作系统开放社区技术委员会委员。主要研究方向为人工智能、操作系统。E-mail: liujun@kylinos.cn
李英俊 男,1988年8月出生于湖南省涟源市。现为麒麟软件有限公司2030实验室x内核研发部经理。主要研究方向为操作系统。E-mail: liyingjun@kylinos.cn
农俊康 男,1986年11月出生于广西壮族自治区百色市。现为麒麟软件有限公司高级研发工程师。主要研究方向为虚拟化、操作系统。E-mail: nongjunkang@kylinos.cn
杜宇琪 男,2004年2月出生于江西省萍乡市。现为麒麟软件有限公司2030实验室助理研发工程师。主要研究方向为机密计算。E-mail: duyuqi@kylinos.cn
吴昊泽 男,2000年5月出生于甘肃省兰州市。现为麒麟软件有限公司2030实验室助理研发工程师。主要研究方向为操作系统。E-mail: wuhaoze@kylinos.cn
陈龙腾 男,2000年11月出生于湖南省涟源市。毕业于南京航空航天大学。现为麒麟软件有限公司助理工程师。主要研究方向为操作系统。E-mail: chenlongteng@kylinos.cn
王静 女,1986年10月出生于湖南省常德市。现为国防科技大学计算机学院助理研究员,国家重点研发计划项目课题负责人,开源发展技术委员会执行委员。主要研究方向为操作系统、开源生态治理。E-mail: wangjing@nudt.edu.cn
收稿:2025-09-04,
录用:2025-12-10,
纸质出版:2026-04-25
移动端阅览
吴庆波, 伍复慧, 刘海天, 等. AIOS综述:安全可信的人工智能原生操作系统[J]. 电子学报, 2026, 54(04): 1584-1611.
WU Qingbo, WU Fuhui, LIU Haitian, et al. Systematic Survey of AIOS: A Secure and Trustworthy AI-Native Operating System[J]. Acta Electronica Sinica, 2026, 54(04): 1584-1611.
吴庆波, 伍复慧, 刘海天, 等. AIOS综述:安全可信的人工智能原生操作系统[J]. 电子学报, 2026, 54(04): 1584-1611. DOI:10.12263/DZXB.20250763
WU Qingbo, WU Fuhui, LIU Haitian, et al. Systematic Survey of AIOS: A Secure and Trustworthy AI-Native Operating System[J]. Acta Electronica Sinica, 2026, 54(04): 1584-1611. DOI:10.12263/DZXB.20250763
随着人工智能技术的快速发展,传统操作系统在安全性、智能性与场景适配性方面面临新的挑战。本文系统梳理了安全可信的人工智能原生操作系统(AI-native Operating System,AIOS)的研究进展与关键技术。首先,设计AIOS的分层架构,包括内核组件库、资源管理服务、共性基础服务以及领域基础软件栈。面向资源、运维与安全的系统级智能化服务框架和围绕安全内核架构、软硬件协同的安全与适配,以及虚拟化支撑的纵深防御安全访问框架构成共性基础服务。通过标准化接口与内核/核心服务在调度、内存、I/O与安全域等方面向上提供领域基础软件栈接口(涵盖人工智能(Artificial Intelligence,AI)软件栈与安全软件栈),支撑行业场景落地。其次,提出人工智能原生操作系统的关键技术:端侧智能操作系统聚焦模型压缩、推理优化(如键值(Key-Value,KV)缓存与分页注意力)与端云协同的异构资源调度及硬件抽象与驱动适配;记忆管理系统围绕记忆存储、检索与管理,结合短期、长期记忆以支撑长上下文理解与动态更新;安全可信机制涵盖内存安全与形式化验证、自动化检测、可信启动与远程证明,以及基于可信执行环境(Trusted Execution Environment,TEE)、机密计算与虚拟化隔离的纵深防御。随后,结合应用场景,综述AIOS在具身智能、工业与移动等领域的部署路径、能力边界与工程权衡,分析端侧推理、实时控制、跨设备协同与多模态交互的实践挑战。进一步,围绕新型操作系统基础技术、智能服务框架与关键技术梳理及安全可信基石,总结架构模块化与软硬件协同、智能服务的资源调度与安全鲁棒性、记忆系统的效率与隐私保护、领域基础软件栈的标准化与生态建设,以及跨平台与跨设备的智能协同与资源共享等开放问题与趋势。本文贡献在于提供结构化视角、术语边界与技术地图,并对典型系统与工具进行对比归纳,为AIOS研究与工程落地提供参考。
With the rapid development of artificial intelligence technology
traditional operating systems face new challenges in security
intelligence
and scenario adaptability. This survey systematically reviews the research progress and key technologies of secure and trustworthy AI-native operating systems (AIOS). First
it introduces the layered architecture of AIOS
including kernel components
resource management
general foundational services (covering both the intelligent service framework and the secure access framework)
and domain-specific software stacks
highlighting the deep integration of artificial intelligence (AI) capabilities with operating systems. The secure access framework focuses on secure kernel architecture
software-hardware collaborative security and adaptation
and virtualization-based in-depth defense. Second
it summarizes the key AIOS technologies: edge-oriented operating systems emphasize model compression
inference optimization (e.g.
key-value (KV) cache and PagedAttention)
edge-cloud collaboration with heterogeneous resource scheduling
and hardware abstraction and driver adaptation; the memory management system centers on memory storage
retrieval
and management
combining short-term and long-term memory to support long context understanding and dynamic updates; the security and trust mechanisms cover memory safety and formal verification
automated analysis
trusted boot and remote attestation
and defense-in-depth via trusted execution environment (TEE)/confidential computing and virtualization isolation. Finally
by examining practical scenarios such as embodied intelligence
industrial internet of things (IoT)
and mobile applications
it analyzes deployment paths
capability boundaries
and engineering trade-offs
and summarizes open issues and trends including architectural modularity
co-design of hardware and software
resource scheduling and security robustness in intelligent services
efficiency and privacy of memory systems
standardization and ecosystem building of domain stacks
and cross-platform/device collaboration and resource sharing. This survey aims to provide a structured perspective
clear terminology boundaries
and a technology map
offering references for AIOS research and engineering practice.
高蕾 , 符永铨 , 李东升 , 等 . 我国人工智能核心软硬件发展战略研究 [J ] . 中国工程科学 , 2021 , 23 ( 3 ): 90 - 97 . DOI: 10.15302/j-sscae-2021.03.008 http://dx.doi.org/10.15302/j-sscae-2021.03.008
Gao L , Fu Y Q , Li D S , et al . Development strategy for the core software and hardware of artificial intelligence in China [J ] . Strategic Study of CAE , 2021 , 23 ( 3 ): 90 - 97 . (in Chinese) . DOI: 10.15302/j-sscae-2021.03.008 http://dx.doi.org/10.15302/j-sscae-2021.03.008
中华人民共和国国务院 . 国务院关于印发新一代人工智能发展规划的通知 [EB/OL ] . ( 2017-07-20 )[ 2021-4-22 ] . https://www.gov.cn/zhengce/content/2017-07/20/content_5211996.htm https://www.gov.cn/zhengce/content/2017-07/20/content_5211996.htm .
The State Council of the People's Republic of China . Circular of the state council on issuing the development plan for a new generation of artificial intelligence [EB/OL ] . ( 2017-07-20 )[ 2021-04-22 ] . https://www.gov.cn/zhengce/content/2017-07/20/content_5211996.htm https://www.gov.cn/zhengce/content/2017-07/20/content_5211996.htm . (in Chinese)
中国信息通信研究院 . 全球人工智能战略与政策观察(2020) [EB/OL ] . ( 2020-12 )[ 2025-10-10 ] . https://www.cai-ct.ac.cn/kxyj/qwfb/ztbg/202012/P020201229520426700957.pdf https://www.cai-ct.ac.cn/kxyj/qwfb/ztbg/202012/P020201229520426700957.pdf .
China Academy of Information and Communications Technology . Global AI strategy and policy review ( 2020 )[EB/OL ] . ( 2020-12 )[ 2025-10-10 ] . https://www.caict.ac.cn/kxyj/qwfb/ztbg/202012/P020201229520426700957.pdf https://www.caict.ac.cn/kxyj/qwfb/ztbg/202012/P020201229520426700957.pdf . (in Chinese)
美国国家科学技术委员会(中国信息通信研究院政策与经济研究所编译组整理) . 美国国家人工智能研究和发展战略计划 [EB/OL ] . ( 2016-10 )[ 2025-10-10 ] . http://www.caict.ac.cn/kxyj/qwfb/ztbg/201804/P020161102381492504518.pdf http://www.caict.ac.cn/kxyj/qwfb/ztbg/201804/P020161102381492504518.pdf . DOI: 10.1007/978-981-15-6548-9 http://dx.doi.org/10.1007/978-981-15-6548-9
National Science and Technology Council of the United States (Compiled by the Compilation Group of the Institute of Policy and Economics, China Academy of Information and Communications Technology) . National artificial intelligence research and development strategic plan [EB/OL ] . ( 2016-10 )[ 2025-10-10 ] . http://www.caict.ac.cn/kxyj/qwfb/ztbg/201804/P020161102381492504518.pdf http://www.caict.ac.cn/kxyj/qwfb/ztbg/201804/P020161102381492504518.pdf . (in Chinese) . DOI: 10.1007/978-981-15-6548-9 http://dx.doi.org/10.1007/978-981-15-6548-9
The White House . Executive order on maintaining American leadership in artificial intelligence [EB/OL ] . ( 2019-02-11 )[ 2025-10-10 ] . https://trumpwhitehouse.archives.gov/presidential-actions/executive-order-maintaining-american-leadership-artificial-intelligence/ https://trumpwhitehouse.archives.gov/presidential-actions/executive-order-maintaining-american-leadership-artificial-intelligence/ .
The White House . The national artificial intelligence research and development strategic plan: 2019 Update [EB/OL ] . ( 2019-06 )[ 2025-10-10 ] . https://www.nitrd.gov/pubs/National-AI-RD-Strategy-2019.pdf https://www.nitrd.gov/pubs/National-AI-RD-Strategy-2019.pdf . DOI: 10.1609/aimag.v39i2.2803 http://dx.doi.org/10.1609/aimag.v39i2.2803
梅宏 , 曹东刚 , 谢涛 . 泛在操作系统: 面向人机物融合泛在计算的新蓝海 [J ] . 中国科学院院刊 , 2022 , 37 ( 1 ): 30 - 37 . DOI: 10.16418/j.issn.1000-3045.20211117009 http://dx.doi.org/10.16418/j.issn.1000-3045.20211117009
Mei H , Cao D G , Xie T . Ubiquitous operating system: Toward the blue ocean of human-cyber-physical ternary ubiquitous computing [J ] . Bulletin of the Chinese Academy of Sciences , 2022 , 37 ( 1 ): 30 - 37 . (in Chinese) . DOI: 10.16418/j.issn.1000-3045.20211117009 http://dx.doi.org/10.16418/j.issn.1000-3045.20211117009
Choi Y , Park S , Cha H . Graphics-aware power governing for mobile devices [C ] // Proceedings of the 17th Annual International Conference on Mobile Systems, Applications, and Services . New York : ACM , 2019 : 469 - 481 . DOI: 10.1145/3307334.3326075 http://dx.doi.org/10.1145/3307334.3326075
Choi Y , Park S , Jeon S , et al . Optimizing energy consumption of mobile games [J ] . IEEE Transactions on Mobile Computing , 2022 , 21 ( 10 ): 3744 - 3756 . DOI: 10.1109/tmc.2021.3058381 http://dx.doi.org/10.1109/tmc.2021.3058381
Du D , Hua Z C , Xia Y B , et al . XPC: Architectural support for secure and efficient cross process call [C ] // Proceedings of the 46th International Symposium on Computer Architecture . New York : ACM , 2019 : 671 - 684 . DOI: 10.1145/3307650.3322218 http://dx.doi.org/10.1145/3307650.3322218
Herder J N , Bos H , Gras B , et al . MINIX 3: A highly reliable, self-repairing operating system [J ] . ACM SIGOPS Operating Systems Review , 2006 , 40 ( 3 ): 80 - 89 . DOI: 10.1145/1151374.1151391 http://dx.doi.org/10.1145/1151374.1151391
Liedtke J . On micro-kernel construction [J ] . ACM SIGOPS Operating Systems Review , 1995 , 29 ( 5 ): 237 - 250 . DOI: 10.1145/224057.224075 http://dx.doi.org/10.1145/224057.224075
Liedtke J . Toward real microkernels [J ] . Communications of the ACM , 1996 , 39 ( 9 ): 70 - 77 . DOI: 10.1145/234215.234473 http://dx.doi.org/10.1145/234215.234473
Elphinstone K , Heiser G . From L3 to seL4 what have we learnt in 20 years of L4 microkernels? [C ] // Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles . New York : ACM , 2013 : 133 - 150 .
Chen H B , Miao X , Jia N , et al . Microkernel goes general: Performance and compatibility in the HongMeng production microkernel [C ] // 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI 24) . Berkeley : USENIX Association , 2024 : 465 - 485 .
Microsoft . MS Windows NT Kernel-mode user and GDI white paper [EB/OL ] . ( 2014-01-01 )[ 2025-08-10 ] . https://learn.microsoft.com/en-us/previous-versions/cc750820(v=technet.10) https://learn.microsoft.com/en-us/previous-versions/cc750820(v=technet.10) .
APple . XNU project [EB/OL ] . ( 2025-07-01 )[ 2025-08-10 ] . https://github.com/apple-oss-distributions/xnu https://github.com/apple-oss-distributions/xnu .
Engler D R . The Exokernel operating system architecture [D ] . Cambridge : Massachusetts Institute of Technology , 1998 .
余琪 . 基于外内核操作系统的调度研究 [D ] . 兰州 : 兰州大学 , 2014 . DOI: 10.14257/astl.2014.45.01 http://dx.doi.org/10.14257/astl.2014.45.01
Yu Q . Research on Exokernel system scheduling [D ] . Lanzhou : Lanzhou University , 2014 . (in Chinese) . DOI: 10.14257/astl.2014.45.01 http://dx.doi.org/10.14257/astl.2014.45.01
李肇中 . 基于外内核操作系统的文件系统研究 [D ] . 兰州 : 兰州大学 , 2016 .
Li Z Z . A study of file system based on Exokernel operating system [D ] . Lanzhou : Lanzhou University , 2016 . (in Chinese)
刘群 . 面向多核处理器的外内核操作系统研究 [D ] . 兰州 : 兰州大学 , 2014 .
Liu Q . A study of Exokernel operating system on [D ] . Lanzhou : Lanzhou University , 2014 . (in Chinese)
胡俊 . 外内核时间子系统的初步研究 [D ] . 兰州 : 兰州大学 , 2014 .
Hu J . Preliminary research on time subsystem in Exokernel [D ] . Lanzhou : Lanzhou University , 2014 . (in Chinese)
Hambarde P , Varma R , Jha S . The survey of real time operating system: RTOS [C ] // 2014 International Conference on Electronic Systems, Signal Processing and Computing Technologies . Piscataway : IEEE , 2014 : 34 - 39 . DOI: 10.1109/icesc.2014.15 http://dx.doi.org/10.1109/icesc.2014.15
Stankovic J A , Ramamritham K . The Spring kernel: A new paradigm for real-time operating systems [J ] . ACM SIGOPS Operating Systems Review , 1989 , 23 ( 3 ): 54 - 71 . DOI: 10.1145/71021.71024 http://dx.doi.org/10.1145/71021.71024
Mantegazza P , Dozio E L , Papacharalambous S . RTAI: Real time application interface [C ] // Linux Journal . New York : ACM , 2000 : 10-es.
Hildebrand D . An architectural overview of QNX [C ] // Proceedings of the Workshop on Micro-kernels and Other Kernel Architectures . New York : ACM , 1992 : 113 - 126 .
Grattan N , Brain M . Windows CE 3.0 [M ] . Upper Saddle River, NJ : Prentice Hall PTR , 2001 .
Burns A , Davis R I . A survey of research into mixed criticality systems [J ] . ACM Computing Surveys , 2018 , 50 ( 6 ): 1 - 37 . DOI: 10.1145/3131347 http://dx.doi.org/10.1145/3131347
Kappel J A , Velte A T , et al . Microsoft virtualization with Hyper-V [M ] . New York : McGraw Hill , 2009 .
Barham P , Dragovic B , Fraser K , et al . Xen and the art of virtualization [J ] . ACM SIGOPS Operating Systems Review , 2003 , 37 ( 5 ): 164 - 177 . DOI: 10.1145/1165389.945462 http://dx.doi.org/10.1145/1165389.945462
Kivity A , Kamay Y , Laor D , et al . KVM: The Linux virtual machine monitor [C ] // Proceedings of the Linux Symposium . Ottawa : OLS , 2007 : 225 - 230 .
Xi S S , Wilson J , Lu C Y , et al . RT-Xen: Towards real-time hypervisor scheduling in xen [C ] // Proceedings of the Ninth ACM International Conference on Embedded Software . New York : ACM , 2011 : 39 - 48 . DOI: 10.1145/2038642.2038651 http://dx.doi.org/10.1145/2038642.2038651
Abeni L , Faggioli D . An experimental analysis of the xen and KVM latencies [C ] // 2019 IEEE 22nd International Symposium on Real-Time Distributed Computing . Piscataway : IEEE , 2019 : 18 - 26 . DOI: 10.1109/isorc.2019.00014 http://dx.doi.org/10.1109/isorc.2019.00014
Cziva R , Pezaros D P . Container network functions: Bringing NFV to the network edge [J ] . IEEE Communications Magazine , 2017 , 55 ( 6 ): 24 - 31 . DOI: 10.1109/mcom.2017.1601039 http://dx.doi.org/10.1109/mcom.2017.1601039
Heiser G , Leslie B . The OKL4 microvisor: Convergence point of microkernels and hypervisors [C ] // Proceedings of the First ACM Asia-Pacific Workshop on Workshop on Systems . New York : ACM , 2010 : 19 - 24 . DOI: 10.1145/1851276.1851282 http://dx.doi.org/10.1145/1851276.1851282
Steinberg U , Kauer B . NOVA: A microhypervisor-based secure virtualization architecture [C ] // Proceedings of the 5th European Conference on Computer Systems . New York : ACM , 2010 : 209 - 222 . DOI: 10.1145/1755913.1755935 http://dx.doi.org/10.1145/1755913.1755935
Zhang F Z , Chen J , Chen H B , et al . CloudVisor: Retrofitting protection of virtual machines in multi-tenant cloud with nested virtualization [C ] // Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles . New York : ACM , 2011 : 203 - 216 . DOI: 10.1145/2043556.2043576 http://dx.doi.org/10.1145/2043556.2043576
Cinque M , Della Corte R , Eliso A , et al . Rt-cases: Container-based virtualization for temporally separated mixed-criticality task sets [C ] // 31st Euromicro Conference on Real-Time Systems (ECRTS 2019) . Wadern : Schloss Dagstuhl-Leibniz-Zentrum für Informatik , 2019 : 5 : 1 - 5 : 22 .
Lankes S , Pickartz S , Breitbart J . HermitCore: A unikernel for extreme scale computing [C ] // Proceedings of the 6th International Workshop on Runtime and Operating Systems for Supercomputers . New York : ACM , 2016 : 1 - 8 . DOI: 10.1145/2931088.2931093 http://dx.doi.org/10.1145/2931088.2931093
Martins J , Ahmed M , Raiciu C , et al . ClickOS and the art of network function virtualization [C ] // Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation . New York : ACM , 2014 : 459 - 473 . DOI: 10.1145/2491185.2491195 http://dx.doi.org/10.1145/2491185.2491195
Kaiser R , Wagner S . Evolution of the PikeOS microkernel [C ] // First International Workshop on Microkernels for Embedded Systems . Kensington : National ICT Australia , 2007 : 50 .
Masmano M , Ripoll I , Crespo A , et al . Xtratum: A hypervisor for safety critical embedded systems [C ] // 11th Real-Time Linux Workshop . Dresden : IEEE , 2009 : 1 - 9 .
martins j , tavares a , solieri m , et al . Bao: A lightweight static partitioning hypervisor for modern multi-core embedded systems [C ] // Workshop on Next Generation Real-Time Embedded Systems (NG-RES 2020) . Dagstuhl : Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik , 2020 : 3 : 1 - 3 : 14 .
Goodarzy S , Nazari M , Han R , et al . SmartOS: Towards automated learning and user-adaptive resource allocation in operating systems [C ] // Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems . New York : ACM , 2021 : 48 - 55 . DOI: 10.1145/3476886.3477519 http://dx.doi.org/10.1145/3476886.3477519
Aaen Springborg A , Albano M , Xavier-de-Souza S . Automatic energy-efficient job scheduling in HPC: A novel SLURM plugin approach [C ] // Proceedings of the SC '23 Workshops of the International Conference on High Performance Computing , Network, Storage, and Analysis . New York : ACM , 2023 : 1831 - 1838 . DOI: 10.1145/3624062.3624265 http://dx.doi.org/10.1145/3624062.3624265
Othmen S , Mansouri W , Khdhir R . Applying artificial intelligence techniques for resource management in the Internet of Things (IoT) [J ] . International Journal of Electrical and Computer Engineering Systems , 2025 , 16 ( 2 ): 183 - 194 . DOI: 10.32985/ijeces.16.2.1 http://dx.doi.org/10.32985/ijeces.16.2.1
Selvarajan G P . AI-driven cloud resource management and orchestration [J ] . International Journal of Innovative Research in Science , Engineering and Technology , 2024 , 13 ( 11 ): 19367 - 19380 . DOI: 10.15680/ijirset.2024.1311206 http://dx.doi.org/10.15680/ijirset.2024.1311206
Qin X X , Zeng F P , Zhang Y . MSNdroid: The Android malware detector based on multi-class features and deep belief network [C ] // Proceedings of the ACM Turing Celebration Conference - China . New York : ACM , 2019 : 1 - 5 .
Alzaylaee M K , Yerima S Y , Sezer S . DL-Droid: Deep learning based Android malware detection using real devices [J ] . Computers & Security , 2020 , 89 : 101663 . DOI: 10.1016/j.cose.2019.101663 http://dx.doi.org/10.1016/j.cose.2019.101663
Benabderrahmane S , Hoang N , Valtchev P , et al . Hack me if you can: Aggregating autoencoders for countering persistent access threats within highly imbalanced data [J ] . Future Generation Computer Systems , 2024 , 160 : 926 - 941 . DOI: 10.1016/j.future.2024.06.050 http://dx.doi.org/10.1016/j.future.2024.06.050
Dusane P , Sujatha G . LogEA: Log extraction and analysis tool to support forensic investigation of linux-based system [C ] // 2021 5th International Conference on Trends in Electronics and Informatics . Piscataway : IEEE , 2021 : 909 - 916 . DOI: 10.1109/icoei51242.2021.9453065 http://dx.doi.org/10.1109/icoei51242.2021.9453065
Das A , Mueller F , Siegel C , et al . Desh: Deep learning for system health prediction of lead times to failure in HPC [C ] // Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing . New York : ACM , 2018 : 40 - 51 . DOI: 10.1145/3208040.3208051 http://dx.doi.org/10.1145/3208040.3208051
Li T G , Ying S , Zhao Y S , et al . Batch jobs load balancing scheduling in cloud computing using distributional reinforcement learning [J ] . IEEE Transactions on Parallel and Distributed Systems , 2024 , 35 ( 1 ): 169 - 185 . DOI: 10.1109/tpds.2023.3334519 http://dx.doi.org/10.1109/tpds.2023.3334519
Liu J , Jia J C , Ma B C , et al . Multi-job intelligent scheduling with cross-device federated learning [J ] . IEEE Transactions on Parallel and Distributed Systems , 2023 , 34 ( 2 ): 535 - 551 . DOI: 10.1109/tpds.2022.3224941 http://dx.doi.org/10.1109/tpds.2022.3224941
Schüpbach A L . Tackling OS complexity with declarative techniques [D ] . Zurich : ETH Zurich , 2012 .
Chen J D , Banerjee S S , Kalbarczyk Z T , et al . Machine learning for load balancing in the Linux kernel [C ] // Proceedings of the 11th ACM SIGOPS Asia-Pacific Workshop on Systems . New York : ACM , 2020 : 67 - 74 . DOI: 10.1145/3409963.3410492 http://dx.doi.org/10.1145/3409963.3410492
Panman de Wit J S , Bucur D , van der Ham J . Dynamic detection of mobile malware using smartphone data and machine learning [J ] . Digital Threats: Research and Practice , 2022 , 3 ( 2 ): 1 - 24 . DOI: 10.1145/3484246 http://dx.doi.org/10.1145/3484246
Wihar J , Mathur R , Northington K , et al . A novel approach of LSTM-based ransomware detection in the Linux operating system kernel [EB/OL ] . ( 2024-09-19 )[ 2025-08-19 ] . https://d197for5662m48.cloudfront.net/documents/publicationstatus/224324/preprint_pdf/7408d68c8fd3caba127923e3a8736481.pdf https://d197for5662m48.cloudfront.net/documents/publicationstatus/224324/preprint_pdf/7408d68c8fd3caba127923e3a8736481.pdf . DOI: 10.36227/techrxiv.172671742.26785099/v1 http://dx.doi.org/10.36227/techrxiv.172671742.26785099/v1
Kommrusch S . Artificial intelligence techniques for security vulnerability prevention [PP/OL ] . V1. arXiv ( 2019-12-14 )[ 2025-12-01 ] . https://doi.org/10.48550/arXiv.1912.06796 https://doi.org/10.48550/arXiv.1912.06796 .
Agarwal T , Tyagi A K . Artificial intelligence and qubit‐based operating systems: Current progress and future perspectives [M ] // Quantum Computing in Cybersecurity . Hoboken : Wiley , 2023 : 121 - 136 . DOI: 10.1002/9781394167401.ch7 http://dx.doi.org/10.1002/9781394167401.ch7
George A S . Emerging trends in AI-driven cybersecurity: An in-depth analysis [J ] . Partners Universal Innovative Research Publication , 2024 , 2 ( 4 ): 15 - 28 .
Temara S . Harnessing the power of artificial intelligence to enhance next-generation cybersecurity [J ] . World Journal of Advanced Research and Reviews , 2024 , 23 ( 2 ): 797 - 811 . DOI: 10.30574/wjarr.2024.23.2.2428 http://dx.doi.org/10.30574/wjarr.2024.23.2.2428
Rizvi M . Enhancing cybersecurity: The power of artificial intelligence in threat detection and prevention [J ] . International Journal of Advanced Engineering Research and Science , 2023 , 10 ( 5 ): 55 - 60 . DOI: 10.22161/ijaers.105.8 http://dx.doi.org/10.22161/ijaers.105.8
Zhang Y J , Zhou L W , Makris Y . Hardware-based real-time workload forensics via frame-level TLB profiling [C ] // 2019 IEEE 37th VLSI Test Symposium . Piscataway : IEEE , 2019 : 1 - 6 . DOI: 10.1109/vts.2019.8758626 http://dx.doi.org/10.1109/vts.2019.8758626
Locher M G . Optimizing IT operations with AIOps: An investigation into the opportunities and challenges for enterprise adoption [D ] . Switzerland : Zurich University of Applied Sciences , 2023 .
Notaro P , Cardoso J , Gerndt M . A survey of AIOps methods for failure management [J ] . ACM Transactions on Intelligent Systems and Technology , 2021 , 12 ( 6 ): 1 - 45 . DOI: 10.1145/3483424 http://dx.doi.org/10.1145/3483424
Zhang L Z , Jia T , Jia M X , et al . A survey of AIOps in the era of large language models [J ] . ACM Computing Surveys , 2026 , 58 ( 2 ): 1 - 35 . DOI: 10.1145/3746635 http://dx.doi.org/10.1145/3746635
Peng Y K , Tian H L , Xian J Y , et al . Framekernel: A safe and efficient kernel architecture via rust-based intra-kernel privilege separation [C ] // Proceedings of the 15th ACM SIGOPS Asia-Pacific Workshop on Systems . New York : ACM , 2024 : 31 - 37 . DOI: 10.1145/3678015.3680492 http://dx.doi.org/10.1145/3678015.3680492
Peng Y K , Tian H L , Zhang J Y , et al . Asterinas: A linux ABI-compatible, rust-based framekernel OS with a small and sound TCB [PP/OL ] . ( 2025-06-04 )[ 2025-11-10 ] . https://doi.org/10.48550/arXiv.2506.03876 https://doi.org/10.48550/arXiv.2506.03876 .
Hazarika A , Poddar S , Rahaman H . Survey on memory management techniques in heterogeneous computing systems [J ] . IET Computers & Digital Techniques , 2020 , 14 ( 2 ): 47 - 60 . DOI: 10.1049/iet-cdt.2019.0092 http://dx.doi.org/10.1049/iet-cdt.2019.0092
Arm Limited . Introduction to the memory tagging extension [EB/OL ] . [ 2025-08-19 ] . https://developer.arm.com/documentation/108035/0100/Introduction-to-the-Memory-Tagging-Extension https://developer.arm.com/documentation/108035/0100/Introduction-to-the-Memory-Tagging-Extension .
Partap A , Boneh D . Memory tagging: A memory efficient design [PP/OL ] . V2. arXiv ( 2022-11-03 )[ 2025-10-11 ] . https://doi.org/10.48550/arXiv.2209.00307 https://doi.org/10.48550/arXiv.2209.00307 .
Arm Limited . Pointer authentication code [EB/OL ] . [ 2025-08-18 ] . https://developer.arm.com/documentation/109576/0100/Pointer-Authentication-Code https://developer.arm.com/documentation/109576/0100/Pointer-Authentication-Code .
Kim H , Lee J , Kim S , et al . How'd security benefit reverse engineers? : The implication of intel CET on function identification [C ] // 2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks . Piscataway : IEEE , 2022 : 559 - 566 . DOI: 10.1109/dsn53405.2022.00061 http://dx.doi.org/10.1109/dsn53405.2022.00061
Microsoft . Kernel-mode hardware-enforced stack protection [EB/OL ] . ( 2023-01-27 )[ 2025-08-19 ] . https://learn.microsoft.com/en-us/windows-server/security/kernel-mode-hardware-stack-protection https://learn.microsoft.com/en-us/windows-server/security/kernel-mode-hardware-stack-protection .
De A , Basu A , Ghosh S , et al . FIXER: Flow integrity extensions for embedded RISC-V [C ] // 2019 Design , Automation & Test in Europe Conference & Exhibition . Piscataway : IEEE , 2019 : 348 - 353 . DOI: 10.23919/date.2019.8714980 http://dx.doi.org/10.23919/date.2019.8714980
Wang Y , Wu J T , Yue T , et al . RetTag: Hardware-assisted return address integrity on RISC-V [C ] // Proceedings of the 15th European Workshop on Systems Security . New York : ACM , 2022 : 50 - 56 . DOI: 10.1145/3517208.3523758 http://dx.doi.org/10.1145/3517208.3523758
Watson R N M , Neumann P G , Woodruff J , et al . Capability hardware enhanced RISC instructions: CHERI instruction-set architecture [R ] . Cambridge : University of Cambridge , 2014 .
Amar S , Chisnall D , Chen T , et al . CHERIoT: Complete memory safety for embedded devices [C ] // Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture . New York : ACM , 2023 : 641 - 653 . DOI: 10.1145/3613424.3614266 http://dx.doi.org/10.1145/3613424.3614266
Yoo S , Jerraya A A . Introduction to hardware abstraction layers for SoC [C ] // 2003 Design, Automation and Test in Europe Conference and Exhibition . Piscataway : IEEE , 2003 : 336 - 337 .
Mayrhofer R , Vander Stoep J , Brubaker C , et al . The Android platform security model (2023) [PP/OL ] . V3. arXiv ( 2024-01-09 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.1904.05572 https://doi.org/10.48550/arXiv.1904.05572 .
Yim K S , Malchev I , Hsieh A , et al . TREBLE: Fast software updates by creating an equilibrium in an active software ecosystem of globally distributed stakeholders [PP/OL ] . V1. arXiv ( 2024-09-19 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2410.02809 https://doi.org/10.48550/arXiv.2410.02809 .
Microsoft . Secure boot [EB/OL ] . ( 2024-05-28 )[ 2025-08-18 ] . https://learn.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-secure-boot https://learn.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-secure-boot .
Jia Y K , Liu S , Wang W H , et al . HyperEnclave: An open and cross-platform trusted execution environment [PP/OL ] . V1. arXiv ( 2022-12-08 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2212.04197 https://doi.org/10.48550/arXiv.2212.04197 .
Microsoft . Virtualization-based security (VBS) [EB/OL ] . ( 2024-05-28 )[ 2025-08-18 ] . https://learn.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-vbs https://learn.microsoft.com/en-us/windows-hardware/design/device-experiences/oem-vbs .
Microsoft . Device guard and credential guard [EB/OL ] . ( 2024-05-28 )[ 2025-08-18 ] . https://learn.microsoft.com/en-us/windows-hardware/drivers/bringup/device-guard-and-credential-guard https://learn.microsoft.com/en-us/windows-hardware/drivers/bringup/device-guard-and-credential-guard .
Wei H J , Lin L K , Lin C Y , et al . Measuring and optimizing the performance of the Android virtualization framework [C ] // Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing . New York : ACM , 2024 : 1533 - 1535 . DOI: 10.1145/3605098.3636097 http://dx.doi.org/10.1145/3605098.3636097
Apple Inc . Virtualization: Create and manage virtual machines on Apple silicon and Intel-based Mac computers [EB/OL ] . ( 2024-06-10 )[ 2025-08-18 ] . https://developer.apple.com/documentation/virtualization https://developer.apple.com/documentation/virtualization .
Randazzo A , Tinnirello I . Kata containers: An emerging architecture for enabling MEC services in fast and secure way [C ] // 2019 Sixth International Conference on Internet of Things: Systems, Management and Security . Piscataway : IEEE , 2019 : 209 - 214 . DOI: 10.1109/iotsms48152.2019.8939164 http://dx.doi.org/10.1109/iotsms48152.2019.8939164
Rutkowska J , Wojtczuk R . Qubes OS architecture [J ] . Invisible Things Lab Tech Rep , 2010 , 54 : 65 .
Karat C M , Vergo J , Nahamoo D . Conversational interface technologies [M ] // The Human-Computer Interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications . New Jersey: L . Erlbaum Associates Inc., 2002 : 169 - 186 . DOI: 10.1201/9781410606723-24 http://dx.doi.org/10.1201/9781410606723-24
Darejeh A , Mashayekh S , Marcus N . Cognitive-based methods to facilitate learning of software applications via E-learning systems [J ] . Cogent Education , 2022 , 9 : 2082085 . DOI: 10.1080/2331186x.2022.2082085 http://dx.doi.org/10.1080/2331186x.2022.2082085
Wang L , Ma C , Feng X Y , et al . A survey on large language model based autonomous agents [J ] . Frontiers of Computer Science , 2024 , 18 ( 6 ): 186345 . DOI: 10.1007/s11704-024-40231-1 http://dx.doi.org/10.1007/s11704-024-40231-1
Mei K , Zhu X , Xu W J , et al . AIOS: LLM agent operating system [PP/OL ] . V5. arXiv ( 2025-08-12 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2403.16971 https://doi.org/10.48550/arXiv.2403.16971 .
Stratton J . An introduction to microsoft copilot [M ] // Copilot for Microsoft 365 . Berkeley, CAA Press, 2024 : 19 - 35 . DOI: 10.1007/979-8-8688-0447-2_2 http://dx.doi.org/10.1007/979-8-8688-0447-2_2
Hou X Y , Zhao Y J , Wang S N , et al . Model context protocol (MCP): Landscape, security threats, and future research directions [PP/OL ] . V3. arXiv ( 2025-10-07 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2503.23278 https://doi.org/10.48550/arXiv.2503.23278 .
Yan A , Yang Z Y , Zhu W R , et al . GPT-4V in wonderland: Large multimodal models for zero-shot smartphone GUI navigation [PP/OL ] . V1. arXiv ( 2023-11-13 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2311.07562 https://doi.org/10.48550/arXiv.2311.07562 .
Zhang C , Yang Z , Liu J X , et al . AppAgent: Multimodal agents as smartphone users [C ] // Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems . New York : ACM , 2025 : 1 - 20 .
Wang J Y , Xu H Y , Ye J B , et al . Mobile-agent: Autonomous multi-modal mobile device agent with visual perception [PP/OL ] . V2. arXiv ( 2024-04-18 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2401.16158 https://doi.org/10.48550/arXiv.2401.16158 .
Zhang C Y , Li L Q , He S L , et al . UFO: A UI-focused agent for windows OS interaction [PP/OL ] . V5. arXiv ( 2024-05-23 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.07939 https://doi.org/10.48550/arXiv.2402.07939 .
Hong W Y , Wang W H , Lv Q S , et al . CogAgent: A visual language model for GUI agents [C ] // 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition . Piscataway : IEEE , 2024 : 14281 - 14290 . DOI: 10.1109/cvpr52733.2024.01354 http://dx.doi.org/10.1109/cvpr52733.2024.01354
Dongle . AutoMate [EB/OL ] . ( 2025-03-20 )[ 2025-10-10 ] . https://github.com/yuruotong1/autoMate https://github.com/yuruotong1/autoMate .
Huang J , Chang K C . Towards reasoning in large language models: A survey [PP/OL ] . V2. arXiv ( 2023-05-26 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2212.10403 https://doi.org/10.48550/arXiv.2212.10403 .
Du Y , Wei F Y , Zhang H Y . AnyTool: Self-reflective, hierarchical agents for large-scale API calls [PP/OL ] . V1. arXiv ( 2024-02-06 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.04253 https://doi.org/10.48550/arXiv.2402.04253 .
Guo T C , Chen X Y , Wang Y Q , et al . Large language model based multi-agents: A survey of progress and challenges [PP/OL ] . V2. arXiv ( 2024-04-19 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.01680 https://doi.org/10.48550/arXiv.2402.01680 .
Google . A2A protocol [EB/OL ] . [ 2025-08-12 ] . https://a2a-protocol.org/latest/ https://a2a-protocol.org/latest/ . DOI: 10.17504/protocols.io.8epv5k7r6v1b/v1 http://dx.doi.org/10.17504/protocols.io.8epv5k7r6v1b/v1
OpenAI . GPT-4o [EB/OL ] . ( 2024-08-28 )[ 2025-08-12 ] . https://platform.openai.com/docs/models/gpt-4o https://platform.openai.com/docs/models/gpt-4o .
Team G , Anil R , Borgeaud S , et al . Gemini: A family of highly capable multimodal models [PP/OL ] . V5. arXiv ( 2025-05-09 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2312.11805 https://doi.org/10.48550/arXiv.2312.11805 .
Sager P J , Meyer B , Yan P , et al . A comprehensive survey of agents for computer use: Foundations, challenges, and future directions [PP/OL ] . V2. arXiv ( 2025-06-04 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2501.16150 https://doi.org/10.48550/arXiv.2501.16150 .
Hu X Y , Xiong T , Yi B , et al . OS agents: A survey on MLLM-based agents for general computing devices use [PP/OL ] . V1. arXiv ( 2025-08-06 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2508.04482 https://doi.org/10.48550/arXiv.2508.04482 .
OpenAI . GPT-4V(ision) [EB/OL ] . ( 2024-09-22 )[ 2025-08-12 ] . https://platform.openai.com/docs/models/gpt-4-turbo-and-gpt-4 https://platform.openai.com/docs/models/gpt-4-turbo-and-gpt-4 .
Muñoz A , Ríos R , Román R , et al . A survey on the (in)security of trusted execution environments [J ] . Computers & Security , 2023 , 129 : 103180 . DOI: 10.1016/j.cose.2023.103180 http://dx.doi.org/10.1016/j.cose.2023.103180
刘国杰 , 张建标 , 杨萍 , 等 . 基于TPCM的容器云可信环境研究 [J ] . 网络与信息安全学报 , 2021 , 7 ( 4 ): 164 - 174 .
Liu Guojie , Zhang Jianbiao , Yang Ping , et al . Research on the trusted environment of container cloud based on the TPCM [J ] . Chinese Journal of Network and Information Security , 2021 , 7 ( 4 ): 164 - 174 . (in Chinese)
徐涛 , 孔帅迪 , 刘才华 , 等 . 异构机密计算综述 [J ] . 吉林大学学报(工学版) , 2025 , 55 ( 3 ): 755 - 770 .
Xu Tao , Kong Sshuaidi , Liu Caihua , et al . Overview of heterogeneous confidential computing [J ] . Journal of Jilin University (Engineering and Technology Edition) , 2025 , 55 ( 3 ): 755 - 770 . (in Chinese)
Zhou Y L , Peng G J , Li Z C , et al . A survey on the evolution of bootkits attack and defense techniques [J ] . China Communications , 2024 , 21 ( 1 ): 102 - 130 . DOI: 10.23919/jcc.ja.2022-0409 http://dx.doi.org/10.23919/jcc.ja.2022-0409
Zheng C M , Li J , Yao X X . Design and implementation of trusted boot based on a new trusted computing dual-architecture [J ] . Computers & Security , 2023 , 127 : 103095 . DOI: 10.1016/j.cose.2023.103095 http://dx.doi.org/10.1016/j.cose.2023.103095
Mohammadzad M , Karimpour J . Using rootkits hiding techniques to conceal honeypot functionality [J ] . Journal of Network and Computer Applications , 2023 , 214 : 103606 . DOI: 10.1016/j.jnca.2023.103606 http://dx.doi.org/10.1016/j.jnca.2023.103606
Huang H X , Zhang J B , Zhang L , et al . SABDTM: Security-first architecture-based dynamic trusted measurement scheme for operating system of the virtual computing node [J ] . Computers & Security , 2024 , 137 : 103648 . DOI: 10.1016/j.cose.2023.103648 http://dx.doi.org/10.1016/j.cose.2023.103648
Chen H . Optimized design of trusted boot architecture based on embedded devices [C ] // 2025 4th International Symposium on Computer Applications and Information Technology . Piscataway : IEEE , 2025 : 1043 - 1046 . DOI: 10.1109/iscait64916.2025.11010744 http://dx.doi.org/10.1109/iscait64916.2025.11010744
Feng D G , Qin Y , Feng W , et al . Survey of research on confidential computing [J ] . IET Communications , 2024 , 18 ( 9 ): 535 - 556 . DOI: 10.1049/cmu2.12759 http://dx.doi.org/10.1049/cmu2.12759
Russinovich M . Confidential computing: Elevating cloud security and privacy [J ] . Communications of the ACM , 2024 , 67 ( 1 ): 52 - 53 . DOI: 10.1145/3624577 http://dx.doi.org/10.1145/3624577
Sabanic P , Misono M , Bodea T , et al . Confidential serverless computing [PP/OL ] . V3. arXiv ( 2025-08-13 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2504.21518 https://doi.org/10.48550/arXiv.2504.21518 .
Popa R A . Confidential computing or cryptographic computing? [J ] . Communications of the ACM , 2024 , 67 ( 12 ): 44 - 51 . DOI: 10.1145/3677616 http://dx.doi.org/10.1145/3677616
Martishin S A , Khrapchenko M V , Shokurov A V . Study of the problem of ensuring security in storage and processing of confidential data [J ] . Programming and Computer Software , 2022 , 48 ( 7 ): 424 - 434 . DOI: 10.1134/s0361768822070064 http://dx.doi.org/10.1134/s0361768822070064
Benadjila R , Khati L , Vergnaud D . Secure storage: Confidentiality and authentication [J ] . Computer Science Review , 2022 , 44 : 100465 . DOI: 10.1016/j.cosrev.2022.100465 http://dx.doi.org/10.1016/j.cosrev.2022.100465
Global Market Insights . Global market insights: Market research reports [EB/OL ] . ( 2025-03-01 )[ 2025-10-10 ] . https://www.gminsights.com/industry-analysis/edge-ai-market https://www.gminsights.com/industry-analysis/edge-ai-market . DOI: 10.22541/au.172719868.82425072/v1 http://dx.doi.org/10.22541/au.172719868.82425072/v1
Qin R Y , Liu D C , Xu C H , et al . Empirical guidelines for deploying LLMs onto resource-constrained edge devices [J ] . ACM Transactions on Design Automation of Electronic Systems , 2025 , 30 ( 5 ): 1 - 58 . DOI: 10.1145/3736721 http://dx.doi.org/10.1145/3736721
Liu L Y , Zhu H B , Chen S L , et al . Privacy-aware task assignment for IoT audit applications on collaborative edge devices [J ] . Security and Communication Networks , 2022 , 2022 : 1336094 . DOI: 10.1155/2022/1336094 http://dx.doi.org/10.1155/2022/1336094
Touvron H , Lavril T , Izacard G , et al . LLaMA: Open and efficient foundation language models [PP/OL ] . V1. arXiv ( 2023-02-27 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2302.13971 https://doi.org/10.48550/arXiv.2302.13971 .
Seedat N , Huynh N , van Breugel B , et al . Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes [PP/OL ] . V3. arXiv ( 2024-06-30 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2312.12112 https://doi.org/10.48550/arXiv.2312.12112 .
Dell . Llama 2: Inferencing on a single gpu [EB/OL ] . [ 2025-08-10 ] . https://infohub.delltechnologies.com/zh-cn/t/llama-2-inferencing-on-a-single-gpu/ https://infohub.delltechnologies.com/zh-cn/t/llama-2-inferencing-on-a-single-gpu/ .
Zhou Z X , Ning X F , Hong K , et al . A survey on efficient inference for large language models [PP/OL ] . V3. arXiv ( 2024-07-19 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2404.14294 https://doi.org/10.48550/arXiv.2404.14294 .
Chung J W , Liu J , Wu Z , et al . The ML .ENERGY leaderboard[EB/OL ] . [ 2025-08-10 ] . https://ml.energy/leaderboard https://ml.energy/leaderboard .
Xu J J , Li Z Y , Chen W , et al . On-device language models: A comprehensive review [PP/OL ] . V2. arXiv ( 2024-09-14 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2409.00088 https://doi.org/10.48550/arXiv.2409.00088 .
李诚 . 面向LLM推理服务的体系结构 [R/OL ] . ( 2024-07-01 )[ 2025-07-20 ] . http://syswonder-cdn.oscommunity.cn/20240706-llmarch-3-lc.pdf http://syswonder-cdn.oscommunity.cn/20240706-llmarch-3-lc.pdf .
Li Cheng . Architecture for LLM inference services [R/OL ] . ( 2024-07-01 )[ 2025-07-20 ] . http://syswonder-cdn.oscommunity.cn/20240706-llmarch-3-lc.pdf http://syswonder-cdn.oscommunity.cn/20240706-llmarch-3-lc.pdf .
Press O , Wolf L . Using the output embedding to improve language models [PP/OL ] . V3. arXiv ( 2017-02-21 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.1608.05859 https://doi.org/10.48550/arXiv.1608.05859 .
Liu Z C , Zhao C S , Iandola F , et al . MobileLLM: Optimizing sub-billion parameter language models for on-device use cases [PP/OL ] . V2. arXiv ( 2024-06-27 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.14905 https://doi.org/10.48550/arXiv.2402.14905 .
Santacroce M , Wen Z X , Shen Y L , et al . What matters in the structured pruning of generative language models? [PP/OL ] . V1. arXiv ( 2023-02-07 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2302.03773 https://doi.org/10.48550/arXiv.2302.03773 .
Ma X Y , Fang G F , Wang X C . LLM-pruner: On the structural pruning of large language models [PP/OL ] . V3. arXiv ( 2023-09-28 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.11627 https://doi.org/10.48550/arXiv.2305.11627 .
Chen T Y , Ding T Y , Yadav B , et al . LoRAShear: Efficient large language model structured pruning and knowledge recovery [PP/OL ] . V2. arXiv ( 2023-10-31 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2310.18356 https://doi.org/10.48550/arXiv.2310.18356 .
Zhang M Y , Chen H , Shen C H , et al . LoRAPrune: Structured pruning meets low-rank parameter-efficient fine-tuning [C ] // Findings of the Association for Computational Linguistics ACL 2024 . Stroudsburg : ACL , 2024 : 3013 - 3026 . DOI: 10.18653/v1/2024.findings-acl.178 http://dx.doi.org/10.18653/v1/2024.findings-acl.178
Frantar E , Alistarh D . SparseGPT: Massive language models can be accurately pruned in one-shot [PP/OL ] . V3. arXiv ( 2023-03-22 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2301.00774 https://doi.org/10.48550/arXiv.2301.00774 .
Sun M J , Liu Z , Bair A , et al . A simple and effective pruning approach for large language models [PP/OL ] . V3. arXiv ( 2024-05-06 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2306.11695 https://doi.org/10.48550/arXiv.2306.11695 .
Shao H , Liu B , Qian Y M . One-shot sensitivity-aware mixed sparsity pruning for large language models [C ] // ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing . Piscataway : IEEE , 2024 : 11296 - 11300 . DOI: 10.1109/icassp48485.2024.10445737 http://dx.doi.org/10.1109/icassp48485.2024.10445737
Gu Y X , Dong L , Wei F R , et al . MiniLLM: On-policy distillation of large language models [PP/OL ] . V6. arXiv ( 2026-01-31 )[ 2025-12-01 ] . https://doi.org/10.48550/arXiv.2306.08543 https://doi.org/10.48550/arXiv.2306.08543 .
Zhang C , Li Q C , Song D W , et al . Towards the law of capacity gap in distilling language models [PP/OL ] . V4. arXiv ( 2025-07-30 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2311.07052 https://doi.org/10.48550/arXiv.2311.07052 .
Agarwal R , Vieillard N , Zhou Y C , et al . On-policy distillation of language models: Learning from self-generated mistakes [PP/OL ] . V3. arXiv ( 2024-01-17 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2306.13649 https://doi.org/10.48550/arXiv.2306.13649 .
Zhu X K , Qi B Q , Zhang K Y , et al . PaD: Program-aided distillation can teach small models reasoning better than chain-of-thought fine-tuning [PP/OL ] . V2. arXiv ( 2024-03-20 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.13888 https://doi.org/10.48550/arXiv.2305.13888 .
Li L H , Hessel J , Yu Y , et al . Symbolic chain-of-thought distillation: Small models can also "think" step-by-step [PP/OL ] . V2. arXiv ( 2024-04-15 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2306.14050 https://doi.org/10.48550/arXiv.2306.14050 .
Wang P F , Wang Z Y , Li Z , et al . SCOTT: Self-consistent chain-of-thought distillation [PP/OL ] . V4. arXiv ( 2023-08-30 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.01879 https://doi.org/10.48550/arXiv.2305.01879 .
Chen Z M , Gao Q Y , Bosselut A , et al . DISCO: Distilling counterfactuals with large language models [PP/OL ] . V3. arXiv ( 2023-06-05 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2212.10534 https://doi.org/10.48550/arXiv.2212.10534 .
Kim S , Mangalam K , Moon S , et al . Speculative decoding with big little decoder [C ] // Proceedings of the 37th International Conference on Neural Information Processing Systems . New York : ACM , 2023 : 39236 - 39256 . DOI: 10.52202/075280-1705 http://dx.doi.org/10.52202/075280-1705
Xu D L , Yin W S , Jin X , et al . LLMCad: Fast and scalable on-device large language model inference [PP/OL ] . V1. arXiv ( 2023-09-08 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2309.04255 https://doi.org/10.48550/arXiv.2309.04255 .
Dao T , Fu D Y , Ermon S , et al . FlashAttention: Fast and memory-efficient exact attention with IO-awareness [PP/OL ] . V2. arXiv ( 2022-06-23 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2205.14135 https://doi.org/10.48550/arXiv.2205.14135 .
Kwon W , Li Z H , Zhuang S Y , et al . Efficient memory management for large language model serving with PagedAttention [C ] // Proceedings of the 29th Symposium on Operating Systems Principles . New York : ACM , 2023 : 611 - 626 . DOI: 10.1145/3600006.3613165 http://dx.doi.org/10.1145/3600006.3613165
Ggerganov , Slaren , Ngxson , et al . llama .cpp: Lightweight library for approximate nearest neighbors and maximum inner product search[EB/OL ] . ( 2025-08-29 )[ 2025-10-10 ] . https://github.com/ggerganov/llama.cpp https://github.com/ggerganov/llama.cpp .
Song Y X , Mi Z Y , Xie H T , et al . PowerInfer: Fast large language model serving with a consumer-grade GPU [C ] // Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles . New York : ACM , 2024 : 590 - 606 . DOI: 10.1145/3694715.3695964 http://dx.doi.org/10.1145/3694715.3695964
Li L C , Qian S , Lu J , et al . Transformer-lite: High-efficiency deployment of large language models on mobile phone GPUs [PP/OL ] . V3. arXiv ( 2024-07-05 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2403.20041 https://doi.org/10.48550/arXiv.2403.20041 .
Ding D J , Mallick A , Wang C , et al . Hybrid LLM: Cost-efficient and quality-aware query routing [PP/OL ] . V1. arXiv ( 2024-04-22 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2404.14618 https://doi.org/10.48550/arXiv.2404.14618 .
Wang Y D , Chen K , Tan H S , et al . Tabi: An efficient multi-level inference system for large language models [C ] // Proceedings of the Eighteenth European Conference on Computer Systems . New York : ACM , 2023 : 233 - 248 . DOI: 10.1145/3552326.3587438 http://dx.doi.org/10.1145/3552326.3587438
Vllm Project Team . VLLM documentation [EB/OL ] . ( 2025-11-15 )[ 2025-12-01 ] . https://docs.vllm.ai/en/stable/ https://docs.vllm.ai/en/stable/ .
Cai F L , Yuan D , Yang Z , et al . Edge-LLM: A collaborative framework for large language model serving in edge computing [C ] // 2024 IEEE International Conference on Web Services . Piscataway : IEEE , 2024 : 799 - 809 . DOI: 10.1109/icws62655.2024.00099 http://dx.doi.org/10.1109/icws62655.2024.00099
Xu X H , Li M , Tao C Y , et al . A survey on knowledge distillation of large language models [PP/OL ] . V4. arXiv ( 2024-10-21 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.13116 https://doi.org/10.48550/arXiv.2402.13116 .
Wang R , Gao Z Y , Zhang L Y , et al . Empowering large language models to edge intelligence: A survey of edge efficient LLMs and techniques [J ] . Computer Science Review , 2025 , 57 : 100755 . DOI: 10.1016/j.cosrev.2025.100755 http://dx.doi.org/10.1016/j.cosrev.2025.100755
Chen C , Borgeaud S , Irving G , et al . Accelerating large language model decoding with speculative sampling [PP/OL ] . V1. arXiv ( 2023-02-02 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2302.01318 https://doi.org/10.48550/arXiv.2302.01318 .
Leviathan Y , Kalman M , Matias Y . Fast inference from transformers via speculative decoding [PP/OL ] . V2. arXiv ( 2023-05-18 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2211.17192 https://doi.org/10.48550/arXiv.2211.17192 .
Gholami A , Hooper C , Keutzer K , et al . KVQuant: Towards 10 million context length LLM inference with KV cache quantization [C ] // Advances in Neural Information Processing Systems 37 . Neural Information Processing Systems Foundation, Inc. (NeurIPS) , 2024 : 1270 - 1303 . DOI: 10.52202/079017-0040 http://dx.doi.org/10.52202/079017-0040
Xiao G X , Tian Y D , Chen B D , et al . Efficient streaming language models with attention sinks [PP/OL ] . V4. arXiv ( 2024-04-07 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2309.17453 https://doi.org/10.48550/arXiv.2309.17453 .
Liu Z C , Desai A , Liao F S , et al . Scissorhands: Exploiting the persistence of importance hypothesis for LLM KV cache compression at test time [PP/OL ] . V2. arXiv ( 2023-08-28 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.17118 https://doi.org/10.48550/arXiv.2305.17118 .
Zhang Z Y , Sheng Y , Zhou T Y , et al . H$_2$O: Heavy-hitter oracle for efficient generative inference of large language models [PP/OL ] . V3. arXiv ( 2023-12-18 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2306.14048 https://doi.org/10.48550/arXiv.2306.14048 .
Liu Z R , Yuan J Y , Jin H Y , et al . KIVI: A tuning-free asymmetric 2bit quantization for KV cache [PP/OL ] . V2. arXiv ( 2024-07-25 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.02750 https://doi.org/10.48550/arXiv.2402.02750 .
Schuster T , Fisch A , Gupta J , et al . Confident adaptive language modeling [PP/OL ] . V2. arXiv ( 2022-10-25 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2207.07061 https://doi.org/10.48550/arXiv.2207.07061 .
Zeng Z Q , Hong Y H , Dai H L , et al . ConsistentEE: A consistent and hardness-guided early exiting method for accelerating language models inference [J ] . Proceedings of the AAAI Conference on Artificial Intelligence , 2024 , 38 ( 17 ): 19506 - 19514 . DOI: 10.1609/aaai.v38i17.29922 http://dx.doi.org/10.1609/aaai.v38i17.29922
Del Corro L , Del Giorno A , Agarwal S , et al . SkipDecode: Autoregressive skip decoding with batching and caching for efficient LLM inference [PP/OL ] . V1. arXiv ( 2023-07-05 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2307.02628 https://doi.org/10.48550/arXiv.2307.02628 .
Dao T . FlashAttention-2: Faster attention with better parallelism and work partitioning [PP/OL ] . V1. arXiv ( 2023-07-17 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2307.08691 https://doi.org/10.48550/arXiv.2307.08691 .
Bikshandi G , Dao T , Ramani P , et al . FlashAttention-3: Fast and accurate attention with asynchrony and low-precision [C ] // Advances in Neural Information Processing Systems 37 . Neural Information Processing Systems Foundation, Inc. (NeurIPS) , 2024 : 68658 - 68685 . DOI: 10.52202/079017-2193 http://dx.doi.org/10.52202/079017-2193
Xia H J , Zheng Z , Li Y C , et al . Flash-LLM: Enabling cost-effective and highly-efficient large generative model inference with unstructured sparsity [PP/OL ] . V1. arXiv ( 2023-09-19 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2309.10285 https://doi.org/10.48550/arXiv.2309.10285 .
Sheng Y , Zheng L M , Yuan B H , et al . FlexGen: High-throughput generative inference of large language models with a single GPU [PP/OL ] . V2. arXiv ( 2023-06-12 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2303.06865 https://doi.org/10.48550/arXiv.2303.06865 .
Xue Z L , Song Y X , Mi Z Y , et al . PowerInfer-2: Fast large language model inference on a smartphone [PP/OL ] . V3. arXiv ( 2024-12-12 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2406.06282 https://doi.org/10.48550/arXiv.2406.06282 .
Liu L , Yang X Y , Shen Y , et al . Think-in-memory: Recalling and post-thinking enable LLMs with long-term memory [PP/OL ] . V1. arXiv ( 2023-11-15 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2311.08719 https://doi.org/10.48550/arXiv.2311.08719 .
Brown T B , Mann B , Ryder N , et al . Language models are few-shot learners [PP/OL ] . V4. arXiv ( 2020-07-22 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2005.14165 https://doi.org/10.48550/arXiv.2005.14165 .
Devlin J , Chang M W , Lee K , et al . BERT: Pre-training of deep bidirectional transformers for language understanding [C ] // Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) . Kerrville : Association for Computational Linguistics , 2019 : 4171 - 4186 . DOI: 10.18653/v1/n19-1423 http://dx.doi.org/10.18653/v1/n19-1423
Bai Y T , Jones A , Ndousse K , et al . Training a helpful and harmless assistant with reinforcement learning from human feedback [PP/OL ] . V1. arXiv ( 2022-04-12 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2204.05862 https://doi.org/10.48550/arXiv.2204.05862 .
Ouyang L , Wu J , Xu J , et al . Training language models to follow instructions with human feedback [PP/OL ] . V1. arXiv ( 2022-03-04 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2203.02155 https://doi.org/10.48550/arXiv.2203.02155 .
Chen T X , Tan Z T , Gong T , et al . Llama SLayer 8B: Shallow layers hold the key to knowledge injection [PP/OL ] . V1. arXiv ( 2024-10-03 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2410.02330 https://doi.org/10.48550/arXiv.2410.02330 .
Hu E J , Shen Y L , Wallis P , et al . LoRA: Low-rank adaptation of large language models [PP/OL ] . V2. arXiv ( 2021-10-16 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2106.09685 https://doi.org/10.48550/arXiv.2106.09685 .
Mitchell E , Lin C , Bosselut A , et al . Memory-based model editing at scale [PP/OL ] . V1. arXiv ( 2022-06-13 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2206.06520 https://doi.org/10.48550/arXiv.2206.06520 .
Dong Q X , Dai D M , Song Y F , et al . Calibrating factual knowledge in pretrained language models [PP/OL ] . V2. arXiv ( 2022-10-18 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2210.03329 https://doi.org/10.48550/arXiv.2210.03329 .
Cheng X , Lin Y K , Chen X Y , et al . Decouple knowledge from parameters for plug-and-play language modeling [PP/OL ] . V2. arXiv ( 2023-09-18 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.11564 https://doi.org/10.48550/arXiv.2305.11564 .
Hartvigsen T , Sankaranarayanan S , Palangi H , et al . Aging with GRACE: Lifelong model editing with discrete key-value adaptors [PP/OL ] . V5. arXiv ( 2023-10-18 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2211.11031 https://doi.org/10.48550/arXiv.2211.11031 .
Chen H , Pasunuru R , Weston J , et al . Walking down the memory maze: Beyond context limit through interactive reading [PP/OL ] . V1. arXiv ( 2023-10-08 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2310.05029 https://doi.org/10.48550/arXiv.2310.05029 .
Edge D , Trinh H , Cheng N , et al . From local to global: A graph RAG approach to query-focused summarization [PP/OL ] . V2. arXiv ( 2025-02-19 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2404.16130 https://doi.org/10.48550/arXiv.2404.16130 .
Guo Z R , Xia L H , Yu Y H , et al . LightRAG: Simple and fast retrieval-augmented generation [PP/OL ] . V3. arXiv ( 2025-04-28 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2410.05779 https://doi.org/10.48550/arXiv.2410.05779 .
Rasmussen P , Paliychuk P , Beauvais T , et al . Zep: A temporal knowledge graph architecture for agent memory [PP/OL ] . V1. arXiv ( 2025-01-20 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2501.13956 https://doi.org/10.48550/arXiv.2501.13956 .
Xu W J , Liang Z J , Mei K , et al . A-MEM: Agentic memory for LLM agents [PP/OL ] . V11. arXiv ( 2025-10-08 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2502.12110 https://doi.org/10.48550/arXiv.2502.12110 .
Ocker F , Deigmöller J , Smirnov P , et al . A grounded memory system for smart personal assistants [PP/OL ] . V1. arXiv ( 2025-05-09 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2505.06328 https://doi.org/10.48550/arXiv.2505.06328 .
Liang X , Niu S M , li Z Y , et al . Empowering large language models to set up a knowledge retrieval indexer via self-learning [PP/OL ] . V1. arXiv ( 2024-05-27 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2405.16933 https://doi.org/10.48550/arXiv.2405.16933 .
Yang H K , Lin Z H , Wang W J , et al . Memory 3 : Language modeling with explicit memory [PP/OL ] . V1. arXiv ( 2024-07-01 )[ 2015-10-10 ] . https://doi.org/10.48550/arXiv.2407.01178 https://doi.org/10.48550/arXiv.2407.01178 .
Park J S , O'Brien J , Cai C J , et al . Generative agents: Interactive simulacra of human behavior [C ] // Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology . New York : ACM , 2023 : 1 - 22 . DOI: 10.1145/3586183.3606763 http://dx.doi.org/10.1145/3586183.3606763
Sun H T , Zhuang Y C , Kong L K , et al . AdaPlanner: Adaptive planning from feedback with language models [PP/OL ] . V1. arXiv ( 2023-05-26 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.16653 https://doi.org/10.48550/arXiv.2305.16653 .
Wang G Z , Xie Y Q , Jiang Y F , et al . Voyager: An open-ended embodied agent with large language models [PP/OL ] . V2. arXiv ( 2023-10-19 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2305.16291 https://doi.org/10.48550/arXiv.2305.16291 .
Wu J L , Zhong X R , Sun J S , et al . Structure-R1: Dynamically leveraging structural knowledge in LLM reasoning through reinforcement learning [PP/OL ] . V1. arXiv ( 2025-10-16 )[ 2025-10-20 ] . https://doi.org/10.48550/arXiv.2510.15191 https://doi.org/10.48550/arXiv.2510.15191 .
Wang W Z , Dong L , Cheng H , et al . Augmenting language models with long-term memory [PP/OL ] . V1. arXiv ( 2023-06-12 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2306.07174 https://doi.org/10.48550/arXiv.2306.07174 .
Huang L , Lan H Z , Sun Z J , et al . Emotional RAG: Enhancing role-playing agents through emotional retrieval [C ] // 2024 IEEE International Conference on Knowledge Graph . Piscataway : IEEE , 2024 : 120 - 127 . DOI: 10.1109/ickg63256.2024.00023 http://dx.doi.org/10.1109/ickg63256.2024.00023
Wang Y , Gao Y F , Chen X S , et al . MEMORYLLM: Towards self-updatable large language models [PP/OL ] . V2. arXiv ( 2024-05-26 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2402.04624 https://doi.org/10.48550/arXiv.2402.04624 .
Shinn N , Cassano F , Berman E , et al . Reflexion: Language agents with verbal reinforcement learning [PP/OL ] . V4. arXiv ( 2023-10-10 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2303.11366 https://doi.org/10.48550/arXiv.2303.11366 .
Zhong W J , Guo L H , Gao Q Q , et al . MemoryBank: Enhancing large language models with long-term memory [J ] . Proceedings of the AAAI Conference on Artificial Intelligence , 2024 , 38 ( 17 ): 19724 - 19731 . DOI: 10.1609/aaai.v38i17.29946 http://dx.doi.org/10.1609/aaai.v38i17.29946
Kang J Z , Ji M M , Zhao Z , et al . Memory OS of AI agent [PP/OL ] . V1. arXiv ( 2025-05-30 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2506.06326 https://doi.org/10.48550/arXiv.2506.06326 .
Li Z Y , Xi C Y , Li C Y , et al . MemOS: A memory OS for AI system [PP/OL ] . V3. arXiv ( 2025-08-05 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2507.03724 https://doi.org/10.48550/arXiv.2507.03724 .
Wu Q Y , Bansal G , Zhang J Y , et al . AutoGen: Enabling next-gen LLM applications via multi-agent conversation [PP/OL ] . V2. arXiv ( 2023-10-03 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2308.08155 https://doi.org/10.48550/arXiv.2308.08155 .
Wang Y , Krotov D , Hu Y Z , et al . M+: Extending MemoryLLM with scalable long-term memory [PP/OL ] . V2. arXiv ( 2025-05-30 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2502.00592 https://doi.org/10.48550/arXiv.2502.00592 .
Zhang Z Y , Dai Q Y , Bo X H , et al . A survey on the memory mechanism of large language model-based agents [J ] . ACM Transactions on Information Systems , 2025 , 43 ( 6 ): 1 - 47 . DOI: 10.1145/3748302 http://dx.doi.org/10.1145/3748302
Zeng R H , Fang J Y , Liu S W , et al . On the structural memory of LLM agents [PP/OL ] . V1. arXiv ( 2024-12-17 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2412.15266 https://doi.org/10.48550/arXiv.2412.15266 .
Tang X R , Hu T Y , Ye M Y , et al . ChemAgent: Self-updating library in large language models improves chemical reasoning [PP/OL ] . V1. arXiv ( 2025-01-11 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2501.06590 https://doi.org/10.48550/arXiv.2501.06590 .
Zhang Z Y , Dai Q Y , Chen X , et al . MemEngine: A unified and modular library for developing advanced memory of LLM-based agents [C ] // Proceedings of the ACM on Web Conference 2025 . New York : ACM , 2025 : 821 - 824 . DOI: 10.1145/3701716.3715299 http://dx.doi.org/10.1145/3701716.3715299
Miller M . Trends, challenge, and shifts in software vulnerability mitigation [C ] // BlueHat IL Conference . Tel Aviv : Microsoft , 2019 : 10 .
Klabnik S , Nichols C , Krycho C . The rust programming language [M ] . San Francisco : No Starch Press , 2018 .
Li S W , Sato H . W-kernel: An OS kernel architecture designed with isolation and customizability [C ] // Proceedings of the 2023 5th International Conference on Software Engineering and Development . New York : ACM , 2024 : 42 - 50 . DOI: 10.1145/3637792.3637796 http://dx.doi.org/10.1145/3637792.3637796
Gu R , Shao Z , Chen H , et al . CertiKOS: An extensible architecture for building certified concurrent OS kernels [C ] // 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16) . Berkeley : USENIX Association , 2016 : 653 - 669 .
Agache A , Brooker M , Iordache A , et al . Firecracker: Lightweight virtualization for serverless applications [C ] // 17th USENIX symposium on networked systems design and implementation (NSDI 20) . Berkeley : USENIX Association , 2020 : 419 - 434 .
Liguori A . The nitro project-next generation aws infrastructure [EB/OL ] . [ 2025-10-10 ] . https://www.old.hotchips.org/hc31/HC31_T1_AWS_Nitro_Hot_Chips_20190818-2.pdf https://www.old.hotchips.org/hc31/HC31_T1_AWS_Nitro_Hot_Chips_20190818-2.pdf .
Amazon Web Services . Bottlerocket [EB/OL ] . [ 2025-08-18 ] . https://bottlerocket.dev/en/ https://bottlerocket.dev/en/ .
Android Open Source Project . Android virtualization framework [EB/OL ] . ( 2025-03-04 )[ 2025-08-19 ] . https://source.android.com/docs/core/virtualization https://source.android.com/docs/core/virtualization .
Li Z F , Narayanan V , Chen X D , et al . Rust for linux: Understanding the security impact of rust in the linux kernel [C ] // 2024 Annual Computer Security Applications Conference . Piscataway : IEEE , 2024 : 548 - 562 . DOI: 10.1109/acsac63791.2024.00054 http://dx.doi.org/10.1109/acsac63791.2024.00054
Levy A , Campbell B , Ghena B , et al . The tock embedded operating system [C ] // Proceedings of the 15th ACM Conference on Embedded Network Sensor Systems . New York : ACM , 2017 : 1 - 2 . DOI: 10.1145/3131672.3136988 http://dx.doi.org/10.1145/3131672.3136988
Levy A , Campbell B , Ghena B , et al . Multiprogramming a 64kB computer safely and efficiently [C ] // Proceedings of the 26th Symposium on Operating Systems Principles . New York : ACM , 2017 : 234 - 251 .
Redox Devel Opers . Redox OS [EB/OL ] . ( 2024-05-31 )[ 2025-08-26 ] . https://www.redox-os.org/ https://www.redox-os.org/ .
The Rustc Dev Guide Contributors . The MIR (Mid-level IR) [EB/OL ] . ( 2024-08-15 )[ 2025-08-26 ] . https://rustc-dev-guide.rust-lang.org/mir/index.html https://rustc-dev-guide.rust-lang.org/mir/index.html .
Astrauskas V , Müller P , Poli F , et al . Leveraging rust types for modular specification and verification [J ] . Proceedings of the ACM on Programming Languages , 2019 , 3 : 1 - 30 . DOI: 10.1145/3360573 http://dx.doi.org/10.1145/3360573
VanHattum A , Schwartz-Narbonne D , Chong N , et al . Verifying dynamic trait objects in rust [C ] // 2022 IEEE/ACM 44th International Conference on Software Engineering: Software Engineering in Practice . Piscataway : IEEE , 2022 : 321 - 330 . DOI: 10.1109/icse-seip55303.2022.9794041 http://dx.doi.org/10.1109/icse-seip55303.2022.9794041
Lattuada A , Hance T , Cho C , et al . Verus: Verifying rust programs using linear ghost types [J ] . Proceedings of the ACM on Programming Languages , 2023 , 7 (OOPSLA 1 ): 286 - 315 . DOI: 10.1145/3586037 http://dx.doi.org/10.1145/3586037
Lattuada A , Hance T , Bosamiya J , et al . Verus: A practical foundation for systems verification [C ] // Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles . New York : ACM , 2024 : 438 - 454 . DOI: 10.1145/3694715.3695952 http://dx.doi.org/10.1145/3694715.3695952
Klein G , Elphinstone K , Heiser G , et al . seL4: Formal verification of an OS kernel [C ] // Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles . New York : ACM , 2009 : 207 - 220 . DOI: 10.1145/1629575.1629596 http://dx.doi.org/10.1145/1629575.1629596
Nipkow T , Wenzel M , Paulson L C . Isabelle/HOL [M ] . Berlin : Springer , 2002 . DOI: 10.1007/3-540-45949-9 http://dx.doi.org/10.1007/3-540-45949-9
Gu L , Vaynberg A , Ford B , et al . CertiKOS: A certified kernel for secure cloud computing [C ] // Proceedings of the Second Asia-Pacific Workshop on Systems . New York : ACM , 2011 : 1 - 5 . DOI: 10.1145/2103799.2103803 http://dx.doi.org/10.1145/2103799.2103803
Paulin-Mohring C . Inductive definitions in the system Coq rules and properties [M ] // Typed Lambda Calculi and Applications . Berlin/Heidelberg : Springer-Verlag , 2006 : 328 - 345 .
Nelson L , Sigurbjarnarson H , Zhang K Y , et al . Hyperkernel: Push-button verification of an OS kernel [C ] // Proceedings of the 26th Symposium on Operating Systems Principles . New York : ACM , 2017 : 252 - 269 . DOI: 10.1145/3132747.3132748 http://dx.doi.org/10.1145/3132747.3132748
de Moura L , Bjørner N . Z 3 : An efficient SMT solver [M ] // Tools and Algorithms for the Construction and Analysis of Systems . Berlin, HeidelbergSpringer2008: 337 - 340 . DOI: 10.1007/978-3-540-78800-3_24 http://dx.doi.org/10.1007/978-3-540-78800-3_24
Li S W , Li X , Gu R , et al . Formally verified memory protection for a commodity multiprocessor hypervisor [C ] // 30th USENIX Security Symposium (USENIX Security 21) . Berkeley : USENIX Association , 2021 : 3953 - 3970 .
Li S W , Li X P , Gu R H , et al . A secure and formally verified linux KVM hypervisor [C ] // 2021 IEEE Symposium on Security and Privacy . Piscataway : IEEE , 2021 : 1782 - 1799 . DOI: 10.1109/sp40001.2021.00049 http://dx.doi.org/10.1109/sp40001.2021.00049
Bai Y K , Li P N , Huang Y B , et al . HyperTEE: A decoupled TEE architecture with secure enclave management [C ] // 2024 57th IEEE/ACM International Symposium on Microarchitecture . Piscataway : IEEE , 2024 : 105 - 120 .
Ling Z , Yan H Y , Shao X H , et al . Secure boot, trusted boot and remote attestation for ARM TrustZone-based IoT Nodes [J ] . Journal of Systems Architecture , 2021 , 119 : 102240 . DOI: 10.1016/j.sysarc.2021.102240 http://dx.doi.org/10.1016/j.sysarc.2021.102240
Jiang J H , Chen C L , Feng S L , et al . Embodied intelligence: The key to unblocking generalized artificial intelligence [PP/OL ] . V1. arXiv ( 2025-05-11 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2505.06897 https://doi.org/10.48550/arXiv.2505.06897 .
Kuipers B , Feigenbaum E A , Hart P E , et al . Shakey: From conception to history [J ] . AI Magazine , 2017 , 38 ( 1 ): 88 - 103 . DOI: 10.1609/aimag.v38i1.2716 http://dx.doi.org/10.1609/aimag.v38i1.2716
Fikes R E , Nilsson N J . Strips: A new approach to the application of theorem proving to problem solving [J ] . Artificial Intelligence , 1971 , 2 ( 3/4 ): 189 - 208 . DOI: 10.1016/0004-3702(71)90010-5 http://dx.doi.org/10.1016/0004-3702(71)90010-5
Brooks R . A robust layered control system for a mobile robot [J ] . IEEE Journal on Robotics and Automation , 1986 , 2 ( 1 ): 14 - 23 . DOI: 10.1109/jra.1986.1087032 http://dx.doi.org/10.1109/jra.1986.1087032
Gat E . Three-layer architectures [C ] // Artificial Intelligence and Mobile Robots . New York : ACM , 1998 : 195 - 210 . DOI: 10.1016/b978-0-08-049945-1.50034-4 http://dx.doi.org/10.1016/b978-0-08-049945-1.50034-4
Metta G , Fitzpatrick P , Natale L . YARP: Yet another robot platform [J ] . International Journal of Advanced Robotic Systems , 2006 , 3 : 8 . DOI: 10.5772/5761 http://dx.doi.org/10.5772/5761
Quigley M , Gerkey B , Conley K , et al . ROS: An open-source Robot Operating System [EB/OL ] . [ 2025-10-10 ] . http://www.robotics.stanford.edu/~ang/papers/icraoss09-ROS.pdf http://www.robotics.stanford.edu/~ang/papers/icraoss09-ROS.pdf .
Macenski S , Foote T , Gerkey B , et al . Robot Operating System 2: Design, architecture, and uses in the wild [J ] . Science Robotics , 2022 , 7 ( 66 ): eabm6074 . DOI: 10.1126/scirobotics.abm6074 http://dx.doi.org/10.1126/scirobotics.abm6074
Kim D S , Tran-Dang H . An overview on industrial Internet of Things [M ] // Industrial Sensors and Controls in Communication Networks: From Wired Technologies to Cloud Computing and the Internet of Things . Cham : Springer International Publishing , 2019 : : 207 - 216 . DOI: 10.1007/978-3-030-04927-0_16 http://dx.doi.org/10.1007/978-3-030-04927-0_16
Hooda R , Kumar A , Shivani , et al . Industrial Internet of Things: An analysis of emergence, component and challenges [J ] . International Journal on Recent and Innovation Trends in Computing and Communication , 2023 , 11 ( 10 ): 2010 - 2017 . DOI: 10.17762/ijritcc.v11i10.8884 http://dx.doi.org/10.17762/ijritcc.v11i10.8884
Mishra S , Tyagi A K , Maity I , et al . Design and implementation of internet of everything's business platform ecosystem [EB/OL ] . [ 2025-10-10 ] . https://d1wqtxts1xzle7.cloudfront.net/104128678/207d8ad0-be85-4864-8fb2-d286176b3074-libre.pdf https://d1wqtxts1xzle7.cloudfront.net/104128678/207d8ad0-be85-4864-8fb2-d286176b3074-libre.pdf . DOI: 10.21203/rs.3.rs-402461/v1 http://dx.doi.org/10.21203/rs.3.rs-402461/v1
Abunahla H , Gadhafi R , Mohammad B , et al . Integrated graphene oxide resistive element in tunable RF filters [J ] . Scientific Reports , 2020 , 10 : 13128 . DOI: 10.1038/s41598-020-70041-x http://dx.doi.org/10.1038/s41598-020-70041-x
Krutwig M C . Suitability of OPC UA for distributed energy monitoring [J ] . Proceedings of the International Conference on Business Excellence , 2019 , 13 ( 1 ): 399 - 410 . DOI: 10.2478/picbe-2019-0035 http://dx.doi.org/10.2478/picbe-2019-0035
Sufian A T , Abdullah B M , Ateeq M , et al . Six-gear roadmap towards the smart factory [J ] . Applied Sciences , 2021 , 11 ( 8 ): 3568 . DOI: 10.3390/app11083568 http://dx.doi.org/10.3390/app11083568
Xing Z H , Lan Y Q , Sun Z J , et al . IoT OS platform: Software infrastructure for next-gen industrial IoT [J ] . Applied Sciences , 2024 , 14 ( 13 ): 5370 . DOI: 10.3390/app14135370 http://dx.doi.org/10.3390/app14135370
Cao D G , Xue D L , Ma Z Y , et al . XiUOS: An open-source ubiquitous operating system for industrial Internet of Things [J ] . Science China Information Sciences , 2022 , 65 : 117101 . DOI: 10.1007/s11432-021-3294-y http://dx.doi.org/10.1007/s11432-021-3294-y
Krajci I , Cummings D . History and evolution of the Android OS [M ] // Android on x86: An Introduction to Optimizing for Intel Architecture . Berkeley, CA : Apress , 2013 : 1 - 8 . DOI: 10.1007/978-1-4302-6131-5_1 http://dx.doi.org/10.1007/978-1-4302-6131-5_1
Pan S D , Guo T C , Zhang L H , et al . A large-scale investigation of semantically incompatible APIs behind compatibility issues in Android apps [PP/OL ] . V2. arXiv ( 2024-06-26 )[ 2025-10-10 ] . https://doi.org/10.48550/arXiv.2406.17431 https://doi.org/10.48550/arXiv.2406.17431 .
Shokouh J . Detecting GNSS attacks on smartphones [EB/OL ] . ( 2013-07-31 )[ 2025-10-10 ] . https://www.diva-portal.org/smash/record.jsf pid=diva2%3A678010&dswid=-5192 https://www.diva-portal.org/smash/record.jsfpid=diva2%3A678010&dswid=-5192 .
Yuan J L , Yang C , Cai D Q , et al . Mobile foundation model as firmware [C ] // Proceedings of the 30th Annual International Conference on Mobile Computing and Networking . New York : ACM , 2024 : 279 - 295 . DOI: 10.1145/3636534.3649361 http://dx.doi.org/10.1145/3636534.3649361
Bhimanapati V B R , Goel D P , Aggarwal A . Integrating cloud services with mobile applications for seamless user experience [J ] . Darpan International Research Analysis , 2024 , 12 ( 3 ): 252 - 268 . DOI: 10.36676/dira.v12.i3.81 http://dx.doi.org/10.36676/dira.v12.i3.81
Atzori L , Girau R , Martis S , et al . A SIoT-aware approach to the resource management issue in mobile crowdsensing [C ] // 2017 20th Conference on Innovations in Clouds, Internet and Networks . Piscataway : IEEE , 2017 : 232 - 237 . DOI: 10.1109/icin.2017.7899418 http://dx.doi.org/10.1109/icin.2017.7899418
Cheng P , Lian X , Jian X , et al . FROG: A fast and reliable crowdsourcing framework [J ] . IEEE Transactions on Knowledge and Data Engineering , 2019 , 31 ( 5 ): 894 - 908 . DOI: 10.1109/tkde.2018.2849394 http://dx.doi.org/10.1109/tkde.2018.2849394
Diniz H B M , Silva E C G F , Nogueira T C C , et al . A reference architecture for mobile crowdsensing platforms [C ] // Proceedings of the 18th International Conference on Enterprise Information Systems . SCITEPRESS - Science and and Technology Publications , 2016 : 600 - 607 . DOI: 10.5220/0005837606000607 http://dx.doi.org/10.5220/0005837606000607
Liu Y M , Yu Z W , Guo B , et al . CrowdOS: A ubiquitous operating system for crowdsourcing and mobile crowd sensing [J ] . IEEE Transactions on Mobile Computing , 2022 , 21 ( 3 ): 878 - 894 . DOI: 10.1109/tmc.2020.3015750 http://dx.doi.org/10.1109/tmc.2020.3015750
Kuenzer S , Bădoiu V A , Lefeuvre H , et al . Unikraft: Fast, specialized unikernels the easy way [C ] // Proceedings of the Sixteenth European Conference on Computer Systems . New York : ACM , 2021 : 376 - 394 . DOI: 10.1145/3447786.3456248 http://dx.doi.org/10.1145/3447786.3456248
Madhavapeddy A , Mortier R , Rotsos C , et al . Unikernels: Library operating systems for the cloud [J ] . ACM SIGARCH Computer Architecture News , 2013 , 41 ( 1 ): 461 - 472 . DOI: 10.1145/2490301.2451167 http://dx.doi.org/10.1145/2490301.2451167
Boos K , Liyanage N , Ijaz R , et al . Theseus: an experiment in operating system structure and state management [C ] // 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20) . USENIX Association , 2020 : 1 - 19 .
刘伟超 , 王国柱 , 黄琛 , 等 . 操作系统的多内核适配装置、方法及工业物联网操作系统 : CN202410248575.9 [P ] . 2024-07-16 .
Liu Weichao , Wang Guozhu , Chen Hhang , et al . Multi-kernel adaptation device, method for operating system and industrial internet of things operating system : CN117827277B [P ] . 2024-07-16 . (in Chinese)
沈昌祥 . 自主可信计算筑牢人工智能安全底座 [J ] . 中国科技产业 , 2025 ( 1 ): 5 .
Shen Changxiang . Independent trusted computing to build a safe base of artificial intelligence [J ] . Science & Technology Industry of China , 2025 ( 1 ): 5 . (in Chinese)
0
浏览量
34
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621