JIA Chunyan, FANG Weijie, XIE Yuwei, et al. Research on campus question answering system supported by retrieval-augmented generation technology[J]. Journal on Communications, 2024, 45(Z2): 248-254. DOI: 10.11959/j.issn.1000-436x.2024259.
Research on campus question answering system supported by retrieval-augmented generation technology
To help teachers and students find useful information within the vast amount of campus information, an intelligent campus question answering (QA) system based on retrieval-augmented generation (RAG) was designed. An approach to QA system construction that integrates large language models with domain knowledge was proposed, relying on the campus's "Everything You Need to Know" project and using campus information such as procedural guides, frequently asked questions, and normative documents as an external data corpus. A campus knowledge database was constructed with the RAG Infinity database. To improve the retrieval efficiency of domain knowledge and the accuracy of answers, a prompt-based approach was proposed. By applying RAG to campus QA, the system provides users with a range of service information interactively, which helps to resolve common campus issues, simplify the consultation process for teachers and students, alleviate the burden on campus management, and enrich campus knowledge resources.
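The retrieve-then-prompt flow described above can be illustrated with a minimal sketch. It is not the authors' implementation: the three-sentence corpus, the function names, and the prompt wording are all hypothetical, and a simple bag-of-words cosine similarity stands in for the vector search that a real knowledge database would perform. The sketch stops at building the prompt and makes no call to a language model.

```python
import math
import re
from collections import Counter

# Hypothetical mini-corpus standing in for campus guides, FAQs, and normative documents.
CORPUS = [
    "Library cards are issued at the service hall on weekdays.",
    "Students apply for a transcript through the academic affairs office.",
    "Campus network accounts are reset at the information center.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, corpus, k=2):
    """Return the k passages most similar to the question.

    Assumption: bag-of-words similarity replaces the embedding-based
    retrieval a production RAG system would use.
    """
    q = Counter(tokenize(question))
    ranked = sorted(
        corpus,
        key=lambda doc: cosine(q, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, passages):
    """Assemble a prompt that grounds the answer in the retrieved passages."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer the question using only the campus information below.\n"
        f"Campus information:\n{context}\n"
        f"Question: {question}\n"
        "Answer:"
    )


if __name__ == "__main__":
    question = "How do I apply for a transcript?"
    print(build_prompt(question, retrieve(question, CORPUS, k=1)))
```

In a full system, the string returned by `build_prompt` would be sent to a large language model, and the instruction to use only the supplied campus information is one simple way a prompt can steer the model toward the retrieved domain knowledge.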