ZHU Junyi,ZHU Shangming.Development of local knowledge base application using retrieval augmented generation technology[J].Journal on Communications,2024,45(Z2):242-247.
ZHU Junyi,ZHU Shangming.Development of local knowledge base application using retrieval augmented generation technology[J].Journal on Communications,2024,45(Z2):242-247. DOI: 10.11959/j.issn.1000-436x.2024227.
Development of local knowledge base application using retrieval augmented generation technology
Retrieval Augmented Generation (RAG) technology can enable large language models to access external knowledge bases by introducing external documents
thereby large language models can generate more authentic and reliable answers
and effectively solve the problems of outdated data and insufficient corpus. On the basis of introducing the basic architecture and fine-tuning techniques of large language models
the application framework of using retrieval enhanced generation technology to build a local knowledge base system was discussed. The application framework consisted of six parts: loading local documents
splitting documents
embedding splitting fragments
matching text based on questions
constructing prompts
and generating responses. Finally
based on the ERNIE-4.0 model and the AppBuilder development platform
an intelligent question answering system for campus information services was designed and developed
SHU W T , LI R X , SUN T X , et al . Large language models: principles, implementation, and progress [J ] . Journal of Computer Research and Development , 2024 , 61 ( 2 ): 351 - 361 .
TAO J H , NIE S , CHE F H . The evolution and inspiration of large language model technology [J ] . Bulletin of National Natural Science Foundation of China , 2023 , 37 ( 5 ): 767 - 775 .
XU Y M , HU L , ZHAO J Y , et al . Research progress and enlightenment of large language models on multi-lingual intelligence [J ] . Journal of Computer Applications , 2023 , 43 ( S2 ): 1 - 8 .
LUO W , WANG H F . Evaluating large language models: a survey of research progress [J ] . Journal of Chinese Information Processing , 2024 , 38 ( 1 ): 1 - 23 .
ZHAO Y , HE J W , ZHU S C , et al . Security of large language models: current status and challenges [J ] . Computer Science , 2024 , 51 ( 1 ): 68 - 71 .
OMRANI P , HOSSEINI A , HOOSHANFAR K , et al . Hybrid retrieval-augmented generation approach for LLMs query response enhancement [C ] // Proceedings of the 2024 10th International Conference on Web Research (ICWR) . Piscataway : IEEE Press , 2024 : 22 - 26 .
VASWANI A , SHAZEER N , PARMAR N , et al . Attention is all you need [C ] // Proceedings of the 30th Annual Conference on Neural Information Processing Systems . New York : Curran Associates , 2017 : 5990 - 6008 .
BROWN T , MANN B , RYDER N , et al . Language models are few-shot learners [J ] . Advances in Neural Information Processing Systems , 2020 , 33 : 1877 - 1901 .