Abstract
This study evaluates two search methodologies—Hybrid Search and Semantic Search—withinaRetrieval-AugmentedGeneration(RAG)frameworkforanE-Learning use case. The goal was to enhance the accuracy and efficiency of Large Language Models (LLMs), such as GPT-4, through advanced Prompt Engineering Techniques and optimized retrieval processes. Thus, efficient search and chunking methods are essential for improving thequality ofsystem-generatedanswers. UsingtheEvaluation frameworkforRetrieval-AugmentedGeneration(RAG)pipelines, theRagasframework as our testing framework, we measured five key metrics: answer correctness, context recall, context precision, faithfulness, and answer relevancy. The dataset utilized in this study comprises question-answer pairs, with the answers established as ground truth, derived from educational sources such as textbooks, research papers, and lectures with over 215 pages of highly complex theoretical and practical learning material. In order to evaluate the chunking and search methodologies the Ragas testing framework dataset covers 57 questions out of the used educational material related to generative AI concepts and prompt engineering techniques. These source documents were pre-processed into smaller, manageable chunks and indexed using both vector embeddings and keyword-based indexing, aimed at facilitating efficient retrieval and improving response accuracy. The ground truth constituted the benchmark for assessing the performance of the Ragas testing framework. The AI model used for embeddings, OpenAI’s text-embedding-ada-002, generated highdimensional representations to capture deep semantic meanings. The study tested three chunking strategies (Token-Based, Recursive, and BERT-based) and compared the search methods using statistical analyses like ANOVA and paired t-tests. The results show that Hybrid Search consistently outperformed Semantic Search across all metrics. However, the effect size (Cohen’s d 0.11) indicated that the practical difference was negligible. Token-Based Chunking underperformed in Context Recall compared to BERT-based and Recursive Chunking. These findings offer valuable insights for optimizing RAG systems in E-Learning, with future directions focusing on continuously improving chunking techniques and integrating long-context LLMs for enhanced scalability and accuracy.
| Original language | English |
|---|---|
| Pages | 1889–1898 |
| Number of pages | 10 |
| DOIs | |
| Publication status | Published - Dec 2024 |
| Event | AHFE International Conference on Human Factors in Design, Engineering, and Computing - Hawaii Duration: 8 Dec 2024 → 10 Dec 2024 Conference number: 2024 https://www.hawaii.ahfe.org/ |
Conference
| Conference | AHFE International Conference on Human Factors in Design, Engineering, and Computing |
|---|---|
| Abbreviated title | AHFE |
| Period | 08.12.2024 → 10.12.2024 |
| Internet address |
Keywords
- Advanced chunking
- E-learning
- Generative AI
- Hybrid search
- Large language models (LLMS)
- Prompt engineering techniques
- Retrieval-augmented generation (RAG)
- Semantic search
Fingerprint
Dive into the research topics of 'Advanced Methods for Improved Retrieval-Augmented Generation (RAG) System Performance in E-Learning'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver