base_model: vilm/vinallama-7b-chat
library_name: peft
license: llama2
datasets:
- nluai/dataset_dhnl_qna_v2
language:
- vi
tags:
- vietnamese
- academic
- regulations
- nlu
pipeline_tag: text-generation
chatbot-dhnl-v3
Introduction 🎉
Large Language Models (LLMs) are increasingly important for complex natural language processing tasks, but they remain limited when generating text about custom, domain-specific datasets. A Vietnamese question-answering system that helps students at Nong Lam University, Ho Chi Minh City, with the university's academic regulations is therefore a practical and important application. This study investigates methods for preprocessing Vietnamese data and fine-tuning LLMs to fit the language and content of those regulations. The research team also constructed a dataset of the university's academic regulations and developed a Vietnamese text generation service that answers questions about this dataset; the service is integrated into a chat website.
Developed by:
- Nguyễn Đăng Phước
- Vũ Ngọc Thanh Trúc
Model type: Transformer-based text-generation model with over 7B parameters
Languages (NLP): Primarily Vietnamese with multilingual capabilities
Fine-tuned from: vilm/vinallama-7b-chat, using the nluai/dataset_dhnl_qna_v2 dataset