Update README.md

Browse files

Files changed (1) hide show

README.md +9 -10

README.md CHANGED Viewed

@@ -34,11 +34,11 @@ inference: false
 This model is a fine-tuned version of [BramVanroy/GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra) on the [snoels/FinGEITje-sft](https://huggingface.co/datasets/snoels/FinGEITje-sft) dataset.
-## Model Description
 FinGEITje 7B is a large open Dutch financial language model with 7 billion parameters, based on Mistral 7B. It has been further trained on Dutch financial texts, enhancing its proficiency in the Dutch language and its knowledge of financial topics. As a result, FinGEITje provides more accurate and relevant responses in the domain of finance.
-## Training and Evaluation Data
 ### Training Data
@@ -57,7 +57,7 @@ The model was evaluated using:
 - **[snoels/FinDutchBench](https://huggingface.co/datasets/snoels/FinDutchBench)**: A Dutch financial benchmark dataset designed to assess the model's performance on various financial tasks.
-## Training Procedure
 FinGEITje was trained following the methodology described in the [Alignment Handbook](https://github.com/huggingface/alignment-handbook).
@@ -103,7 +103,7 @@ The evaluation package includes a set of metrics defined per task, grouped per d
 - **Datasets**: 2.14.6
 - **Tokenizers**: 0.15.2
-## How to Use
 FinGEITje 7B can be utilized using the Hugging Face Transformers library along with PEFT to load the LoRA adapters efficiently.
@@ -145,7 +145,7 @@ response = tokenizer.decode(outputs[0], skip_special_tokens=True)
 print(response)
 ```
-## Limitations and Future Work
 While FinGEITje 7B demonstrates significant improvements in understanding and generating Dutch financial content, certain limitations exist:
@@ -155,7 +155,6 @@ While FinGEITje 7B demonstrates significant improvements in understanding and ge
 - **Language Scope**: Primarily designed for Dutch; performance in other languages is not optimized.
 - **Ethical Use**: Users should ensure that the model's outputs comply with ethical standards and do not promote misinformation or harmful content.
 ### Future Work
 - **Data Updates**: Incorporate more recent and diverse financial datasets to keep the model up-to-date.
@@ -163,7 +162,7 @@ While FinGEITje 7B demonstrates significant improvements in understanding and ge
 - **Performance Enhancement**: Fine-tune on more specialized financial topics and complex financial tasks.
 - **Multilingual Expansion**: Extend support to other languages relevant to the financial sector in the Netherlands and Europe.
-## Acknowledgements
 We would like to thank:
@@ -171,7 +170,7 @@ We would like to thank:
 - **Bram Vanroy** ([GitHub](https://github.com/BramVanroy)) for creating [GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra), an open-source Dutch chat model, and for sharing training, translation, and evaluation resources.
 - **Contributors of the [Alignment Handbook](https://github.com/huggingface/alignment-handbook)** for providing valuable resources that guided the development and training process of FinGEITje.
-## Citation
 If you use FinGEITje in your work, please cite:
@@ -185,10 +184,10 @@ If you use FinGEITje in your work, please cite:
 }
 ```
-## License
 This model is licensed under the [Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/) license.
-## Contact
 For any inquiries or questions, please contact [Sander Noels](mailto:sander.noels@ugent.be).

 This model is a fine-tuned version of [BramVanroy/GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra) on the [snoels/FinGEITje-sft](https://huggingface.co/datasets/snoels/FinGEITje-sft) dataset.
+## 📖 Model Description
 FinGEITje 7B is a large open Dutch financial language model with 7 billion parameters, based on Mistral 7B. It has been further trained on Dutch financial texts, enhancing its proficiency in the Dutch language and its knowledge of financial topics. As a result, FinGEITje provides more accurate and relevant responses in the domain of finance.
+## 📊 Training and Evaluation Data
 ### Training Data
 - **[snoels/FinDutchBench](https://huggingface.co/datasets/snoels/FinDutchBench)**: A Dutch financial benchmark dataset designed to assess the model's performance on various financial tasks.
+## ⚙️ Training Procedure
 FinGEITje was trained following the methodology described in the [Alignment Handbook](https://github.com/huggingface/alignment-handbook).
 - **Datasets**: 2.14.6
 - **Tokenizers**: 0.15.2
+## 🛠️ How to Use
 FinGEITje 7B can be utilized using the Hugging Face Transformers library along with PEFT to load the LoRA adapters efficiently.
 print(response)
 ```
+## 🚧 Limitations and Future Work
 While FinGEITje 7B demonstrates significant improvements in understanding and generating Dutch financial content, certain limitations exist:
 - **Language Scope**: Primarily designed for Dutch; performance in other languages is not optimized.
 - **Ethical Use**: Users should ensure that the model's outputs comply with ethical standards and do not promote misinformation or harmful content.
 ### Future Work
 - **Data Updates**: Incorporate more recent and diverse financial datasets to keep the model up-to-date.
 - **Performance Enhancement**: Fine-tune on more specialized financial topics and complex financial tasks.
 - **Multilingual Expansion**: Extend support to other languages relevant to the financial sector in the Netherlands and Europe.
+## 🙏 Acknowledgements
 We would like to thank:
 - **Bram Vanroy** ([GitHub](https://github.com/BramVanroy)) for creating [GEITje-7B-ultra](https://huggingface.co/BramVanroy/GEITje-7B-ultra), an open-source Dutch chat model, and for sharing training, translation, and evaluation resources.
 - **Contributors of the [Alignment Handbook](https://github.com/huggingface/alignment-handbook)** for providing valuable resources that guided the development and training process of FinGEITje.
+## 📝 Citation
 If you use FinGEITje in your work, please cite:
 }
 ```
+## 📜 License
 This model is licensed under the [Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/) license.
+## 📧 Contact
 For any inquiries or questions, please contact [Sander Noels](mailto:sander.noels@ugent.be).