---
license: mit
datasets:
- gbharti/finance-alpaca
- lavita/ChatDoctor-HealthCareMagic-100k
- laion/OIG
- openai/webgpt_comparisons
- taskydata/GPT4Tools
- DataProvenanceInitiative/cot_submix_original
- 0x70DA/stackoverflow-chat-data
language:
- en
library_name: adapter-transformers
pipeline_tag: text-classification
---
# Attempt to reproduce Mixture-of-LoRAs classifier
Paper: [Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models](https://arxiv.org/pdf/2403.03432)
## Datasets
We sample about 10k training examples and 2k validation examples from each dataset, so that all datasets are evenly represented.

From `laion/OIG`, only the following files were used:
- unified_merged_code_xp3.jsonl
- unified_grade_school_math_instructions.jsonl
- unified_mathqa_flanv2_kojma_cot.jsonl
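
The even per-dataset sampling described above could look roughly like the sketch below. It is not the code used for this model: the function names (`sample_split`, `build_classifier_data`), the fixed seed, and the choice to label each example with the index of its source dataset are all assumptions made for illustration. Loading the actual datasets is left out; the functions only expect a mapping from dataset name to a list of texts.

```python
import random
from typing import Dict, List, Sequence, Tuple

Example = Tuple[str, int]  # (text, label id of the source dataset)


def sample_split(
    examples: Sequence[str], n_train: int, n_val: int, seed: int = 0
) -> Tuple[List[str], List[str]]:
    """Draw disjoint train/validation samples from a single dataset."""
    rng = random.Random(seed)
    want = n_train + n_val
    # If the dataset is smaller than requested, take everything it has.
    picked = rng.sample(list(examples), min(len(examples), want))
    # Keep the requested train/validation proportion in either case.
    cut = len(picked) * n_train // want
    return picked[:cut], picked[cut:]


def build_classifier_data(
    datasets: Dict[str, Sequence[str]],
    n_train: int = 10_000,
    n_val: int = 2_000,
    seed: int = 0,
) -> Tuple[List[Example], List[Example]]:
    """Sample the same number of examples from every dataset.

    Assumption: the classifier target is the index of the source dataset,
    taken in the insertion order of `datasets`.
    """
    train: List[Example] = []
    val: List[Example] = []
    for label, (name, examples) in enumerate(datasets.items()):
        tr, va = sample_split(examples, n_train, n_val, seed=seed + label)
        train.extend((text, label) for text in tr)
        val.extend((text, label) for text in va)
    # Shuffle so that examples from one dataset are not contiguous.
    rng = random.Random(seed)
    rng.shuffle(train)
    rng.shuffle(val)
    return train, val
```

Sampling a fixed count per dataset, rather than a fixed fraction, keeps the classes balanced even though the source datasets differ widely in size.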