---
license: mit
datasets:
- gbharti/finance-alpaca
- lavita/ChatDoctor-HealthCareMagic-100k
- laion/OIG
- openai/webgpt_comparisons
- taskydata/GPT4Tools
- DataProvenanceInitiative/cot_submix_original
- 0x70DA/stackoverflow-chat-data
language:
- en
library_name: adapter-transformers
pipeline_tag: text-classification
---

# Attempt to reproduce Mixture-of-LoRAs classifier

Paper: [Mixture-of-LoRAs: An Efficient Multitask Tuning for Large Language Models](https://arxiv.org/pdf/2403.03432)

## Datasets

We evenly sample roughly 10k training examples and 2k validation examples from each dataset.
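
A minimal sketch of how this per-dataset subsampling could be done with the 🤗 `datasets` library; the shuffle seed is an assumption, and the sizes follow the figures above:

```python
from datasets import load_dataset

def sample_split(repo_id, n_train=10_000, n_val=2_000, seed=42, **load_kwargs):
    """Shuffle a hub dataset and carve out fixed-size train/validation subsets."""
    ds = load_dataset(repo_id, split="train", **load_kwargs).shuffle(seed=seed)
    return (
        ds.select(range(n_train)),                   # ~10k training examples
        ds.select(range(n_train, n_train + n_val)),  # ~2k validation examples
    )

# One source dataset per domain, e.g. the finance domain:
finance_train, finance_val = sample_split("gbharti/finance-alpaca")
```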

From `laion/OIG`, only the following files were used:
- unified_merged_code_xp3.jsonl
- unified_grade_school_math_instructions.jsonl
- unified_mathqa_flanv2_kojma_cot.jsonl
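
For reference, a sketch of loading just those three files with `datasets`; passing `data_files` restricts the download to the named jsonl files within the hub repo:

```python
from datasets import load_dataset

# Load only the three selected jsonl files from the laion/OIG repo.
oig = load_dataset(
    "laion/OIG",
    data_files=[
        "unified_merged_code_xp3.jsonl",
        "unified_grade_school_math_instructions.jsonl",
        "unified_mathqa_flanv2_kojma_cot.jsonl",
    ],
    split="train",
)
```

The resulting dataset can then be subsampled with the same `sample_split` helper sketched above.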