richardr1126 commited on
Commit
c9d6c5c
1 Parent(s): 1b0e109

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -19,6 +19,21 @@ license: bigcode-openrail-m
19
  - Finetuning was performed using QLoRa on a single RTX6000 48GB.
20
  - If you want the merged model: [richardr1126/spider-natsql-wizard-coder-merged](https://huggingface.co/richardr1126/spider-natsql-wizard-coder-merged)
21
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
22
  ## Citation
23
 
24
  Please cite the repo if you use the data or code in this repo.
 
19
  - Finetuning was performed using QLoRa on a single RTX6000 48GB.
20
  - If you want the merged model: [richardr1126/spider-natsql-wizard-coder-merged](https://huggingface.co/richardr1126/spider-natsql-wizard-coder-merged)
21
 
22
+ ### Spider Dataset
23
+
24
+ [Spider](https://arxiv.org/abs/1809.08887) is a large-scale complex and cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale students
25
+ The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases.
26
+
27
+ This dataset was used to finetune this model.
28
+
29
+ ### NatSQL
30
+
31
+ [NatSQL](https://arxiv.org/abs/2109.05153) is an intermediate representation for SQL that simplifies the queries and reduces the mismatch between
32
+ natural language and SQL. NatSQL preserves the core functionalities of SQL, but removes some clauses and keywords
33
+ that are hard to infer from natural language descriptions. NatSQL also makes schema linking easier by reducing the
34
+ number of schema items to predict. NatSQL can be easily converted to executable SQL queries and can improve the
35
+ performance of text-to-SQL models.
36
+
37
  ## Citation
38
 
39
  Please cite the repo if you use the data or code in this repo.