xl-zhao commited on
Commit
ee7e331
·
verified ·
1 Parent(s): 81206de

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +78 -3
README.md CHANGED
@@ -1,3 +1,78 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ language:
4
+ - en
5
+ ---
6
+ # SubgoalXL - Subgoal-Based Expert Learning for Theorem Proving
7
+
8
+ ## Model Description
9
+
10
+ SubgoalXL is an advanced model for formal theorem proving, leveraging subgoal-based expert learning to enhance LLMs' formal reasoning capabilities. It addresses the challenges of scarce theorem-proving data and multi-step reasoning by optimizing data efficiency and employing subgoal-level supervision. SubgoalXL iteratively refines formal statement, proof, and subgoal generators, allowing it to extract richer information from limited proofs and generate more precise formal proofs.
11
+
12
+ ### Example Usage
13
+
14
+ Below is an example code for using SubgoalXL with its custom prompt template for generating formal proofs:
15
+
16
+ ```python
17
+ from transformers import AutoModelForCausalLM, AutoTokenizer
18
+
19
+ # Load the model and tokenizer
20
+ tokenizer = AutoTokenizer.from_pretrained("xl-zhao/formal_proof_generator_v1_iter3")
21
+ model = AutoModelForCausalLM.from_pretrained("xl-zhao/formal_proof_generator_v1_iter3")
22
+
23
+ informal_statement = "" # Your informal statement here
24
+ formal_statement = "" # The formal statement generated by the model will be inserted here
25
+
26
+ # Prompt option 1
27
+ prompt1 = f"Develop formal proofs using Isabelle, leveraging auxiliary tools such as Sledgehammer to enhance the proving process.\n\n### Input\n(*Informal Statement:\n{informal_statement}*)\n{formal_statement}\n\n### Output"
28
+
29
+ # Prompt option 2
30
+ prompt2 = f"Utilize Isabelle for the systematic verification of theorem proofs, employing Sledgehammer as a supplementary tool. Approach the problem in a step-by-step manner.\n\n### Problem\n{informal_statement}\n\n### Isabelle Proof\n{formal_statement}"
31
+
32
+ # Choose the prompt (both prompts are acceptable)
33
+ prompt = prompt1 # You can switch to prompt2 if needed
34
+
35
+ # Tokenize and generate the formal proof
36
+ inputs = tokenizer(prompt, return_tensors="pt")
37
+ outputs = model.generate(**inputs, max_new_tokens=2048, temperature=0.8)
38
+
39
+ # Decode the output
40
+ formal_proof = tokenizer.decode(outputs[0], skip_special_tokens=True)
41
+ print(formal_proof)
42
+ ```
43
+
44
+ ## Intended Use
45
+
46
+ This model is intended for automated theorem proving, especially in environments that require high levels of mathematical rigor, such as Isabelle. By employing subgoal-based proof generation and expert learning, SubgoalXL is optimized for solving complex reasoning tasks in formal mathematics.
47
+
48
+ ## Performance
49
+
50
+ SubgoalXL sets a new state-of-the-art benchmark in formal theorem proving within the Isabelle environment, achieving an accuracy of 56.1% on the miniF2F dataset, an absolute improvement of 4.9% over previous approaches. The model has successfully solved the following problems:
51
+
52
+ - 41 AMC12 problems
53
+ - 9 AIME problems
54
+ - 3 IMO problems
55
+
56
+ For more information on the model's design and performance, please refer to the paper [SubgoalXL: Subgoal-Based Expert Learning for Theorem Proving](https://arxiv.org/abs/2408.11172).
57
+
58
+ ## Limitations
59
+
60
+ While SubgoalXL excels in formal theorem proving, its usage is tailored to this specific domain and may not generalize well to other tasks beyond formal logic and proof generation.
61
+
62
+ ## Citation
63
+
64
+ If you use SubgoalXL in your research, please cite our paper:
65
+
66
+ ```
67
+ @article{zhao2024subgoalxl,
68
+ title = {SubgoalXL: Subgoal-based Expert Learning for Theorem Proving},
69
+ author = {Zhao, Xueliang and Zheng, Lin and Bo, Haige and Hu, Changran and Thakker, Urmish and Kong, Lingpeng},
70
+ journal={arXiv preprint arXiv:2408.11172},
71
+ url={https://arxiv.org/abs/2408.11172},
72
+ year = {2024},
73
+ }
74
+ ```
75
+
76
+ ## Contact
77
+
78
+ For more information, please visit the project's [GitHub repository](https://github.com/zhaoxlpku/SubgoalXL) or reach out via email at xlzhao22@connect.hku.hk.