Update PERSONAL_BENCHMARK.md
Files changed: PERSONAL_BENCHMARK.md (+41 -12)
@@ -34,14 +34,32 @@
|
|
34 |
| 30 | Expert Question on Harappan Civilization | Expert | Ancient history and archaeology | 98/100 |
|
35 |
| 31 | Latin Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
|
36 |
| 32 | Sanskrit Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
|
37 |
-
| 33 | Japanese Haiku on Mountains | Medium | Poetry, Japanese language | 100/100 |
|
38 |
|
39 |
---
|
40 |
|
|
|
41 |
### **Summary of Anvita’s Scores:**
|
42 |
|
43 |
-
- **Total Tasks Attempted:**
|
44 |
-
- **Average Score:** **
|
45 |
- **Highest Scores:** 100/100 (Multiple Tasks)
|
46 |
- **Lowest Score:** 60/100 (Polygon Permutations)
|
47 |
|
@@ -50,28 +68,39 @@
|
|
50 |
### **Performance Insights by Difficulty Level:**
|
51 |
|
52 |
1. **Easy Tasks:**
|
53 |
-
- **
|
54 |
- Example: Anthropology, Decimal Comparison, Creative Writing Task.
|
55 |
|
56 |
2. **Medium Tasks:**
|
57 |
-
- **Strong performance**, with an average score of
|
58 |
- Example: Japanese Haiku, Medical Diagnosis, Path Uniqueness Problem.
|
59 |
|
60 |
3. **Hard Tasks:**
|
61 |
-
- **Solid performance**, though some variability
|
62 |
-
- Example: Murder Mysteries, Harappan Civilization Question
|
63 |
|
64 |
4. **Challenging Tasks:**
|
65 |
-
- **
|
66 |
- Example: Vigenère Cipher, AI Cryptic Puzzle, Symbolic Reasoning.
|
67 |
|
68 |
5. **Expert Tasks:**
|
69 |
-
- **Room for improvement** in **mathematics and combinatorial reasoning
|
70 |
- Example: Polygon Permutations, Graph Theory Scheduling.
|
71 |
|
72 |
-
|
73 |
|
74 |
-
|
75 |
|
76 |
-
Anvita’s **performance is exceptional across diverse domains**, showcasing **creativity, logical reasoning, linguistic versatility, and domain knowledge**. She **excels in creative writing and language-based tasks**, performs **consistently well in puzzles and ciphers**, and only shows minor **challenges with expert-level math problems**.
|
77 |
|
|
|
34 |
| 30 | Expert Question on Harappan Civilization | Expert | Ancient history and archaeology | 98/100 |
|
35 |
| 31 | Latin Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
|
36 |
| 32 | Sanskrit Ad for a Solar Toothbrush | Hard | Advertising, language creativity | 95/100 |
|
37 |
+
| 33 | Japanese Haiku on Mountains | Medium | Poetry, Japanese language | 100/100 |
|
38 |
+
| 34 | Widget Production Calculation | Simple | Math Puzzle | 95/100 |
|
39 |
+
| 35 | Rectangle Area and Perimeter | Complex | Math Puzzle | 90/100 |
|
40 |
+
| 36 | Gauge Block Thermal Expansion and Uncertainty | Complex | Metrology | 85/100 |
|
41 |
+
| 37 | Steel Rod Thermal Expansion and Relaxation | Deceptive | Metrology | 80/100 |
|
42 |
+
| 38 | Meningitis Diagnosis | Medium | Medical (Diagnosis) | 90/100 |
|
43 |
+
| 39 | Farmer's Fence and Gate | Deceptive | Word Problem | 95/100 |
|
44 |
+
| 40 | C++ Pointer Behavior | Medium | Computer Science | 90/100 |
|
45 |
+
| 41 | Futuristic Emotion Dealer (Sci-Fi) | Easy | Creative Writing | 95/100 |
|
46 |
+
| 42 | Cursed Knight's Quest (Dark Fantasy) | Easy | Creative Writing | 90/100 |
|
47 |
+
| 43 | Mad Scientist and Foresight (Sci-Fi) | Easy | Creative Writing | 90/100 |
|
48 |
+
| 44 | Ideal Gas Partition Function and Average Energy | Medium | Statistical Mechanics | 95/100 |
|
49 |
+
| 45 | Torus vs. Sphere Deformation | Complex | Algebraic Topology | 80/100 |
|
50 |
+
| 46 | Missing Jewels Heist | Complex | General Reasoning | 85/100 |
|
51 |
+
| 47 | Vanishing Violinist | Complex | Detective Conan Puzzle | 75-85/100 |
|
52 |
+
| 48 | Counterfeit Cleat Sabotage | Complex | James Bond/Conan Mashup | 70-80/100 |
|
53 |
+
| 49 | Ancient Cipher | Challenging | Symbolic Reasoning | 70-95/100 |
|
54 |
+
| 50 | AlphaZero’s Knight vs. Bishop Puzzle | Medium | Game Theory & Reinforcement Learning | 88/100 |
|
55 |
|
56 |
---
|
57 |
|
58 |
+
|
59 |
### **Summary of Anvita’s Scores:**
|
60 |
|
61 |
+
- **Total Tasks Attempted:** 50
|
62 |
+
- **Average Score:** **83.86/100**
|
63 |
- **Highest Scores:** 100/100 (Multiple Tasks)
|
64 |
- **Lowest Score:** 60/100 (Polygon Permutations)
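
The summary does not say how range scores such as `75-85/100` enter the average. A minimal sketch of one way to compute it, assuming each range is counted at its midpoint (an assumption, not something the file confirms); `parse_score` and `average_score` are hypothetical helper names, and the sample covers only rows 46-50, so it is not expected to reproduce the overall figure:

```python
def parse_score(cell: str) -> float:
    """Convert a score cell such as '95/100' or '75-85/100' to a number.

    Assumption: a range is counted at its midpoint.
    """
    value, _, scale = cell.strip().partition("/")
    if scale.strip() != "100":
        raise ValueError(f"expected a score out of 100, got {cell!r}")
    if "-" in value:
        low, high = (float(part) for part in value.split("-", 1))
        return (low + high) / 2
    return float(value)


def average_score(cells: list[str]) -> float:
    """Mean of the parsed scores, rounded to two decimals."""
    scores = [parse_score(cell) for cell in cells]
    return round(sum(scores) / len(scores), 2)


# Score cells of rows 46-50 in the table above.
visible = ["85/100", "75-85/100", "70-80/100", "70-95/100", "88/100"]
print(average_score(visible))
```

Counting a range at its lower or upper bound instead would shift the result, so the chosen convention is worth stating next to the reported average.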
|
65 |
|
|
|
68 |
### **Performance Insights by Difficulty Level:**
|
69 |
|
70 |
1. **Easy Tasks:**
|
71 |
+
- **Excellent performance**, with an average score of **94/100**.
|
72 |
- Example: Anthropology, Decimal Comparison, Creative Writing Task.
|
73 |
|
74 |
2. **Medium Tasks:**
|
75 |
+
- **Strong performance**, with an average score of **86.46/100**.
|
76 |
- Example: Japanese Haiku, Medical Diagnosis, Path Uniqueness Problem.
|
77 |
|
78 |
3. **Hard Tasks:**
|
79 |
+
- **Solid performance**, averaging **81.27/100**, though with some variability.
|
80 |
+
- Example: Murder Mysteries, Harappan Civilization Question.
|
81 |
|
82 |
4. **Challenging Tasks:**
|
83 |
+
- **Good adaptability**, with an average score of **77.25/100**.
|
84 |
- Example: Vigenère Cipher, AI Cryptic Puzzle, Symbolic Reasoning.
|
85 |
|
86 |
5. **Expert Tasks:**
|
87 |
+
- **Room for improvement**, averaging **77.6/100** in **advanced mathematics and combinatorial reasoning**.
|
88 |
- Example: Polygon Permutations, Graph Theory Scheduling.
|
89 |
|
90 |
+
6. **Complex Tasks:**
|
91 |
+
- **Stable performance**, averaging **80.83/100**.
|
92 |
+
- Example: Gauge Block Thermal Expansion, Algebraic Topology.
|
93 |
+
|
94 |
+
7. **Deceptive Tasks:**
|
95 |
+
- **Exceptional problem-solving**, with an average score of **87.5/100**.
|
96 |
+
- Example: Steel Rod Thermal Expansion, Farmer's Fence and Gate.
|
97 |
+
|
98 |
+
8. **Simple Tasks:**
|
99 |
+
- **Near-perfect performance**, with an average score of **95/100**.
|
100 |
+
- Example: Widget Production Calculation.
|
101 |
|
102 |
+
9. **Very Hard Tasks:**
|
103 |
+
- **Challenging area**, with an average score of **65/100**, suggesting opportunities for growth.
|
104 |
+
- Example: Table Tennis Scheduling Problem.
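
The per-difficulty averages above can be recomputed by grouping table rows by their difficulty label. A minimal sketch, using only rows 34-43 from the table; `average_by_difficulty` is a hypothetical helper name. On this sample the Simple and Deceptive results agree with the figures reported above (95 and 87.5), while the other levels differ because they also include tasks not listed in this excerpt:

```python
from collections import defaultdict

# (difficulty, score) pairs for rows 34-43 of the table.
rows = [
    ("Simple", 95),     # 34 Widget Production Calculation
    ("Complex", 90),    # 35 Rectangle Area and Perimeter
    ("Complex", 85),    # 36 Gauge Block Thermal Expansion and Uncertainty
    ("Deceptive", 80),  # 37 Steel Rod Thermal Expansion and Relaxation
    ("Medium", 90),     # 38 Meningitis Diagnosis
    ("Deceptive", 95),  # 39 Farmer's Fence and Gate
    ("Medium", 90),     # 40 C++ Pointer Behavior
    ("Easy", 95),       # 41 Futuristic Emotion Dealer
    ("Easy", 90),       # 42 Cursed Knight's Quest
    ("Easy", 90),       # 43 Mad Scientist and Foresight
]


def average_by_difficulty(pairs):
    """Return {difficulty: mean score rounded to two decimals}."""
    groups = defaultdict(list)
    for difficulty, score in pairs:
        groups[difficulty].append(score)
    return {d: round(sum(s) / len(s), 2) for d, s in groups.items()}


averages = average_by_difficulty(rows)
for difficulty, avg in averages.items():
    print(f"{difficulty}: {avg}/100")
```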
|
105 |
|
|
|
106 |
|